whichllm — Browse and compare AI model specs and pricing

Vertex

Gemini 3.1 Flash Lite model ID, context window & pricing

gemini-flash-lite

Quick facts

Model ID gemini-3.1-flash-lite
Source Vertex
Context Window 1048576
Pricing $0.25 input / $1.50 output per 1M tokens
Capabilities tool calling, reasoning, structured output, temperature control

Model overview

Gemini 3.1 Flash Lite is an AI model from Vertex with 1048576 token context window and text, image, video, audio, pdf input support.

Published pricing is $0.25 input and $1.50 output per 1M tokens.

  • Workloads that use text, image, video, audio, pdf inputs with text outputs.
  • Agent and tool workflows that need function calling.
  • Reasoning-heavy prompts where stepwise problem solving matters.
Model ID gemini-3.1-flash-lite
Provider Vertex
Family gemini-flash-lite
Status -
Knowledge Cutoff 2025-01
Release Date 2026-05-07
Input Modalities text, image, video, audio, pdf
Output Modalities text
Context Window 1048576
Input Limit -
Output Limit 65536
Tool Calling Yes
Reasoning Yes
Structured Output Yes
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.25
Output Cost / 1M tokens $1.50
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.03
Cache Write Cost / 1M tokens -