Quick facts

Model ID gemini-3.1-flash-lite

Source Vertex

Context Window 1048576

Pricing $0.25 input / $1.50 output per 1M tokens

Capabilities tool calling, reasoning, structured output, temperature control

Model overview

Gemini 3.1 Flash Lite is an AI model from Vertex with 1048576 token context window and text, image, video, audio, pdf input support.

Published pricing is $0.25 input and $1.50 output per 1M tokens.

Model ID gemini-3.1-flash-lite

Provider Vertex

Family gemini-flash-lite

Status -

Knowledge Cutoff 2025-01

Release Date 2026-05-07

Input Modalities text, image, video, audio, pdf

Output Modalities text

Context Window 1048576

Input Limit -

Output Limit 65536

Tool Calling Yes

Reasoning Yes

Structured Output Yes

Temperature Control Yes

Open Weights No

Input Cost / 1M tokens $0.25

Output Cost / 1M tokens $1.50

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens $0.03

Cache Write Cost / 1M tokens -