whichllm — Browse and compare AI model specs and pricing

Kilo Gateway

Google: Gemini 3.1 Flash Lite model ID, context window & pricing

models.dev synced record

Quick facts

Model ID google/gemini-3.1-flash-lite
Source Kilo Gateway
Context Window 1048576
Pricing $0.25 input / $1.50 output per 1M tokens
Capabilities tool calling, reasoning, temperature control

Model overview

Google: Gemini 3.1 Flash Lite is an AI model from Kilo Gateway with 1048576 token context window and audio, image, pdf, text, video input support.

Published pricing is $0.25 input and $1.50 output per 1M tokens.

  • Workloads that use audio, image, pdf, text, video inputs with text outputs.
  • Agent and tool workflows that need function calling.
  • Reasoning-heavy prompts where stepwise problem solving matters.
Model ID google/gemini-3.1-flash-lite
Provider Kilo Gateway
Family -
Status -
Knowledge Cutoff -
Release Date 2026-05-07
Input Modalities audio, image, pdf, text, video
Output Modalities text
Context Window 1048576
Input Limit -
Output Limit 65536
Tool Calling Yes
Reasoning Yes
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.25
Output Cost / 1M tokens $1.50
Reasoning Cost / 1M tokens $1.50
Cache Read Cost / 1M tokens $0.03
Cache Write Cost / 1M tokens $0.08