whichllm — Browse and compare AI model specs and pricing

Inference

Google Gemma 3

gemma

Model ID google/gemma-3
Provider Inference
Family gemma
Status -
Knowledge Cutoff 2024-12
Release Date 2025-01-01
Input Modalities text, image
Output Modalities text
Context Window 125000
Input Limit -
Output Limit 4096
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.15
Output Cost / 1M tokens $0.30
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -