whichllm — Browse and compare AI model specs and pricing

LLM Gateway

Qwen3 4B FP8

qwen

Model ID qwen3-4b-fp8
Provider LLM Gateway
Family qwen
Status -
Knowledge Cutoff -
Release Date 2025-04-28
Input Modalities text
Output Modalities text
Context Window 131072
Input Limit -
Output Limit 8192
Tool Calling Yes
Reasoning Yes
Structured Output Yes
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.03
Output Cost / 1M tokens $0.05
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -