GLM 5.1 model ID, context window & pricing
glm
Quick facts
Model ID zai-org/GLM-5.1-FP8
Source Inceptron
Context Window 202752
Pricing $1.40 input / $4.40 output per 1M tokens
Capabilities tool calling, reasoning, structured output, temperature control
Model overview
GLM 5.1 is an AI model from Inceptron with 202752 token context window and text input support.
Published pricing is $1.40 input and $4.40 output per 1M tokens.
- Workloads that use text inputs with text outputs.
- Agent and tool workflows that need function calling.
- Reasoning-heavy prompts where stepwise problem solving matters.
Model ID zai-org/GLM-5.1-FP8
Provider Inceptron
Family glm
Status -
Knowledge Cutoff -
Release Date 2026-03-27
Input Modalities text
Output Modalities text
Context Window 202752
Input Limit -
Output Limit 202752
Tool Calling Yes
Reasoning Yes
Structured Output Yes
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $1.40
Output Cost / 1M tokens $4.40
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.26
Cache Write Cost / 1M tokens $0.00