GLM-4.7-Flash model ID, context window & pricing
glm
Quick facts
Model ID zai-org/glm-4.7-flash
Source NovitaAI
Context Window 200000
Pricing $0.07 input / $0.40 output per 1M tokens
Capabilities tool calling, reasoning, structured output, temperature control, open weights
Model overview
GLM-4.7-Flash is an AI model from NovitaAI with 200000 token context window and text input support.
Published pricing is $0.07 input and $0.40 output per 1M tokens.
- Workloads that use text inputs with text outputs.
- Agent and tool workflows that need function calling.
- Reasoning-heavy prompts where stepwise problem solving matters.
Model ID zai-org/glm-4.7-flash
Provider NovitaAI
Family glm
Status -
Knowledge Cutoff 2025-04
Release Date 2026-01-19
Input Modalities text
Output Modalities text
Context Window 200000
Input Limit -
Output Limit 128000
Tool Calling Yes
Reasoning Yes
Structured Output Yes
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.07
Output Cost / 1M tokens $0.40
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.01
Cache Write Cost / 1M tokens -