GLM-4.5-Flash model ID, context window & pricing
glm-flash
Quick facts
Model ID glm-4.5-flash
Source LLM Gateway
Context Window 131072
Pricing -
Capabilities tool calling, reasoning, temperature control
Model overview
GLM-4.5-Flash is an AI model from LLM Gateway with 131072 token context window and text input support.
Public token pricing is not listed for this model in the current catalog source.
- Workloads that use text inputs with text outputs.
- Agent and tool workflows that need function calling.
- Reasoning-heavy prompts where stepwise problem solving matters.
Model ID glm-4.5-flash
Provider LLM Gateway
Family glm-flash
Status -
Knowledge Cutoff 2025-04
Release Date 2025-07-28
Input Modalities text
Output Modalities text
Context Window 131072
Input Limit -
Output Limit 98304
Tool Calling Yes
Reasoning Yes
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.00
Cache Write Cost / 1M tokens $0.00