whichllm — Browse and compare AI model specs and pricing

LLM Gateway

GLM-4.5-Flash model ID, context window & pricing

glm-flash

Quick facts

Model ID glm-4.5-flash
Source LLM Gateway
Context Window 131072
Pricing -
Capabilities tool calling, reasoning, temperature control

Model overview

GLM-4.5-Flash is an AI model from LLM Gateway with 131072 token context window and text input support.

Public token pricing is not listed for this model in the current catalog source.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
  • Reasoning-heavy prompts where stepwise problem solving matters.
Model ID glm-4.5-flash
Provider LLM Gateway
Family glm-flash
Status -
Knowledge Cutoff 2025-04
Release Date 2025-07-28
Input Modalities text
Output Modalities text
Context Window 131072
Input Limit -
Output Limit 98304
Tool Calling Yes
Reasoning Yes
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.00
Cache Write Cost / 1M tokens $0.00