Quick facts

Model ID z-ai/glm-4.7-flash

Source OpenRouter

Context Window 202752

Pricing $0.06 input / $0.40 output per 1M tokens

Capabilities tool calling, reasoning, structured output, temperature control, open weights

Model overview

GLM-4.7-Flash is an AI model from OpenRouter with 202752 token context window and text input support.

Published pricing is $0.06 input and $0.40 output per 1M tokens.

Model ID z-ai/glm-4.7-flash

Provider OpenRouter

Family glm-flash

Status -

Knowledge Cutoff 2025-04

Release Date 2026-01-19

Input Modalities text

Output Modalities text

Context Window 202752

Input Limit -

Output Limit 16384

Tool Calling Yes

Reasoning Yes

Structured Output Yes

Temperature Control Yes

Open Weights Yes

Input Cost / 1M tokens $0.06

Output Cost / 1M tokens $0.40

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens $0.01

Cache Write Cost / 1M tokens -