whichllm — Browse and compare AI model specs and pricing

OpenRouter

GLM-4.7-Flash model ID, context window & pricing

glm-flash

Quick facts

Model ID z-ai/glm-4.7-flash
Source OpenRouter
Context Window 202752
Pricing $0.06 input / $0.40 output per 1M tokens
Capabilities tool calling, reasoning, structured output, temperature control, open weights

Model overview

GLM-4.7-Flash is an AI model from OpenRouter with 202752 token context window and text input support.

Published pricing is $0.06 input and $0.40 output per 1M tokens.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
  • Reasoning-heavy prompts where stepwise problem solving matters.
Model ID z-ai/glm-4.7-flash
Provider OpenRouter
Family glm-flash
Status -
Knowledge Cutoff 2025-04
Release Date 2026-01-19
Input Modalities text
Output Modalities text
Context Window 202752
Input Limit -
Output Limit 16384
Tool Calling Yes
Reasoning Yes
Structured Output Yes
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.06
Output Cost / 1M tokens $0.40
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.01
Cache Write Cost / 1M tokens -