Quick facts

Model ID zai-org/glm-4.7-flash

Source NovitaAI

Context Window 200000

Pricing $0.07 input / $0.40 output per 1M tokens

Capabilities tool calling, reasoning, structured output, temperature control, open weights

Model overview

GLM-4.7-Flash is an AI model from NovitaAI with 200000 token context window and text input support.

Published pricing is $0.07 input and $0.40 output per 1M tokens.

Model ID zai-org/glm-4.7-flash

Provider NovitaAI

Family glm

Status -

Knowledge Cutoff 2025-04

Release Date 2026-01-19

Input Modalities text

Output Modalities text

Context Window 200000

Input Limit -

Output Limit 128000

Tool Calling Yes

Reasoning Yes

Structured Output Yes

Temperature Control Yes

Open Weights Yes

Input Cost / 1M tokens $0.07

Output Cost / 1M tokens $0.40

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens $0.01

Cache Write Cost / 1M tokens -