Quick facts

Model ID z-ai-glm-5-turbo

Source Venice AI

Context Window 200000

Pricing $1.20 input / $4.00 output per 1M tokens

Capabilities tool calling, reasoning, structured output, temperature control, open weights

Model overview

GLM 5 Turbo is an AI model from Venice AI with 200000 token context window and text input support.

Published pricing is $1.20 input and $4.00 output per 1M tokens.

Model ID z-ai-glm-5-turbo

Provider Venice AI

Family glm

Status -

Knowledge Cutoff -

Release Date 2026-03-15

Input Modalities text

Output Modalities text

Context Window 200000

Input Limit -

Output Limit 32768

Tool Calling Yes

Reasoning Yes

Structured Output Yes

Temperature Control Yes

Open Weights Yes

Input Cost / 1M tokens $1.20

Output Cost / 1M tokens $4.00

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens $0.24

Cache Write Cost / 1M tokens -