GLM 5 Turbo model ID, context window & pricing
glm
Quick facts
Model ID z-ai-glm-5-turbo
Source Venice AI
Context Window 200000
Pricing $1.20 input / $4.00 output per 1M tokens
Capabilities tool calling, reasoning, structured output, temperature control, open weights
Model overview
GLM 5 Turbo is an AI model from Venice AI with 200000 token context window and text input support.
Published pricing is $1.20 input and $4.00 output per 1M tokens.
- Workloads that use text inputs with text outputs.
- Agent and tool workflows that need function calling.
- Reasoning-heavy prompts where stepwise problem solving matters.
Model ID z-ai-glm-5-turbo
Provider Venice AI
Family glm
Status -
Knowledge Cutoff -
Release Date 2026-03-15
Input Modalities text
Output Modalities text
Context Window 200000
Input Limit -
Output Limit 32768
Tool Calling Yes
Reasoning Yes
Structured Output Yes
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $1.20
Output Cost / 1M tokens $4.00
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.24
Cache Write Cost / 1M tokens -