GLM-4.6V Flash model ID, context window & pricing
glm
Quick facts
Model ID glm-4.6v-flash
Source LLM Gateway
Context Window 128000
Pricing -
Capabilities tool calling, reasoning, structured output, temperature control, open weights
Model overview
GLM-4.6V Flash is an AI model from LLM Gateway with 128000 token context window and text, image input support.
Public token pricing is not listed for this model in the current catalog source.
- Workloads that use text, image inputs with text outputs.
- Agent and tool workflows that need function calling.
- Reasoning-heavy prompts where stepwise problem solving matters.
Model ID glm-4.6v-flash
Provider LLM Gateway
Family glm
Status beta
Knowledge Cutoff -
Release Date 2025-12-08
Input Modalities text, image
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 16000
Tool Calling Yes
Reasoning Yes
Structured Output Yes
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -