whichllm — Browse and compare AI model specs and pricing

GLM 4.7 Flash Heretic

Model ID: olafangensan-glm-4.7-flash-heretic
Provider: Venice AI
Family: glm-flash
Status: -
Knowledge Cutoff: -
Release Date: 2026-02-04
Input Modalities: text
Output Modalities: text
Context Window: 200,000 tokens
Input Limit: -
Output Limit: 24,000 tokens
Tool Calling: Yes
Reasoning: Yes
Structured Output: Yes
Temperature Control: Yes
Open Weights: Yes
Input Cost / 1M tokens: $0.14
Output Cost / 1M tokens: $0.80
Reasoning Cost / 1M tokens: -
Cache Read Cost / 1M tokens: -
Cache Write Cost / 1M tokens: -
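
As a rough illustration of how the per-token prices and limits listed above combine, here is a minimal Python sketch that estimates the cost of a single request. The function name `estimate_cost` and the constant names are hypothetical, chosen for this example only. It also assumes the context window covers input and output tokens together, which the listing does not state, and it ignores reasoning and cache costs because the listing gives no figures for them.

```python
from decimal import Decimal

# Figures copied from the listing above (USD per 1M tokens, limits in tokens).
INPUT_COST_PER_MILLION = Decimal("0.14")
OUTPUT_COST_PER_MILLION = Decimal("0.80")
CONTEXT_WINDOW = 200_000
OUTPUT_LIMIT = 24_000


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for one request.

    Hypothetical helper: reasoning and cache costs are not listed,
    so they are left out of the estimate.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_tokens > OUTPUT_LIMIT:
        raise ValueError("output exceeds the listed output limit")
    # Assumption: the context window is shared by input and output tokens.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the listed context window")

    million = Decimal(1_000_000)
    input_cost = INPUT_COST_PER_MILLION * input_tokens / million
    output_cost = OUTPUT_COST_PER_MILLION * output_tokens / million
    return input_cost + output_cost


if __name__ == "__main__":
    # 50,000 input tokens and 2,000 output tokens.
    print(f"Estimated cost: ${estimate_cost(50_000, 2_000)}")
```

For 50,000 input tokens and 2,000 output tokens this works out to 0.0070 + 0.0016 = 0.0086 USD. `Decimal` is used instead of floats so that small per-token amounts add up without binary rounding noise.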