whichllm — Browse and compare AI model specs and pricing

Fireworks AI

GLM 5.1 Fast model ID, context window & pricing

glm

Quick facts

Model ID accounts/fireworks/routers/glm-5p1-fast
Source Fireworks AI
Context Window 202800
Pricing $2.80 input / $8.80 output per 1M tokens
Capabilities tool calling, reasoning, temperature control, open weights

Model overview

GLM 5.1 Fast is an AI model from Fireworks AI with 202800 token context window and text input support.

Published pricing is $2.80 input and $8.80 output per 1M tokens.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
  • Reasoning-heavy prompts where stepwise problem solving matters.
Model ID accounts/fireworks/routers/glm-5p1-fast
Provider Fireworks AI
Family glm
Status -
Knowledge Cutoff -
Release Date 2026-04-01
Input Modalities text
Output Modalities text
Context Window 202800
Input Limit -
Output Limit 131072
Tool Calling Yes
Reasoning Yes
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $2.80
Output Cost / 1M tokens $8.80
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.52
Cache Write Cost / 1M tokens -