whichllm — Browse and compare AI model specs and pricing

Cloudflare AI Gateway

Llama 4 Scout 17B 16E Instruct

llama

Model ID workers-ai/@cf/meta/llama-4-scout-17b-16e-instruct
Provider Cloudflare AI Gateway
Family llama
Status -
Knowledge Cutoff -
Release Date 2025-04-16
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 16384
Tool Calling No
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.27
Output Cost / 1M tokens $0.85
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -