whichllm — Browse and compare AI model specs and pricing

Cloudflare Workers AI

Llama 4 Scout 17B 16E Instruct model ID, context window & pricing

llama

Quick facts

Model ID @cf/meta/llama-4-scout-17b-16e-instruct
Source Cloudflare Workers AI
Context Window 131000
Pricing $0.27 input / $0.85 output per 1M tokens
Capabilities tool calling, temperature control, open weights

Model overview

Llama 4 Scout 17B 16E Instruct is an AI model from Cloudflare Workers AI with 131000 token context window and text, image input support.

Published pricing is $0.27 input and $0.85 output per 1M tokens.

  • Workloads that use text, image inputs with text outputs.
  • Agent and tool workflows that need function calling.
Model ID @cf/meta/llama-4-scout-17b-16e-instruct
Provider Cloudflare Workers AI
Family llama
Status -
Knowledge Cutoff 2024-08
Release Date 2025-04-05
Input Modalities text, image
Output Modalities text
Context Window 131000
Input Limit -
Output Limit 16384
Tool Calling Yes
Reasoning No
Structured Output No
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.27
Output Cost / 1M tokens $0.85
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -