whichllm — Browse and compare AI model specs and pricing

Cloudflare AI Gateway

Llama 3 8B Instruct AWQ

llama

Model ID workers-ai/@cf/meta/llama-3-8b-instruct-awq
Provider Cloudflare AI Gateway
Family llama
Status -
Knowledge Cutoff -
Release Date 2025-04-03
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 16384
Tool Calling No
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.12
Output Cost / 1M tokens $0.27
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -