Llama 3 8B Instruct AWQ model ID, context window & pricing
llama
Quick facts
Model ID workers-ai/@cf/meta/llama-3-8b-instruct-awq
Source Cloudflare AI Gateway
Context Window 128000
Pricing $0.12 input / $0.27 output per 1M tokens
Capabilities temperature control
Model overview
Llama 3 8B Instruct AWQ is an AI model from Cloudflare AI Gateway with 128000 token context window and text input support.
Published pricing is $0.12 input and $0.27 output per 1M tokens.
- Workloads that use text inputs with text outputs.
Model ID workers-ai/@cf/meta/llama-3-8b-instruct-awq
Provider Cloudflare AI Gateway
Family llama
Status -
Knowledge Cutoff -
Release Date 2025-04-03
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 16384
Tool Calling No
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.12
Output Cost / 1M tokens $0.27
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -