Llama 3.1 8B Instruct FP8 model ID, context window & pricing
llama
Quick facts
Model ID workers-ai/@cf/meta/llama-3.1-8b-instruct-fp8
Source Cloudflare AI Gateway
Context Window 128000
Pricing $0.15 input / $0.29 output per 1M tokens
Capabilities temperature control
Model overview
Llama 3.1 8B Instruct FP8 is an AI model from Cloudflare AI Gateway with 128000 token context window and text input support.
Published pricing is $0.15 input and $0.29 output per 1M tokens.
- Workloads that use text inputs with text outputs.
Model ID workers-ai/@cf/meta/llama-3.1-8b-instruct-fp8
Provider Cloudflare AI Gateway
Family llama
Status -
Knowledge Cutoff -
Release Date 2025-04-03
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 16384
Tool Calling No
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.15
Output Cost / 1M tokens $0.29
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -