Llama 2 7B Chat FP16 model ID, context window & pricing
llama
Quick facts
Model ID workers-ai/@cf/meta/llama-2-7b-chat-fp16
Source Cloudflare AI Gateway
Context Window 128000
Pricing $0.56 input / $6.67 output per 1M tokens
Capabilities temperature control
Model overview
Llama 2 7B Chat FP16 is an AI model from Cloudflare AI Gateway with 128000 token context window and text input support.
Published pricing is $0.56 input and $6.67 output per 1M tokens.
- Workloads that use text inputs with text outputs.
Model ID workers-ai/@cf/meta/llama-2-7b-chat-fp16
Provider Cloudflare AI Gateway
Family llama
Status -
Knowledge Cutoff -
Release Date 2025-04-03
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 16384
Tool Calling No
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.56
Output Cost / 1M tokens $6.67
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -