Llama-4-Maverick-17B-128E-Instruct-FP8 model ID, context window & pricing
llama
Quick facts
Model ID llama-4-maverick-17b-128e-instruct-fp8
Source Llama
Context Window 128000
Pricing -
Capabilities tool calling, temperature control, open weights
Model overview
Llama-4-Maverick-17B-128E-Instruct-FP8 is an AI model from Llama with 128000 token context window and text, image input support.
Public token pricing is not listed for this model in the current catalog source.
- Workloads that use text, image inputs with text outputs.
- Agent and tool workflows that need function calling.
Model ID llama-4-maverick-17b-128e-instruct-fp8
Provider Llama
Family llama
Status -
Knowledge Cutoff 2024-08
Release Date 2025-04-05
Input Modalities text, image
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 4096
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -