Quick facts

Model ID llama-4-maverick-17b-128e-instruct-fp8

Source Llama

Context Window 128000

Pricing -

Capabilities tool calling, temperature control, open weights

Model overview

Llama-4-Maverick-17B-128E-Instruct-FP8 is an AI model from Llama with 128000 token context window and text, image input support.

Public token pricing is not listed for this model in the current catalog source.

Model ID llama-4-maverick-17b-128e-instruct-fp8

Provider Llama

Family llama

Status -

Knowledge Cutoff 2024-08

Release Date 2025-04-05

Input Modalities text, image

Output Modalities text

Context Window 128000

Input Limit -

Output Limit 4096

Tool Calling Yes

Reasoning No

Structured Output -

Temperature Control Yes

Open Weights Yes

Input Cost / 1M tokens -

Output Cost / 1M tokens -

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens -

Cache Write Cost / 1M tokens -