whichllm — Browse and compare AI model specs and pricing

Deep Infra

Llama 4 Maverick 17B FP8

Model ID meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Provider Deep Infra
Family llama
Status -
Knowledge Cutoff -
Release Date 2025-04-05
Input Modalities text, image
Output Modalities text
Context Window 1,000,000 tokens
Input Limit -
Output Limit 16,384 tokens
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control -
Open Weights Yes
Input Cost / 1M tokens $0.15
Output Cost / 1M tokens $0.60
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -
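
Per-million-token prices can be hard to read at the scale of a single request, so here is a minimal sketch of the arithmetic. It uses only the numbers listed above ($0.15 input, $0.60 output, 1,000,000-token context window, 16,384-token output limit). The helper name `estimate_cost` is hypothetical and not part of any Deep Infra or whichllm API. Treating the context window as a cap on input plus output combined is an assumption, since the listing gives no separate input limit.

```python
from decimal import Decimal

# Values copied from the listing above (USD per 1M tokens, limits in tokens).
INPUT_COST_PER_M = Decimal("0.15")
OUTPUT_COST_PER_M = Decimal("0.60")
CONTEXT_WINDOW = 1_000_000
OUTPUT_LIMIT = 16_384


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimated USD cost of one request."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_tokens > OUTPUT_LIMIT:
        raise ValueError("output exceeds the listed output limit")
    # Assumption: the context window bounds input and output together.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the listed context window")
    total = INPUT_COST_PER_M * input_tokens + OUTPUT_COST_PER_M * output_tokens
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # 200,000 input tokens and 4,000 output tokens.
    print(estimate_cost(200_000, 4_000))
```

With these prices, output tokens cost four times as much as input tokens, but the output limit keeps the output share of a long-context request small: a request that fills the whole window is dominated by input cost. Cache and reasoning costs are not listed, so the sketch leaves them out.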