whichllm — Browse and compare AI model specs and pricing

IO.NET

Llama 4 Maverick 17B 128E Instruct model ID, context window & pricing

llama

Quick facts

Model ID meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Source IO.NET
Context Window 430000
Pricing $0.15 input / $0.60 output per 1M tokens
Capabilities tool calling, temperature control, open weights

Model overview

Llama 4 Maverick 17B 128E Instruct is an AI model from IO.NET with 430000 token context window and text, image input support.

Published pricing is $0.15 input and $0.60 output per 1M tokens.

  • Workloads that use text, image inputs with text outputs.
  • Agent and tool workflows that need function calling.
Model ID meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Provider IO.NET
Family llama
Status -
Knowledge Cutoff 2024-12
Release Date 2025-01-15
Input Modalities text, image
Output Modalities text
Context Window 430000
Input Limit -
Output Limit 4096
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.15
Output Cost / 1M tokens $0.60
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.07
Cache Write Cost / 1M tokens $0.30