Quick facts

Model ID meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8

Source IO.NET

Context Window 430000

Pricing $0.15 input / $0.60 output per 1M tokens

Capabilities tool calling, temperature control, open weights

Model overview

Llama 4 Maverick 17B 128E Instruct is an AI model offered through IO.NET with a 430,000-token context window and support for text and image input.

Published pricing is $0.15 input and $0.60 output per 1M tokens.
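As a rough illustration of how the per-token pricing translates into a per-request cost, the sketch below multiplies token counts by the published rates. The function name `estimate_cost_usd` and the constant names are hypothetical, chosen only for this example; the calculation ignores any cache pricing, and actual billing by the provider may differ.

```python
from decimal import Decimal

# Published rates from this page, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.15")
OUTPUT_USD_PER_M = Decimal("0.60")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request (hypothetical helper, cache pricing ignored)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M
```

For example, a request with 100,000 input tokens and 2,000 output tokens works out to (100,000 × 0.15 + 2,000 × 0.60) / 1,000,000 = $0.0162 under these rates.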

Model ID meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8

Provider IO.NET

Family llama

Status -

Knowledge Cutoff 2024-12

Release Date 2025-01-15

Input Modalities text, image

Output Modalities text

Context Window 430000

Input Limit -

Output Limit 4096

Tool Calling Yes

Reasoning No

Structured Output -

Temperature Control Yes

Open Weights Yes

Input Cost / 1M tokens $0.15

Output Cost / 1M tokens $0.60

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens $0.07

Cache Write Cost / 1M tokens $0.30
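
The context window and output limit listed above can be turned into a simple pre-flight check before sending a request. This is a minimal sketch under one loud assumption: that prompt tokens and generated tokens share the 430,000-token context window. The page lists no separate input limit, so the provider's real behavior may differ. The names `max_prompt_tokens` and `fits` are hypothetical.

```python
# Limits as listed on this page.
CONTEXT_WINDOW = 430_000
OUTPUT_LIMIT = 4_096


def max_prompt_tokens(max_output_tokens: int = OUTPUT_LIMIT) -> int:
    """Largest prompt that leaves room for the requested output.

    ASSUMPTION: input and output tokens share one context window.
    """
    if not 1 <= max_output_tokens <= OUTPUT_LIMIT:
        raise ValueError(f"max_output_tokens must be between 1 and {OUTPUT_LIMIT}")
    return CONTEXT_WINDOW - max_output_tokens


def fits(prompt_tokens: int, max_output_tokens: int = OUTPUT_LIMIT) -> bool:
    """Return True if the prompt plus the requested output fits the window."""
    return 0 <= prompt_tokens <= max_prompt_tokens(max_output_tokens)
```

With the full 4,096-token output reserved, this leaves 430,000 − 4,096 = 425,904 tokens for the prompt. Token counts depend on the model's tokenizer, so a caller would need to measure prompts with the matching tokenizer rather than estimate from character counts.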