Llama 3.3 70B Instruct model ID, context window & pricing
llama
Quick facts
Model ID meta/llama-3.3-70b-instruct-maas
Source Vertex
Context Window 128000
Pricing $0.72 input / $0.72 output per 1M tokens
Capabilities tool calling, structured output, temperature control, open weights
Model overview
Llama 3.3 70B Instruct is an AI model from Vertex with 128000 token context window and text input support.
Published pricing is $0.72 input and $0.72 output per 1M tokens.
- Workloads that use text inputs with text outputs.
- Agent and tool workflows that need function calling.
Model ID meta/llama-3.3-70b-instruct-maas
Provider Vertex
Family llama
Status -
Knowledge Cutoff 2023-12
Release Date 2025-04-29
Input Modalities text
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 8192
Tool Calling Yes
Reasoning No
Structured Output Yes
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.72
Output Cost / 1M tokens $0.72
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -