Nemotron 3 Ultra model ID, context window & pricing
nemotron
Quick facts
Model ID nvidia/nemotron-3-ultra-550b-a55b
Source Vercel AI Gateway
Context Window 1000000
Pricing $0.60 input / $2.40 output per 1M tokens
Capabilities tool calling, reasoning, temperature control
Model overview
Nemotron 3 Ultra is an AI model from Vercel AI Gateway with 1000000 token context window and text input support.
Published pricing is $0.60 input and $2.40 output per 1M tokens.
- Workloads that use text inputs with text outputs.
- Agent and tool workflows that need function calling.
- Reasoning-heavy prompts where stepwise problem solving matters.
Model ID nvidia/nemotron-3-ultra-550b-a55b
Provider Vercel AI Gateway
Family nemotron
Status -
Knowledge Cutoff -
Release Date 2026-06-04
Input Modalities text
Output Modalities text
Context Window 1000000
Input Limit -
Output Limit 65000
Tool Calling Yes
Reasoning Yes
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.60
Output Cost / 1M tokens $2.40
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.12
Cache Write Cost / 1M tokens -