NVIDIA Nemotron Cascade 2 model ID, context window & pricing
nemotron
Quick facts
Model ID nvidia/Nemotron-Cascade-2-30B-A3B
Source Vultr
Context Window 262144
Pricing $0.15 input / $0.60 output per 1M tokens
Capabilities tool calling, reasoning, temperature control, open weights
Model overview
NVIDIA Nemotron Cascade 2 is an AI model from Vultr with 262144 token context window and text input support.
Published pricing is $0.15 input and $0.60 output per 1M tokens.
- Workloads that use text inputs with text outputs.
- Agent and tool workflows that need function calling.
- Reasoning-heavy prompts where stepwise problem solving matters.
Model ID nvidia/Nemotron-Cascade-2-30B-A3B
Provider Vultr
Family nemotron
Status -
Knowledge Cutoff 2024-07
Release Date 2025-12-01
Input Modalities text
Output Modalities text
Context Window 262144
Input Limit -
Output Limit 131072
Tool Calling Yes
Reasoning Yes
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.15
Output Cost / 1M tokens $0.60
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -