whichllm — Browse and compare AI model specs and pricing

Vultr

NVIDIA Nemotron 3 Nano Omni model ID, context window & pricing

nemotron

Quick facts

Model ID nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
Source Vultr
Context Window 262144
Pricing $0.13 input / $0.38 output per 1M tokens
Capabilities tool calling, reasoning, temperature control, open weights

Model overview

NVIDIA Nemotron 3 Nano Omni is an AI model from Vultr with 262144 token context window and text input support.

Published pricing is $0.13 input and $0.38 output per 1M tokens.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
  • Reasoning-heavy prompts where stepwise problem solving matters.
Model ID nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16
Provider Vultr
Family nemotron
Status -
Knowledge Cutoff 2025-05
Release Date 2026-04-28
Input Modalities text
Output Modalities text
Context Window 262144
Input Limit -
Output Limit 131072
Tool Calling Yes
Reasoning Yes
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.13
Output Cost / 1M tokens $0.38
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -