Quick facts

Model ID nvidia/Nemotron-3-Nano-Omni

Source Nebius Token Factory

Context Window 65536

Pricing $0.06 input / $0.24 output per 1M tokens

Capabilities tool calling, reasoning, structured output, temperature control, open weights

Model overview

Nemotron-3-Nano-Omni is an NVIDIA Nemotron-family model served through Nebius Token Factory. It has a 65536-token context window and accepts text input.

Published pricing is $0.06 per 1M input tokens and $0.24 per 1M output tokens.
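As a rough illustration of what the published rates mean per request, the sketch below multiplies token counts by the per-million prices. The function and constant names are hypothetical, and the calculation ignores cache pricing and any provider-side rounding or minimum charges.

```python
# Published rates from this page, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.06
OUTPUT_USD_PER_MILLION = 0.24


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request (hypothetical helper, not a provider API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # A 50,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(50_000, 2_000):.5f}")
```

Under these assumptions, output tokens cost four times as much as input tokens, so long completions tend to dominate the bill even when prompts are large.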

Model ID nvidia/Nemotron-3-Nano-Omni

Provider Nebius Token Factory

Family nemotron

Status -

Knowledge Cutoff 2025-01

Release Date 2025-01-20

Input Modalities text

Output Modalities text

Context Window 65536

Input Limit 60000

Output Limit 8192
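The three limits above interact: a prompt at the full input limit leaves less than the full output limit inside the context window. The sketch below shows one way to compute the room left for a completion. It assumes the context window is shared between prompt and completion tokens, which this page does not state, and the helper name is hypothetical.

```python
# Limits listed on this page, in tokens.
CONTEXT_WINDOW = 65536
INPUT_LIMIT = 60000
OUTPUT_LIMIT = 8192


def max_completion_tokens(prompt_tokens: int) -> int:
    """Return the largest completion that still fits (hypothetical helper).

    Assumes prompt and completion tokens share the context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens > INPUT_LIMIT:
        raise ValueError(f"prompt exceeds the input limit of {INPUT_LIMIT} tokens")
    # The completion is capped by both the output limit and the remaining window.
    return min(OUTPUT_LIMIT, CONTEXT_WINDOW - prompt_tokens)


if __name__ == "__main__":
    for prompt in (1_000, 57_344, 60_000):
        print(prompt, max_completion_tokens(prompt))
```

With these numbers, prompts up to 57344 tokens leave the full 8192-token output budget, while a prompt at the 60000-token input limit leaves 5536 tokens.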

Tool Calling Yes

Reasoning Yes

Structured Output Yes

Temperature Control Yes

Open Weights Yes

Input Cost / 1M tokens $0.06

Output Cost / 1M tokens $0.24

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens $0.01

Cache Write Cost / 1M tokens $0.07
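To see how the cache rates might change a bill, the sketch below splits prompt tokens into cache reads, cache writes, and uncached input. The billing model is an assumption for illustration only: it treats each prompt token as billed at exactly one of the three rates, which this page does not confirm. All names are hypothetical.

```python
# Rates listed on this page, in USD per 1M tokens.
RATES_USD_PER_MILLION = {
    "input": 0.06,
    "output": 0.24,
    "cache_read": 0.01,
    "cache_write": 0.07,
}


def estimate_cached_cost_usd(uncached_input: int, cache_read: int,
                             cache_write: int, output: int) -> float:
    """Estimate request cost under an assumed one-rate-per-token billing model."""
    counts = {
        "input": uncached_input,
        "output": output,
        "cache_read": cache_read,
        "cache_write": cache_write,
    }
    if any(n < 0 for n in counts.values()):
        raise ValueError("token counts must be non-negative")
    total = sum(counts[k] * RATES_USD_PER_MILLION[k] for k in counts)
    return total / 1_000_000


if __name__ == "__main__":
    # 40,000 prompt tokens served from cache, 10,000 uncached, 2,000 output.
    print(f"${estimate_cached_cost_usd(10_000, 40_000, 0, 2_000):.5f}")
```

Under this assumed model, a cache read is one sixth the price of uncached input, while a cache write costs slightly more than uncached input, so caching would only pay off for prefixes that are reused.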