whichllm — Browse and compare AI model specs and pricing

Nebius Token Factory

Nemotron-3-Nano-Omni model ID, context window & pricing

models.dev synced record

Quick facts

Model ID nvidia/Nemotron-3-Nano-Omni
Source Nebius Token Factory
Context Window 65536
Pricing $0.06 input / $0.24 output per 1M tokens
Capabilities tool calling, reasoning, structured output, temperature control, open weights

Model overview

Nemotron-3-Nano-Omni is an AI model from Nebius Token Factory with 65536 token context window and text input support.

Published pricing is $0.06 input and $0.24 output per 1M tokens.

  • Workloads that use text inputs with text outputs.
  • Agent and tool workflows that need function calling.
  • Reasoning-heavy prompts where stepwise problem solving matters.
Model ID nvidia/Nemotron-3-Nano-Omni
Provider Nebius Token Factory
Family -
Status -
Knowledge Cutoff 2025-01
Release Date 2025-01-20
Input Modalities text
Output Modalities text
Context Window 65536
Input Limit 60000
Output Limit 8192
Tool Calling Yes
Reasoning Yes
Structured Output Yes
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens $0.06
Output Cost / 1M tokens $0.24
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens $0.01
Cache Write Cost / 1M tokens $0.07