whichllm — Browse and compare AI model specs and pricing

Nvidia

nemotron-voicechat model ID, context window & pricing

models.dev synced record

Quick facts

Model ID nvidia/nemotron-voicechat
Source Nvidia
Context Window 128000
Pricing -
Capabilities tool calling, temperature control, open weights

Model overview

nemotron-voicechat is an AI model from Nvidia with 128000 token context window and text, audio input support.

Public token pricing is not listed for this model in the current catalog source.

  • Workloads that use text, audio inputs with text outputs.
  • Agent and tool workflows that need function calling.
Model ID nvidia/nemotron-voicechat
Provider Nvidia
Family -
Status -
Knowledge Cutoff -
Release Date 2026-03-16
Input Modalities text, audio
Output Modalities text
Context Window 128000
Input Limit -
Output Limit 8192
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights Yes
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -