Active Speaker Detection model ID, context window & pricing
models.dev synced record
Quick facts
Model ID nvidia/active-speaker-detection
Source Nvidia
Context Window -
Pricing -
Capabilities open weights
Model overview
Active Speaker Detection is an AI model from Nvidia with an unpublished context window and video input support.
Public token pricing is not listed for this model in the current catalog source.
- Workloads that use video inputs with text outputs.
Model ID nvidia/active-speaker-detection
Provider Nvidia
Family -
Status -
Knowledge Cutoff -
Release Date 2026-04-16
Input Modalities video
Output Modalities text
Context Window -
Input Limit -
Output Limit 4096
Tool Calling No
Reasoning No
Structured Output -
Temperature Control No
Open Weights Yes
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -