Quick facts

Model ID nvidia/active-speaker-detection

Source Nvidia

Context Window -

Pricing -

Capabilities open weights

Model overview

Active Speaker Detection is an open-weights model from Nvidia that accepts video input and produces text output. Its context window is not published.

Public token pricing is not listed for this model in the current catalog source.

Model ID nvidia/active-speaker-detection

Provider Nvidia

Family -

Status -

Knowledge Cutoff -

Release Date 2026-04-16

Input Modalities video

Output Modalities text

Context Window -

Input Limit -

Output Limit 4096

Tool Calling No

Reasoning No

Structured Output -

Temperature Control No

Open Weights Yes

Input Cost / 1M tokens -

Output Cost / 1M tokens -

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens -

Cache Write Cost / 1M tokens -
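
The fields above follow a simple "name value" layout, with "-" marking a value the catalog does not publish. A minimal sketch of reading such lines into a typed record might look like the following. The function name `parse_spec` and the sample constant `SPEC_LINES` are illustrative assumptions, not part of any Nvidia or catalog API, and the approach assumes every value is a single whitespace-free token, as it is in the listing above.

```python
# Hypothetical helper: parse "Name Value" catalog lines into a dict.
# Assumes each value is one token; "-" means "not published".

SPEC_LINES = """\
Model ID nvidia/active-speaker-detection
Provider Nvidia
Knowledge Cutoff -
Release Date 2026-04-16
Input Modalities video
Output Modalities text
Output Limit 4096
Tool Calling No
Temperature Control No
Open Weights Yes
Input Cost / 1M tokens -
"""


def parse_spec(text):
    """Return a dict mapping field names to typed values."""
    spec = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # The value is the last whitespace-separated token; the rest is the name.
        key, value = line.rsplit(None, 1)
        if value == "-":
            spec[key] = None            # unpublished field
        elif value in ("Yes", "No"):
            spec[key] = value == "Yes"  # boolean flag
        elif value.isdigit():
            spec[key] = int(value)      # numeric limit
        else:
            spec[key] = value           # free-form string (ID, date, modality)
    return spec


spec = parse_spec(SPEC_LINES)
```

Mapping "-" to `None` rather than to zero or an empty string keeps "not published" distinct from a real value, so a caller can tell that pricing is unknown rather than free.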