Quick facts

Model ID nvidia/llama-nemotron-embed-vl-1b-v2

Source Nvidia

Context Window 32768

Pricing -

Capabilities open weights

Model overview

llama-nemotron-embed-vl-1b-v2 is an AI model from Nvidia with 32768 token context window and text, image input support.

Public token pricing is not listed for this model in the current catalog source.

Model ID nvidia/llama-nemotron-embed-vl-1b-v2

Provider Nvidia

Family nemotron

Status -

Knowledge Cutoff -

Release Date 2026-02-10

Input Modalities text, image

Output Modalities text

Context Window 32768

Input Limit -

Output Limit 2048

Tool Calling No

Reasoning No

Structured Output -

Temperature Control No

Open Weights Yes

Input Cost / 1M tokens -

Output Cost / 1M tokens -

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens -

Cache Write Cost / 1M tokens -