llama-nemotron-embed-vl-1b-v2 model ID, context window & pricing
models.dev synced record
Quick facts
Model ID nvidia/llama-nemotron-embed-vl-1b-v2
Source Nvidia
Context Window 32768
Pricing -
Capabilities open weights
Model overview
llama-nemotron-embed-vl-1b-v2 is an AI model from Nvidia with 32768 token context window and text, image input support.
Public token pricing is not listed for this model in the current catalog source.
- Workloads that use text, image inputs with text outputs.
Model ID nvidia/llama-nemotron-embed-vl-1b-v2
Provider Nvidia
Family -
Status -
Knowledge Cutoff -
Release Date 2026-02-10
Input Modalities text, image
Output Modalities text
Context Window 32768
Input Limit -
Output Limit 2048
Tool Calling No
Reasoning No
Structured Output -
Temperature Control No
Open Weights Yes
Input Cost / 1M tokens -
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -