whichllm — Browse and compare AI model specs and pricing

Inference

Qwen 3 Embedding 4B

qwen

Model ID qwen/qwen3-embedding-4b
Provider Inference
Family qwen
Status -
Knowledge Cutoff 2024-12
Release Date 2025-01-01
Input Modalities text
Output Modalities text
Context Window 32000
Input Limit -
Output Limit 2048
Tool Calling No
Reasoning No
Structured Output -
Temperature Control No
Open Weights Yes
Input Cost / 1M tokens $0.01
Output Cost / 1M tokens -
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -