whichllm — Browse and compare AI model specs and pricing

Qwen 3 Embedding 4B model ID, context window & pricing

Quick facts

Model ID: qwen/qwen3-embedding-4b
Source: Inference
Context Window: 32,000 tokens
Pricing: $0.01 input per 1M tokens; output price not listed
Capabilities: open weights

Model overview

Qwen 3 Embedding 4B is an AI model from Inference with a 32,000-token context window and text input support.

Published pricing is $0.01 per 1M input tokens; no output price is listed.
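
To make the pricing concrete, here is a minimal sketch of the arithmetic in Python. The two constants come from this listing. The function names and the sample token counts in the usage note are hypothetical, and the sketch assumes billing is strictly proportional to input tokens with no minimum charge or rounding, which the listing does not confirm.

```python
from decimal import Decimal

# Values taken from the listing above.
INPUT_PRICE_PER_MILLION_USD = Decimal("0.01")
CONTEXT_WINDOW_TOKENS = 32_000


def estimate_input_cost(total_input_tokens: int) -> Decimal:
    """Return the estimated USD cost for a number of input tokens.

    Assumes cost scales linearly with input tokens (hypothetical billing model).
    """
    if total_input_tokens < 0:
        raise ValueError("total_input_tokens must be non-negative")
    return INPUT_PRICE_PER_MILLION_USD * Decimal(total_input_tokens) / Decimal(1_000_000)


def fits_context_window(input_tokens: int) -> bool:
    """Check whether a single input stays within the listed context window."""
    return 0 <= input_tokens <= CONTEXT_WINDOW_TOKENS
```

For example, `estimate_input_cost(250_000_000)` gives the estimated cost of embedding 250 million tokens, and `fits_context_window(40_000)` returns `False` because 40,000 tokens exceeds the listed 32,000-token window. `Decimal` is used instead of `float` so that small per-token prices do not pick up binary rounding error.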

  • Workloads that use text inputs with text outputs.
Model ID: qwen/qwen3-embedding-4b
Provider: Inference
Family: qwen
Status: -
Knowledge Cutoff: 2024-12
Release Date: 2025-01-01
Input Modalities: text
Output Modalities: text
Context Window: 32,000 tokens
Input Limit: -
Output Limit: 2048
Tool Calling: No
Reasoning: No
Structured Output: -
Temperature Control: No
Open Weights: Yes
Input Cost / 1M tokens: $0.01
Output Cost / 1M tokens: -
Reasoning Cost / 1M tokens: -
Cache Read Cost / 1M tokens: -
Cache Write Cost / 1M tokens: -