Quick facts

Model ID qwen/qwen3-embedding-4b

Source Inference

Context Window 32000

Pricing $0.01 input / - output per 1M tokens

Capabilities open weights

Model overview

Qwen 3 Embedding 4B is an AI model from Inference with 32000 token context window and text input support.

Published pricing is $0.01 input and - output per 1M tokens.

Model ID qwen/qwen3-embedding-4b

Provider Inference

Family qwen

Status -

Knowledge Cutoff 2024-12

Release Date 2025-01-01

Input Modalities text

Output Modalities text

Context Window 32000

Input Limit -

Output Limit 2048

Tool Calling No

Reasoning No

Structured Output -

Temperature Control No

Open Weights Yes

Input Cost / 1M tokens $0.01

Output Cost / 1M tokens -

Reasoning Cost / 1M tokens -

Cache Read Cost / 1M tokens -

Cache Write Cost / 1M tokens -