Qwen3-Omni Flash Realtime model ID, context window & pricing
Quick facts
Model ID qwen3-omni-flash-realtime
Source Alibaba (China)
Context Window 65,536 tokens
Pricing $0.23 input / $0.92 output per 1M tokens
Capabilities tool calling, temperature control
Model overview
Qwen3-Omni Flash Realtime is an AI model from Alibaba (China) with a 65,536-token context window. It accepts text, image, and audio input and produces text and audio output.
Published pricing is $0.23 input and $0.92 output per 1M tokens.
Suitable for:
- Workloads that combine text, image, or audio inputs with text or audio outputs.
- Agent and tool workflows that need function calling.
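The published per-token rates can be turned into a rough per-request estimate with simple arithmetic. The sketch below is illustrative only: the function name `estimate_cost_usd` is hypothetical, and it assumes the flat rates of $0.23 (input) and $0.92 (output) per 1M tokens apply to every token regardless of modality, which this page does not confirm.

```python
# Rates as published on this page, in USD per 1,000,000 tokens.
INPUT_USD_PER_MILLION = 0.23
OUTPUT_USD_PER_MILLION = 0.92


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD.

    Assumption: a single flat rate per direction; any modality-specific
    or tiered pricing is not modelled here.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Example: 50,000 input tokens and 4,000 output tokens.
print(f"${estimate_cost_usd(50_000, 4_000):.5f}")  # prints $0.01518
```

Because output tokens cost four times as much as input tokens, long generated responses tend to dominate the bill even when prompts are large.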
Model ID qwen3-omni-flash-realtime
Provider Alibaba (China)
Family qwen
Status -
Knowledge Cutoff 2024-04
Release Date 2025-09-15
Input Modalities text, image, audio
Output Modalities text, audio
Context Window 65,536 tokens
Input Limit -
Output Limit 16,384 tokens
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.23
Output Cost / 1M tokens $0.92
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -
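The context window and output limit listed above suggest a simple pre-flight check before sending a request. The following is a minimal sketch under a stated assumption: since no input limit is published here, it treats the context window as shared between input and output tokens, which may not match the provider's actual accounting. The helper names `max_input_tokens` and `fits_in_context` are hypothetical.

```python
# Limits as listed on this page, in tokens.
CONTEXT_WINDOW = 65_536
OUTPUT_LIMIT = 16_384


def max_input_tokens(reserved_output: int = OUTPUT_LIMIT) -> int:
    """Input budget left after reserving room for the response.

    Assumption: input and output share the same context window.
    """
    if not 0 <= reserved_output <= OUTPUT_LIMIT:
        raise ValueError(f"reserved_output must be between 0 and {OUTPUT_LIMIT}")
    return CONTEXT_WINDOW - reserved_output


def fits_in_context(input_tokens: int, reserved_output: int = OUTPUT_LIMIT) -> bool:
    """Return True if the prompt fits alongside the reserved output budget."""
    return 0 <= input_tokens <= max_input_tokens(reserved_output)


# Reserving the full output limit leaves 65,536 - 16,384 tokens for input.
print(max_input_tokens())  # prints 49152
```

Reserving fewer output tokens frees more room for the prompt, for example `max_input_tokens(1_024)` when only short replies are expected.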