whichllm — Browse and compare AI model specs and pricing

Alibaba

Qwen3-Omni Flash Realtime model ID, context window & pricing

qwen

Quick facts

Model ID qwen3-omni-flash-realtime
Source Alibaba
Context Window 65536
Pricing $0.52 input / $1.99 output per 1M tokens
Capabilities tool calling, temperature control

Model overview

Qwen3-Omni Flash Realtime is an AI model from Alibaba with 65536 token context window and text, image, audio, video input support.

Published pricing is $0.52 input and $1.99 output per 1M tokens.

  • Workloads that use text, image, audio, video inputs with text, audio outputs.
  • Agent and tool workflows that need function calling.
Model ID qwen3-omni-flash-realtime
Provider Alibaba
Family qwen
Status -
Knowledge Cutoff 2024-04
Release Date 2025-09-15
Input Modalities text, image, audio, video
Output Modalities text, audio
Context Window 65536
Input Limit -
Output Limit 16384
Tool Calling Yes
Reasoning No
Structured Output -
Temperature Control Yes
Open Weights No
Input Cost / 1M tokens $0.52
Output Cost / 1M tokens $1.99
Reasoning Cost / 1M tokens -
Cache Read Cost / 1M tokens -
Cache Write Cost / 1M tokens -