Qwen-Flash
Legacy stable Flash alias; qwen-flash currently points to the snapshot qwen-flash-2025-07-28. It is the recommended replacement for the discontinued Qwen-Turbo. Official international pricing is tiered, per 1M tokens: for 0 < tokens ≤ 256K, $0.05 input / $0.40 output; for 256K < tokens ≤ 1M, $0.25 input / $2.00 output. A 50% batch-inference discount and context caching are supported. The context window is 1M tokens. The maximum output length is not listed on the accessible official page. Still listed as GA on the pricing page as of Jun 22, 2026. Text-only.
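The tiered rates above can be turned into a rough per-request cost estimate. The following is a minimal sketch, not Alibaba's billing logic. It assumes that the tier is selected by the request's input token count, that both input and output are billed at that tier's rates, that "256K" means 256,000 tokens, and that the batch discount is a flat 50% on the total. The names `Tier`, `TIERS`, and `estimate_cost_usd` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    max_input_tokens: int      # upper bound of the tier (inclusive)
    input_usd_per_m: float     # USD per 1M input tokens
    output_usd_per_m: float    # USD per 1M output tokens


# Rates copied from the listing; the 256,000 boundary is an assumption
# (the page only says "256K").
TIERS = (
    Tier(256_000, 0.05, 0.40),
    Tier(1_000_000, 0.25, 2.00),
)


def estimate_cost_usd(input_tokens: int, output_tokens: int, batch: bool = False) -> float:
    """Estimate the cost of one request under the assumed tier rules."""
    if input_tokens <= 0 or output_tokens < 0:
        raise ValueError("token counts must be positive")
    for tier in TIERS:
        if input_tokens <= tier.max_input_tokens:
            cost = (
                input_tokens * tier.input_usd_per_m
                + output_tokens * tier.output_usd_per_m
            ) / 1_000_000
            # Assumption: the 50% batch discount applies to the whole request.
            return cost * 0.5 if batch else cost
    raise ValueError("input exceeds the 1M-token context window")


if __name__ == "__main__":
    print(estimate_cost_usd(100_000, 10_000))              # base tier
    print(estimate_cost_usd(500_000, 10_000))              # long-context tier
    print(estimate_cost_usd(500_000, 10_000, batch=True))  # with batch discount
```

Under these assumptions, a request that crosses the 256K boundary is billed at five times the base rates, so splitting long inputs may matter for cost. Check the official pricing page before relying on the exact tier rule.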
Provider
Alibaba
Status
GA
Input price
$0.05 / 1M tokens (0 to 256K tier; $0.25 above 256K)
Output price
$0.40 / 1M tokens (0 to 256K tier; $2.00 above 256K)
Cached input
—
Blended price
$0.138 / 1M tokens
Context window
1,000,000 tokens (1M)
Max output
—
Modality
text
Knowledge cutoff
—
Released
28 Jul 2025
API string
qwen-flash
Source: Alibaba official documentation
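The listing does not say how the blended price is derived. A 3:1 input-to-output weighting of the base-tier prices gives $0.1375 per 1M tokens, which is consistent with the listed $0.138 after rounding, so the sketch below assumes that weighting. The function name `blended_price` and its default weights are hypothetical.

```python
def blended_price(
    input_usd_per_m: float,
    output_usd_per_m: float,
    input_weight: float = 3.0,   # assumed ratio, not stated in the listing
    output_weight: float = 1.0,
) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return (
        input_weight * input_usd_per_m + output_weight * output_usd_per_m
    ) / total_weight


if __name__ == "__main__":
    # Base tier (up to 256K tokens): $0.05 in / $0.40 out
    print(blended_price(0.05, 0.40))
    # Long-context tier (256K to 1M tokens): $0.25 in / $2.00 out
    print(blended_price(0.25, 2.00))
```

The same function applied to the long-context tier gives a blended figure that the listing does not show. Workloads with a different input-to-output mix can pass their own weights.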