Verify critical details (pricing, licensing, availability) with the model's source before making business decisions.

Qwen

4.5 ★ — The broadest and most-adopted open-weight family in the world, almost all of it under clean Apache 2.0, spanning phone-size to cluster-scale — held back from a perfect score only by the recent pivot to keeping the frontier flagships closed, which muddies an otherwise exemplary open posture.

Type

big-tech-lab

Country

China

Founded

2023

License posture

mixed

Website

https://qwen.ai

open-weight
china-based
big-tech-lab
commercial-friendly
apache-2-0
multilingual

Quick Take

Alibaba's Qwen is the world's broadest open-weight model family — Apache 2.0 weights spanning phone-size to 397 billion parameters — now paired with a closed, API-only frontier flagship.

Who They Are

Qwen (from the Chinese "Tongyi Qianwen") is the large-model team inside Alibaba Cloud, the cloud-computing arm of Alibaba Group. Since 2023 it has been the most prolific frontier-model shipper of any major tech company, releasing models at a pace that outstrips most dedicated AI labs — dense models, mixture-of-experts models, coding specialists, vision-language models, audio models, and embeddings, across more than a dozen size points.

The strategy behind that firehose is straightforward and worth understanding, because it explains why the models are so generously licensed. Alibaba Cloud makes its money from cloud compute and API access, not from selling model licenses. Open-sourcing Qwen drives adoption; adoption drives people to run those models — often on Alibaba Cloud. The result is that Qwen has become one of the most-downloaded model families on Hugging Face and the default open-weight choice for a huge swath of developers and businesses worldwide.

Model Philosophy

For most of its history, Qwen's answer to "how open are you?" was "very": nearly the entire lineup ships under Apache 2.0, the gold standard of permissive licenses — unrestricted commercial use, modification, and redistribution, no royalties, no user-count carve-outs. That is more permissive than Meta's Llama license and on par with the most open models anywhere.

In 2026 the posture got more nuanced. Alibaba began holding its absolute frontier models closed: the agent-tuned "Max" flagships (Qwen3.6-Max, then Qwen3.7-Max) and their multimodal "Plus" siblings are proprietary and API-only, with no downloadable weights. The tier just below, the numbered open-weight models like Qwen3.5-397B-A17B and the Qwen3 and Qwen3.6 families, stays Apache 2.0. So the lineup now splits cleanly: open-weight workhorses you can self-host freely, and a closed frontier tier you can only rent through the API. For a business reader, that split is the single most important thing to keep straight.

What To Know Before You Commit

Match the model to the job, and mind the open/closed line. If you want maximum capability and you're comfortable using a hosted API, the closed Max flagship is the top of the range. If you want to own your stack — self-host, keep data in-house, avoid vendor lock-in, fine-tune freely — the open Apache 2.0 models are the reason Qwen is so widely used, and they run on everything from a laptop to an H100 cluster.
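To make "from a laptop to an H100 cluster" more concrete, here is a rough back-of-the-envelope sketch. It assumes weight memory is approximately parameter count times bytes per parameter; the precision table and the function name `weight_memory_gb` are illustrative assumptions, not anything published by Qwen. It covers weights only, so real deployments need headroom for the KV cache, activations, and runtime overhead.

```python
# Rough weight-only memory estimate: parameters x bytes per parameter.
# Assumption: ignores KV cache, activations, and framework overhead,
# so actual hardware requirements will be higher than these numbers.

# Illustrative precisions (bytes per parameter); not an official Qwen table.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billions: float, precision: str = "fp16") -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billions * 1e9 * BYTES_PER_PARAM[precision]
    return total_bytes / 1e9


if __name__ == "__main__":
    # Sizes taken from the profile; for a mixture-of-experts model the total
    # parameter count is used, on the assumption that all experts are loaded.
    for label, size_b in [("0.6B", 0.6), ("8B", 8), ("27B", 27), ("397B", 397)]:
        row = ", ".join(
            f"{p}: {weight_memory_gb(size_b, p):.1f} GB" for p in BYTES_PER_PARAM
        )
        print(f"{label:>5} -> {row}")
```

Treat the output as a first filter for which size tier is worth evaluating on your hardware, not as a sizing guarantee.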

The China-jurisdiction consideration applies the same way it does for any Chinese lab: the hosted DashScope API routes data to Alibaba Cloud under Chinese law, while the open weights, run on your own infrastructure, carry no such routing. Qwen has drawn far less government-restriction attention than DeepSeek did, but the underlying data-governance logic is identical — and for the closed Max/Plus models, the hosted API is the only way to use them, so there's no self-host escape hatch for those specific models.
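The open/closed line and the data-routing point can be combined into a small decision sketch. This is a hypothetical helper, not an Alibaba API: the `ModelEntry` class, the `deployment_options` function, and the option labels are invented for illustration, and the catalog lists only a few models named in this profile. It also assumes the open-weight models are offered through a hosted API as well, which this profile implies but does not state outright.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ModelEntry:
    name: str
    open_weights: bool  # True: downloadable Apache 2.0 weights; False: API-only


# Hypothetical, partial catalog based on the tiers described in this profile.
CATALOG = {
    "Qwen3.7-Max": ModelEntry("Qwen3.7-Max", open_weights=False),
    "Qwen3.7-Plus": ModelEntry("Qwen3.7-Plus", open_weights=False),
    "Qwen3.5-397B-A17B": ModelEntry("Qwen3.5-397B-A17B", open_weights=True),
    "QwQ-32B": ModelEntry("QwQ-32B", open_weights=True),
}


def deployment_options(model: ModelEntry, data_must_stay_in_house: bool) -> List[str]:
    """Return the deployment paths compatible with a data-governance constraint."""
    options: List[str] = []
    if model.open_weights:
        # Open weights run on your own infrastructure, with no routing to the vendor.
        options.append("self-host")
    if not data_must_stay_in_house:
        # Assumption: every listed model is also reachable through the hosted API.
        options.append("hosted-api")
    return options


if __name__ == "__main__":
    for entry in CATALOG.values():
        print(entry.name, deployment_options(entry, data_must_stay_in_house=True))
```

An empty list for a closed model under a strict in-house requirement is the "no self-host escape hatch" case described above.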

How They Compare

Against Meta, Qwen is more permissively licensed across its open tier (clean Apache 2.0 versus Llama's community license with its large-user carve-out) and offers a far wider range of sizes, but carries the China-jurisdiction consideration Meta doesn't. DeepSeek is the other leading Chinese open-weight lab: it tends to win on raw frontier capability-per-dollar and keeps its flagships MIT-open, while Qwen wins on breadth of sizes, Apache licensing, and multilingual coverage, but has moved its very top models closed. Against the Western closed labs (OpenAI, Anthropic, Google) and Mistral AI, Qwen's open tier is the pitch: competitive capability you can download and self-host for free, in exchange for the data-governance questions a US- or EU-based vendor doesn't raise.

Original Models

Qwen3.7

Alibaba's closed, agent-first flagship: frontier-tier coding and reasoning with a million-token context window, priced at roughly half its Western rivals, but API-only, with no weights to own.

Qwen3.7-Plus — the closed, multimodal sibling of the Max flagship; vision input, 1M context, API-only, proprietary.

Qwen3.6

The best open model you can actually run yourself: a dense 27B that beats Qwen's own 397B flagship on agentic coding while fitting on a single consumer GPU, under Apache 2.0.

The Qwen3.6 open MoE (35B-A3B) — the efficient sibling of the dense 3.6-27B, Apache 2.0, single-GPU-friendly.

Qwen3.5

The most capable model you can legally download and self-host with no strings — a 397B multimodal Apache-2.0 flagship that rivals the frontier and speaks 201 languages.

Qwen3-Coder

Qwen's open coding workhorse: a 30B mixture-of-experts model tuned for agentic coding and tool-calling, with repo-scale context, that runs on a single GPU — under Apache 2.0.

Qwen3

The 0.6B dense Qwen3 — the family's smallest model, Apache 2.0, for highly constrained and edge deployments.

The 1.7B dense Qwen3 — an edge/on-device size, Apache 2.0, for phones and embedded use.

The 14B dense Qwen3 — a balanced single-GPU generalist under Apache 2.0.

The open generalist that defined Qwen3: a 235B Apache-2.0 mixture-of-experts model that went toe-to-toe with the closed frontier and became one of the most-deployed open models anywhere.

The general-purpose 30B-A3B MoE — fast, single-GPU-friendly, Apache 2.0; the all-rounder counterpart to Qwen3-Coder.

The 32B dense Qwen3 — the largest single-GPU-friendly dense model in the family, Apache 2.0, with hybrid thinking modes.

The 4B dense Qwen3 — surprisingly strong for its size, Apache 2.0, runs on modest hardware.

The 8B dense Qwen3 — laptop-feasible, Apache 2.0, a common base for local apps and fine-tunes.

QwQ

QwQ-32B — Qwen's early dedicated reasoning model, Apache 2.0; still capable but superseded by Qwen3's native thinking modes.

Qwen2.5

Qwen2.5's 14B general model, Apache 2.0 — the base for DeepSeek-R1-Distill-Qwen-14B.

Qwen2.5's 32B general model, Apache 2.0 — the base for DeepSeek-R1-Distill-Qwen-32B (the standout o1-mini-class distill).

Qwen2.5-Math

Qwen2.5's 1.5B math model, Apache 2.0 — notable as the base for DeepSeek-R1-Distill-Qwen-1.5B.

Qwen2.5's 7B math model, Apache 2.0 — the base for DeepSeek-R1-Distill-Qwen-7B.
