AI model maker · CN

Alibaba (Qwen Team).

Qwen series — broad open-weights family from text to vision to coding.

Models from Alibaba (Qwen Team).

Qwen 2.5 72B

73B
by Alibaba (Qwen Team) · Qwen · 128,000 ctx

Alibaba's flagship open-weight LLM — 72B dense.

Qwen 2.5 7B

8B
by Alibaba (Qwen Team) · Qwen · 128,000 ctx

7B Qwen 2.5 — most popular Qwen variant on Ollama.

Qwen 2.5 14B

15B
by Alibaba (Qwen Team) · Qwen · 128,000 ctx

14B Qwen 2.5 — sweet spot for single-GPU local hosting.

Qwen 2.5 32B

33B
by Alibaba (Qwen Team) · Qwen · 128,000 ctx

32B Qwen 2.5 — laptop-class workhorse.

Qwen 2.5 3B

3B
by Alibaba (Qwen Team) · Qwen · 32,768 ctx

3B Qwen 2.5 — laptop / edge target.

Qwen 2.5 Coder 32B

33B
by Alibaba (Qwen Team) · Qwen · 128,000 ctx

Alibaba's open-weight coding model — best in class for 32B.

Qwen 3 235B

235B
by Alibaba (Qwen Team) · Qwen 3 · 128,000 ctx

Alibaba's frontier MoE — 235B total / 22B active.

Qwen 3 32B

33B
by Alibaba (Qwen Team) · Qwen 3 · 128,000 ctx

Dense 32B Qwen 3.

Qwen 3 14B

15B
by Alibaba (Qwen Team) · Qwen 3 · 128,000 ctx

Qwen 3 8B

8B
by Alibaba (Qwen Team) · Qwen 3 · 128,000 ctx

Qwen 3 4B

4B
by Alibaba (Qwen Team) · Qwen 3 · 32,768 ctx

Qwen 2 (1.5B)

1.5B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen2.5 14B

14B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen 2.5 14B Instruct

14B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen2.5 1.5B

1.5B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen2.5 1.5B Instruct

1.5B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen2.5 32B

32B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen2.5 32B Instruct

32B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen2.5 3B Instruct

3B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen2.5 72B

72B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen2.5 72B Instruct

72B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen2.5 72B Instruct Turbo

72B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen2.5 7B

7B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen2.5 7B Instruct

7B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen2.5 7B Instruct Turbo

7B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen 2.5 Coder 32B Instruct

32B
by Alibaba (Qwen Team) · 16,384 ctx

Qwen 2 (72B)

72B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen 2 (7B)

7B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen 2 Instruct (1.5B)

1.5B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen2-VL (72B) Instruct

72B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen3 0.6B

0.6B
by Alibaba (Qwen Team) · 40,960 ctx

Qwen3 0.6B Base

0.6B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen3 14B Base

14B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen3 1.7B

1.7B
by Alibaba (Qwen Team) · 40,960 ctx

Qwen3 1.7B Base

1.7B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen3-235B-A22B-Instruct-2507

235B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-235B-A22B-Instruct-2507 is the updated version of the Qwen3-235B-A22B non-thinking mode, featuring significant improvements in gene...

Qwen3 235B A22B Instruct 2507 FP8 Throughput

235B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3 30B A3B Base

30B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen3 30B A3B Instruct 2507 LoRA

30B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3 4B Base

4B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen3 4B Instruct 2507

4B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.5-0.8B

0.8B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.5-0.8B is Alibaba's smallest model in the Qwen3.5 series, featuring a hybrid Gated Delta Networks and sparse Mixture-of-Experts arc...

Qwen3.5 122B A10B FP8

122B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.5-2B

2B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.5-2B is a compact yet capable model from Alibaba's Qwen3.5 series. It features a 262K token context window, support for 201 languag...

Qwen3.5-4B

4B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.5-4B is a mid-size model from Alibaba's Qwen3.5 series that delivers a strong balance of performance and efficiency. It features a ...

Qwen3.5 9B FP8

9B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.6 35B A3B FP8

35B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3 8B Base

8B
by Alibaba (Qwen Team) · 32,768 ctx

Qwen3 8B LoRA

8B
by Alibaba (Qwen Team) · 40,960 ctx

Qwen3 Coder 480B A35B Instruct FP8

480B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-Coder-480B-A35B-Instruct-Turbo

480B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-Coder-480B-A35B-Instruct is Qwen3's most agentic code model, featuring significant performance on agentic coding, agentic brows...

Qwen3 Coder Next FP8

text
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3 Next 80B A3B Instruct FP8

80B
by Alibaba (Qwen Team)

Qwen3-VL-235B-A22B-Instruct-FP8

235B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen: Qwen2.5 7B Instruct

7B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen2.5 7B belongs to the latest series of Qwen large language models. Qwen2.5 brings the following improvements over Qwen2: - Significantly more...

Qwen: Qwen2.5 VL 72B Instruct

72B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing ...

Qwen: Qwen3 14B

14B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialo...

Qwen: Qwen3 235B A22B

235B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supp...

Qwen: Qwen3 235B A22B Instruct 2507

235B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture...

Qwen: Qwen3 235B A22B Thinking 2507

235B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning ...

Qwen: Qwen3 30B A3B

30B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to e...

Qwen: Qwen3 30B A3B Instruct 2507

30B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. ...

Qwen: Qwen3 30B A3B Thinking 2507

30B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-st...

Qwen: Qwen3 32B

32B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dial...

Qwen: Qwen3.5-122B-A10B

122B
by Alibaba (Qwen Team) · 262,144 ctx

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a ...

Qwen: Qwen3.5-27B

27B
by Alibaba (Qwen Team) · 262,144 ctx

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balanc...

Qwen: Qwen3.5-35B-A3B

35B
by Alibaba (Qwen Team) · 262,144 ctx

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechani...

Qwen: Qwen3.5 397B A17B

397B
by Alibaba (Qwen Team) · 262,144 ctx

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism ...

Qwen: Qwen3.5-9B

9B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understandi...

Qwen: Qwen3.5-Flash

multimodal
by Alibaba (Qwen Team) · 1,000,000 ctx

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sp...

Qwen: Qwen3.5 Plus 2026-02-15

multimodal
by Alibaba (Qwen Team) · 1,000,000 ctx

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with...

Qwen: Qwen3.5 Plus 2026-04-20

235B
by Alibaba (Qwen Team) · 1,000,000 ctx

Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces t...

Qwen: Qwen3.6 27B

27B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.6 27B is a dense 27-billion-parameter language model from the Qwen Team at Alibaba, released in April 2026. It features hybrid mult...

Qwen: Qwen3.6 35B A3B

35B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters pe...

Qwen: Qwen3.6 Flash

multimodal
by Alibaba (Qwen Team) · 1,000,000 ctx

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M toke...

Qwen: Qwen3.6 Max Preview

text
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximate...

Qwen: Qwen3.6 Plus

multimodal
by Alibaba (Qwen Team) · 1,000,000 ctx

Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling s...

Qwen: Qwen3.7 Max

text
by Alibaba (Qwen Team) · 1,000,000 ctx

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric worklo...

Qwen: Qwen3 8B

8B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dia...

Qwen: Qwen3 Coder 30B A3B Instruct

30B
by Alibaba (Qwen Team) · 160,000 ctx

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed f...

Qwen: Qwen3 Coder 480B A35B

480B
by Alibaba (Qwen Team) · 1,048,576 ctx

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agenti...

Qwen: Qwen3 Coder Flash

text
by Alibaba (Qwen Team) · 1,000,000 ctx

Qwen3 Coder Flash is Alibaba's fast, cost-efficient version of its proprietary Qwen3 Coder Plus. It is a powerful coding agent model...

Qwen: Qwen3 Coder Next

480B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse Mo...

Qwen: Qwen3 Coder Plus

text
by Alibaba (Qwen Team) · 1,000,000 ctx

Qwen3 Coder Plus is Alibaba's proprietary version of the open-source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializ...

Qwen: Qwen3 Max

235B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual ...

Qwen: Qwen3 Max Thinking

text
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi...

Qwen: Qwen3 Next 80B A3B Instruct

80B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thi...

Qwen: Qwen3 Next 80B A3B Thinking

80B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. ...

Qwen: Qwen3 VL 235B A22B Instruct

235B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across image...

Qwen: Qwen3 VL 235B A22B Thinking

235B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. ...

Qwen: Qwen3 VL 30B A3B Instruct

30B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its ...

Qwen: Qwen3 VL 30B A3B Thinking

30B
by Alibaba (Qwen Team) · 131,072 ctx

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its ...

Qwen: Qwen3 VL 32B Instruct

32B
by Alibaba (Qwen Team) · 262,144 ctx

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across te...

Qwen: Qwen3 VL 8B Instruct

8B
by Alibaba (Qwen Team) · 256,000 ctx

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning ...

Qwen: Qwen3 VL 8B Thinking

8B
by Alibaba (Qwen Team) · 256,000 ctx

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual rea...

Qwen: Qwen-Plus

text
by Alibaba (Qwen Team) · 1,000,000 ctx

Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K context model with a balanced combination of performance, speed, and cost.

Qwen: Qwen Plus 0728

text
by Alibaba (Qwen Team) · 1,000,000 ctx

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, an...

Qwen QwQ-32B

32B
by Alibaba (Qwen Team) · 131,072 ctx

Tongyi DeepResearch 30B A3B

30B
by Alibaba (Qwen Team) · 131,072 ctx

Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 billion total parameters activating only 3 billio...