|
Qwen: Qwen2.5 7B Instruct
|
Alibaba (Qwen Team) |
api direct |
$0.04
|
$0.1
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen 2.5 14B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 3080
· INT4
|
Open →
|
|
Qwen 2.5 32B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX A5000
· INT4
|
Open →
|
|
Qwen 2.5 3B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia GeForce GTX 1050
· INT4
|
Open →
|
|
Qwen 3 235B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen 3 32B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX A5000
· INT4
|
Open →
|
|
Qwen 3 14B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 3080
· INT4
|
Open →
|
|
Qwen 3 8B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia GeForce RTX 2060
· INT4
|
Open →
|
|
Qwen 3 4B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia Titan V
· INT4
|
Open →
|
|
Qwen: Qwen3.6 35B A3B
|
Alibaba (Qwen Team) |
api direct |
$0.15
|
$1.0
|
—
|
—
|
1× Nvidia RTX A5000
· INT4
|
Open →
|
|
Qwen: Qwen3.6 Max Preview
|
Alibaba (Qwen Team) |
api direct |
$1.04
|
$6.24
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3.6 27B
|
Alibaba (Qwen Team) |
api direct |
$0.3
|
$3.2
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3.6 Plus
|
Alibaba (Qwen Team) |
api direct |
$0.325
|
$1.95
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3.5-9B
|
Alibaba (Qwen Team) |
api direct |
$0.04
|
$0.15
|
—
|
—
|
1× Nvidia GeForce RTX 2060
· INT4
|
Open →
|
|
Qwen: Qwen3.5-35B-A3B
|
Alibaba (Qwen Team) |
api direct |
$0.139
|
$1.0
|
—
|
—
|
1× Nvidia RTX A5000
· INT4
|
Open →
|
|
Qwen: Qwen3.5 Plus 2026-02-15
|
Alibaba (Qwen Team) |
api direct |
$0.26
|
$1.56
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3.5 397B A17B
|
Alibaba (Qwen Team) |
api direct |
$0.39
|
$2.34
|
—
|
—
|
1× AMD MI325
· INT4
|
Open →
|
|
Qwen: Qwen3 Max Thinking
|
Alibaba (Qwen Team) |
api direct |
$0.78
|
$3.9
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3 Coder Next
|
Alibaba (Qwen Team) |
api direct |
$0.11
|
$0.8
|
—
|
—
|
2× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen3 VL 32B Instruct
|
Alibaba (Qwen Team) |
api direct |
$0.104
|
$0.416
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3 VL 8B Thinking
|
Alibaba (Qwen Team) |
api direct |
$0.117
|
$1.365
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen: Qwen3 VL 8B Instruct
|
Alibaba (Qwen Team) |
api direct |
$0.08
|
$0.5
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen: Qwen3 VL 30B A3B Thinking
|
Alibaba (Qwen Team) |
api direct |
$0.13
|
$1.56
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3 VL 30B A3B Instruct
|
Alibaba (Qwen Team) |
api direct |
$0.13
|
$0.52
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3 VL 235B A22B Thinking
|
Alibaba (Qwen Team) |
api direct |
$0.26
|
$2.6
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen3 VL 235B A22B Instruct
|
Alibaba (Qwen Team) |
api direct |
$0.2
|
$0.88
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen3 Max
|
Alibaba (Qwen Team) |
api direct |
$0.78
|
$3.9
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen3 Coder Plus
|
Alibaba (Qwen Team) |
api direct |
$0.65
|
$3.25
|
—
|
—
|
API only
|
Open →
|
|
Tongyi DeepResearch 30B A3B
|
Alibaba (Qwen Team) |
api direct |
$0.09
|
$0.45
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen3 Coder 480B A35B Instruct Fp8
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
2× AMD MI300
· INT4
|
Open →
|
|
Qwen3 235B A22B Instruct 2507 FP8 Throughput
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen2.5 7B Instruct Turbo
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen3 4B Base
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia Titan V
· INT4
|
Open →
|
|
Qwen 2 (1.5B)
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P104-100
· INT4
|
Open →
|
|
Qwen2.5 32B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen 2 (72B)
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia A40
· INT4
|
Open →
|
|
Qwen QwQ-32B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen 2 Instruct (1.5B)
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P104-100
· INT4
|
Open →
|
|
Qwen 2 (7B)
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen2.5 1.5B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P104-100
· INT4
|
Open →
|
|
Qwen2.5 1.5B Instruct
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P104-100
· INT4
|
Open →
|
|
Qwen2.5 14B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 3080
· INT4
|
Open →
|
|
Qwen2.5 3B Instruct
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia GeForce GTX 1050
· INT4
|
Open →
|
|
Qwen2.5 72B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia A40
· INT4
|
Open →
|
|
Qwen2.5 7B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen2.5 7B Instruct
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen 2.5 Coder 32B Instruct
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen3 0.6B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P104-100
· INT4
|
Open →
|
|
Qwen3 0.6B Base
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P104-100
· INT4
|
Open →
|
|
Qwen3 1.7B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen3 1.7B Base
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen3 14B Base
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 3080
· INT4
|
Open →
|
|
Qwen3 30B A3b Base
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen3 8B Base
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen3 Next 80B A3b Instruct Fp8
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia A16
· INT4
|
Open →
|
|
Qwen3-VL-235B-A22B-Instruct-FP8
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen3 30B A3B Instruct 2507 Lora
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen3 4B Instruct 2507
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia Titan V
· INT4
|
Open →
|
|
Qwen3 8B Lora
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen 2.5 14B Instruct
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 3080
· INT4
|
Open →
|
|
Qwen3.6 35B A3b Fp8
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× AMD Radeon RX 7900 XTX
· INT4
|
Open →
|
|
Qwen3.5 122B A10b Fp8
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia A100
· INT4
|
Open →
|
|
Qwen3 Coder Next Fp8
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
API only
|
Open →
|
|
Qwen2.5 72B Instruct
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia A40
· INT4
|
Open →
|
|
Qwen2.5 32B Instruct
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen2.5 72B Instruct Turbo
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia A40
· INT4
|
Open →
|
|
Qwen2-VL (72B) Instruct
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia A40
· INT4
|
Open →
|
|
Qwen3.5 9B Fp8
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia RTX A2000
· INT4
|
Open →
|
|
Qwen3-235B-A22B-Instruct-2507
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen3-Coder-480B-A35B-Instruct-Turbo
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
2× AMD MI300
· INT4
|
Open →
|
|
Qwen3.5-0.8B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen3.5-2B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia Titan V
· FP8
|
Open →
|
|
Qwen3.5-4B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia Titan V
· INT4
|
Open →
|
|
Qwen 2.5 7B
|
Alibaba (Qwen Team) |
api direct |
—
|
—
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen 2.5 Coder 32B
|
Alibaba (Qwen Team) |
api direct |
$0.66
|
$1.0
|
—
|
—
|
1× Nvidia RTX A5000
· INT4
|
Open →
|
|
Qwen: Qwen3.7 Max
|
Alibaba (Qwen Team) |
api direct |
$2.5
|
$7.5
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3.5 Plus 2026-04-20
|
Alibaba (Qwen Team) |
api direct |
$0.3
|
$1.8
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen3.6 Flash
|
Alibaba (Qwen Team) |
api direct |
$0.1875
|
$1.125
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3.5-27B
|
Alibaba (Qwen Team) |
api direct |
$0.195
|
$1.56
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3.5-122B-A10B
|
Alibaba (Qwen Team) |
api direct |
$0.26
|
$2.08
|
—
|
—
|
1× Nvidia H100
· INT4
|
Open →
|
|
Qwen: Qwen3.5-Flash
|
Alibaba (Qwen Team) |
api direct |
$0.065
|
$0.26
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3 Coder Flash
|
Alibaba (Qwen Team) |
api direct |
$0.195
|
$0.975
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3 Next 80B A3B Thinking
|
Alibaba (Qwen Team) |
api direct |
$0.0975
|
$0.78
|
—
|
—
|
1× Nvidia A16
· INT4
|
Open →
|
|
Qwen: Qwen3 Next 80B A3B Instruct
|
Alibaba (Qwen Team) |
api direct |
$0.09
|
$1.1
|
—
|
—
|
1× Nvidia A16
· INT4
|
Open →
|
|
Qwen: Qwen Plus 0728
|
Alibaba (Qwen Team) |
api direct |
$0.26
|
$0.78
|
—
|
—
|
API only
|
Open →
|
|
Qwen: Qwen3 30B A3B Thinking 2507
|
Alibaba (Qwen Team) |
api direct |
$0.08
|
$0.4
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3 Coder 30B A3B Instruct
|
Alibaba (Qwen Team) |
api direct |
$0.07
|
$0.27
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3 30B A3B Instruct 2507
|
Alibaba (Qwen Team) |
api direct |
$0.09
|
$0.3
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3 235B A22B Thinking 2507
|
Alibaba (Qwen Team) |
api direct |
$0.1495
|
$1.495
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen3 Coder 480B A35B
|
Alibaba (Qwen Team) |
api direct |
$0.22
|
$1.8
|
—
|
—
|
2× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen3 235B A22B Instruct 2507
|
Alibaba (Qwen Team) |
api direct |
$0.071
|
$0.1
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen3 30B A3B
|
Alibaba (Qwen Team) |
api direct |
$0.09
|
$0.45
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3 8B
|
Alibaba (Qwen Team) |
api direct |
$0.05
|
$0.4
|
—
|
—
|
1× Nvidia P102-100
· INT4
|
Open →
|
|
Qwen: Qwen3 14B
|
Alibaba (Qwen Team) |
api direct |
$0.1
|
$0.24
|
—
|
—
|
1× Nvidia RTX 3080
· INT4
|
Open →
|
|
Qwen: Qwen3 32B
|
Alibaba (Qwen Team) |
api direct |
$0.08
|
$0.28
|
—
|
—
|
1× Nvidia RTX 4000 Ada
· INT4
|
Open →
|
|
Qwen: Qwen3 235B A22B
|
Alibaba (Qwen Team) |
api direct |
$0.455
|
$1.82
|
—
|
—
|
1× AMD MI300
· INT4
|
Open →
|
|
Qwen: Qwen2.5 VL 72B Instruct
|
Alibaba (Qwen Team) |
api direct |
$0.25
|
$0.75
|
—
|
—
|
1× Nvidia L40S
· INT4
|
Open →
|
|
Qwen: Qwen-Plus
|
Alibaba (Qwen Team) |
api direct |
$0.26
|
$0.78
|
—
|
—
|
API only
|
Open →
|
|
Qwen 2.5 72B
|
Alibaba (Qwen Team) |
api direct |
$0.36
|
$0.4
|
—
|
—
|
1× Nvidia L40S
· INT4
|
Open →
|