AI models

Every way to use the major models.

Closed models like Claude and GPT link to the cheapest API provider. Open-weights models like Llama, Kimi, and DeepSeek offer a choice: hosted inference, or self-hosting on rented GPUs.
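
One rough way to weigh hosted inference against self-hosting on rented GPUs is to convert both to a cost per million tokens. The sketch below is only an illustration: the GPU hourly rate, the sustained throughput, and the function names are hypothetical and do not come from this page, and it optimistically assumes the rented GPU is kept fully busy.

```python
def self_host_cost_per_million(gpu_hourly_usd: float, tokens_per_second: float) -> float:
    """USD per 1M tokens for a rented GPU, assuming it is fully utilized."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return gpu_hourly_usd / tokens_per_hour * 1_000_000


def cheaper_option(hosted_usd_per_million: float,
                   gpu_hourly_usd: float,
                   tokens_per_second: float) -> str:
    """Return "self-host" if the rented GPU is cheaper per token, else "hosted"."""
    self_cost = self_host_cost_per_million(gpu_hourly_usd, tokens_per_second)
    return "self-host" if self_cost < hosted_usd_per_million else "hosted"


# Hypothetical inputs: a $2.00/hour GPU sustaining 1,000 tokens/s.
example_cost = self_host_cost_per_million(2.00, 1000.0)
```

Real utilization is usually well below 100%, so the self-hosted figure is best read as a lower bound.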

186 tracked · 73 open weights · 113 closed APIs · cheapest input $0.01/M

Quality × Price

Find the sweet spot.

Higher = stronger benchmark composite · further left = cheaper input
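
One possible reading of "sweet spot" on a quality-versus-price chart is the Pareto frontier: the models for which no rival is both cheaper and stronger. The sketch below assumes that reading; the `Model` type and the catalog entries are made-up placeholders, not data from this page.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_price: float  # USD per 1M input tokens (lower is better)
    quality: float      # benchmark composite (higher is better)


def pareto_frontier(models: list[Model]) -> list[Model]:
    """Keep each model that beats the quality of every cheaper-or-equal model."""
    frontier = []
    best_quality = float("-inf")
    # Sort by price ascending; on price ties, consider the strongest first.
    for m in sorted(models, key=lambda m: (m.input_price, -m.quality)):
        if m.quality > best_quality:
            frontier.append(m)
            best_quality = m.quality
    return frontier


# Placeholder entries for illustration only.
catalog = [
    Model("model-a", 0.10, 40.0),
    Model("model-b", 0.50, 55.0),
    Model("model-c", 0.60, 50.0),  # pricier and weaker than model-b
    Model("model-d", 3.00, 70.0),
]
sweet_spots = [m.name for m in pareto_frontier(catalog)]
```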

Modality: All · Text · Multimodal · Code · Image · Audio · Video · Vision · Embedding

License: All · Open weights · Closed / API

Size: All · ≤8B (edge) · 8–30B (laptop) · 30–100B (workstation) · 100B+ (datacenter)
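
The size tiers can be expressed as a small lookup. This is a sketch under one assumption: the page does not say which tier a model sitting exactly on a boundary belongs to, so the code below places 30B in "laptop" and 100B in "datacenter". The function name is hypothetical.

```python
def size_tier(params_billion: float) -> str:
    """Map a parameter count (in billions) to one of the page's size tiers."""
    if params_billion <= 0:
        raise ValueError("params_billion must be positive")
    if params_billion <= 8:
        return "edge"
    if params_billion <= 30:   # assumption: exactly 30B counts as laptop
        return "laptop"
    if params_billion < 100:
        return "workstation"
    return "datacenter"        # assumption: exactly 100B counts as datacenter
```

Under these rules a 27B model lands in "laptop" and a 424B model in "datacenter".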

Open-weights models.

Run them yourself on cheap GPUs, or use a hosted-inference provider.

Gemma 3 27B

27B

by Google DeepMind · Gemma · 128,000 ctx

Google's open-weight multimodal LLM — efficient and license-permissive.

Gemma 3 12B

12B

by Google DeepMind · Gemma · 128,000 ctx

12B Gemma 3 — multimodal, single-GPU target.

Gemma 3 4B

by Google DeepMind · Gemma · 128,000 ctx

4B Gemma 3 — laptop multimodal.

Kimi K2.5

1000B

by Moonshot AI · Kimi · 256,000 ctx

Multimodal agentic variant — adds a vision encoder to the K2 backbone.

Arcee AI: Spotlight

by Arcee AI · 131,072 ctx

Spotlight is a 7‑billion‑parameter vision‑language model derived from Qwen 2.5‑VL and fine‑tuned by Arcee AI for tight image‑text groundi...

Baidu: ERNIE 4.5 VL 28B A3B

28B

by Baidu · 131,072 ctx

A powerful multimodal Mixture-of-Experts chat model featuring 28B total parameters with 3B activated per token, delivering exceptional te...

Baidu: ERNIE 4.5 VL 424B A47B

424B

by Baidu · 131,072 ctx

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with...
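
For Mixture-of-Experts entries such as the ERNIE models above, the total parameter count and the per-token active count answer different questions. As a rough sketch: weight memory generally follows the total count, since all experts are normally loaded, while per-token compute follows the active count. The helper names and the bytes-per-parameter table below are illustrative assumptions, and the estimate ignores KV cache, activations, and runtime overhead.

```python
# Approximate storage per parameter for common weight formats.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(total_params_billion: float, dtype: str = "bf16") -> float:
    """Weight-only memory in GB (1 GB = 1e9 bytes) for a given total parameter count."""
    return total_params_billion * BYTES_PER_PARAM[dtype]


def active_fraction(total_params_billion: float, active_params_billion: float) -> float:
    """Share of parameters used per token in a Mixture-of-Experts model."""
    return active_params_billion / total_params_billion


# 28B total with 3B activated per token, as stated in the ERNIE 4.5 VL 28B A3B entry.
ernie_28b_memory = weight_memory_gb(28)       # bf16 weights
ernie_28b_active = active_fraction(28, 3)
```

With these numbers, the 28B model needs roughly 56 GB for bf16 weights while activating about 11% of its parameters per token.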

Baidu: Qianfan-OCR-Fast

ByteDance Seed: Seed 1.6

ByteDance Seed: Seed 1.6 Flash

ByteDance Seed: Seed-2.0-Lite

ByteDance Seed: Seed-2.0-Mini

ByteDance: UI-TARS 7B

gemini-3.1-pro

Kimi 2.7 Code

Kimi K2.5

Kimi K2.6

Llama-3.2-11B-Vision-Instruct

Meta: Llama 4 Maverick

Meta: Llama 4 Scout

Meta: Llama Guard 4 12B

MiniMax: MiniMax-01

Mistral: Ministral 3 14B 2512

Mistral: Ministral 3 3B 2512

Mistral: Ministral 3 8B 2512

Mistral: Mistral Large 3 2512

Mistral: Mistral Medium 3

Mistral: Mistral Medium 3.1

Mistral: Mistral Medium 3.5

Mistral: Mistral Small 3.1 24B

Mistral: Mistral Small 3.2 24B

Mistral: Mistral Small 4

Mistral: Pixtral Large 2411

Mistral-Small-3.2-24B-Instruct-2506

MoonshotAI Kimi Latest

Nemotron-3-Nano-Omni-30B-A3B-Reasoning

Nemotron-Content-Safety-3.5

NVIDIA-Nemotron-3-Ultra-550B-A55B

NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

Ornith-1.0-35B

Perceptron: Perceptron Mk1

Qwen3.5-0.8B

Qwen3.5-2B

Qwen3.5-4B

Qwen: Qwen2.5 VL 72B Instruct

Qwen: Qwen3.5-122B-A10B

Qwen: Qwen3.5-27B

Qwen: Qwen3.5-35B-A3B

Qwen: Qwen3.5 397B A17B

Qwen: Qwen3.5-9B

Qwen: Qwen3.5-Flash

Qwen: Qwen3.5 Plus 2026-02-15

Qwen: Qwen3.5 Plus 2026-04-20

Qwen: Qwen3.6 27B

Qwen: Qwen3.6 35B A3B

Qwen: Qwen3.6 Flash

Qwen: Qwen3.6 Plus

Qwen: Qwen3 VL 235B A22B Instruct

Qwen: Qwen3 VL 235B A22B Thinking

Qwen: Qwen3 VL 30B A3B Instruct

Qwen: Qwen3 VL 30B A3B Thinking

Qwen: Qwen3 VL 32B Instruct

Qwen: Qwen3 VL 8B Instruct

Qwen: Qwen3 VL 8B Thinking

Reka Edge

Seed-1.8

Seed-2.0-code

Seed-2.0-pro

Xiaomi: MiMo-V2.5

Xiaomi: MiMo-V2-Omni

Z.ai: GLM 4.5V

Z.ai: GLM 4.6V

Z.ai: GLM 5V Turbo

Closed / API-only models.

GPT-4o

GPT-4o Mini

Gemini 2.5 Pro
