AI models

Every way
to use the major models.

Closed models like Claude and GPT — link to the cheapest API provider. Open-weights like Llama, Kimi, DeepSeek — choose hosted inference or self-host on rented GPUs.

132 tracked · 0 open weights · 132 closed APIs · cheapest input $0.04/M
Quality × Price

Find the sweet spot.

Higher = stronger benchmark composite · further left = cheaper input

Loading...

132 models match — reset filters

Closed / API-only models.

Direct API, aggregator (OpenRouter, Bedrock), or chat UI.

Claude Opus 4.7

text
by Anthropic · Claude · 200,000 ctx

Frontier reasoning and long-form coding from Anthropic.

Claude Sonnet 4.6

text
by Anthropic · Claude · 200,000 ctx

Best price-performance from Anthropic. Default for production agents.

Claude 3.5 Sonnet

text
by Anthropic · Claude · 200,000 ctx

Anthropic's 3.5 generation — still in active production.

Claude Haiku 4.5

text
by Anthropic · Claude · 200,000 ctx

Fast, cheap Claude variant for high-throughput inference.

Claude 3.5 Haiku

text
by Anthropic · Claude · 200,000 ctx

Fast/cheap Claude 3.5 variant — production fallback for Haiku 4.5.

GPT-5

text
by OpenAI · GPT · 256,000 ctx

OpenAI's frontier multimodal reasoning model.

GPT-4 Turbo

text
by OpenAI · GPT · 128,000 ctx

OpenAI's pre-GPT-5 flagship — still extensively deployed.

GPT-4o

multimodal
by OpenAI · GPT · 128,000 ctx

OpenAI's multimodal model — text, vision, audio in one.

GPT-4o Mini

multimodal
by OpenAI · GPT · 128,000 ctx

Cheap multimodal default — replaced GPT-3.5 Turbo for low-cost workloads.

Gemini 2.5 Pro

multimodal
by Google DeepMind · Gemini · 1,000,000 ctx

Google's frontier reasoning model with native 1M-token context.

Gemini 1.5 Pro

multimodal
by Google DeepMind · Gemini · 1,000,000 ctx

Google's pre-2.5 frontier — 2M context launched here.

Gemini 1.5 Flash

multimodal
by Google DeepMind · Gemini · 1,000,000 ctx

Cheap fast Gemini — production default before 2.0/2.5 Flash.

Grok 3

multimodal
by xAI · Grok · 1,000,000 ctx

xAI's frontier model with built-in DeepSearch + real-time X integration.

AI21: Jamba Large 1.7

text
by Ai21 · 256,000 ctx

Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding, instruction-following, and overall effi...

Amazon: Nova 2 Lite

multimodal
by Amazon · 1,000,000 ctx

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. ...

Amazon: Nova Lite 1.0

multimodal
by Amazon · 300,000 ctx

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to ...

Amazon: Nova Micro 1.0

text
by Amazon · 128,000 ctx

Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low c...

Amazon: Nova Premier 1.0

multimodal
by Amazon · 1,000,000 ctx

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for dis...

Amazon: Nova Pro 1.0

multimodal
by Amazon · 300,000 ctx

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide ...

Anthropic: Claude 3 Haiku

multimodal
by Anthropic · 200,000 ctx

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. S...

Anthropic Claude Haiku Latest

multimodal
by Anthropic · 200,000 ctx

This model always redirects to the latest model in the Anthropic Claude Haiku family.

Anthropic: Claude Opus 4

multimodal
by Anthropic · 200,000 ctx

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-runnin...

Anthropic: Claude Opus 4.1

multimodal
by Anthropic · 200,000 ctx

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic task...

Anthropic: Claude Opus 4.5

multimodal
by Anthropic · 200,000 ctx

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon c...

Anthropic: Claude Opus 4.6

multimodal
by Anthropic · 1,000,000 ctx

Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire...

Anthropic: Claude Opus 4.6 (Fast)

multimodal
by Anthropic · 1,000,000 ctx

Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with higher output speed at premium 6x pricing. Lea...

Anthropic: Claude Opus 4.7 (Fast)

multimodal
by Anthropic · 1,000,000 ctx

Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with higher output speed at premium 6x pricing. Lea...

Anthropic: Claude Opus Latest

multimodal
by Anthropic · 1,000,000 ctx

This model always redirects to the latest model in the Claude Opus family.

Anthropic: Claude Sonnet 4

multimodal
by Anthropic · 1,000,000 ctx

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with...

Anthropic: Claude Sonnet 4.5

multimodal
by Anthropic · 1,000,000 ctx

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers st...

Anthropic Claude Sonnet Latest

multimodal
by Anthropic · 1,000,000 ctx

This model always redirects to the latest model in the Anthropic Claude Sonnet family.

Auto Router

multimodal
by Openrouter · 2,000,000 ctx

Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output....

Body Builder (beta)

text
by Openrouter · 128,000 ctx

Transform your natural language requests into structured OpenRouter API request objects. Describe what you want to accomplish with AI mod...

Cohere: Command A

text
by Cohere · 256,000 ctx

Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, mult...

Cohere: Command R (08-2024)

text
by Cohere · 128,000 ctx

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmente...

Cohere: Command R+ (08-2024)

text
by Cohere · 128,000 ctx

command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower l...

Cohere: Command R7B (12-2024)

text
by Cohere · 128,000 ctx

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, an...

Free Models Router

multimodal
by Openrouter · 200,000 ctx

The simplest way to get free inference. openrouter/free is a router that selects free models at random from the models available on OpenR...

Google: Gemini 2.0 Flash

multimodal
by Google DeepMind · 1,000,000 ctx

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while...

Google: Gemini 2.0 Flash Lite

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), ...

Google: Gemini 2.5 Flash

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and sci...

Google: Gemini 2.5 Flash Lite

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It ...

Google: Gemini 2.5 Flash Lite Preview 09-2025

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It ...

Google: Gemini 2.5 Pro Preview 05-06

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It emplo...

Google: Gemini 2.5 Pro Preview 06-05

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It emplo...

Google: Gemini 3.1 Flash Lite

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text,...

Google: Gemini 3.1 Flash Lite Preview

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite...

Google: Gemini 3.1 Pro Preview

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic relia...

Google: Gemini 3.1 Pro Preview Custom Tools

multimodal
by Google DeepMind · 1,048,756 ctx

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a gener...

Google: Gemini 3.5 Flash

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed....

Google: Gemini 3 Flash Preview

multimodal
by Google DeepMind · 1,048,576 ctx

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance....

Google Gemini Flash Latest

multimodal
by ~google · 1,048,576 ctx

This model always redirects to the latest model in the Google Gemini Flash family.

Google Gemini Pro Latest

multimodal
by ~google · 1,048,576 ctx

This model always redirects to the latest model in the Google Gemini Pro family.

Google: Gemma 2 27B

27B
by Google DeepMind · 8,192 ctx

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). ...

Google: Gemma 3 12B

12B
by Google DeepMind · 131,072 ctx

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, unders...

Google: Gemma 3n 4B

4B
by Google DeepMind · 32,768 ctx

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It support...

Google: Gemma 4 26B A4B

26B
by Google DeepMind · 262,144 ctx

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B...

Google: Gemma 4 31B

31B
by Google DeepMind · 262,144 ctx

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K ...

Google: Lyria 3 Clip Preview

multimodal
by Google DeepMind · 1,048,576 ctx

30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemin...

Google: Lyria 3 Pro Preview

multimodal
by Google DeepMind · 1,048,576 ctx

Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. ...

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

multimodal
by Google DeepMind · 131,072 ctx

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, deliverin...

Google: Nano Banana (Gemini 2.5 Flash Image)

multimodal
by Google DeepMind · 32,768 ctx

Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state of the art image generation model with contextual...

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)

multimodal
by Google DeepMind · 65,536 ctx

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana ...

Inflection: Inflection 3 Pi

text
by Inflection · 8,000 ctx

Inflection 3 Pi powers Inflection's [Pi](https://pi.ai) chatbot, including backstory, emotional intelligence, productivity, and safety. I...

Inflection: Inflection 3 Productivity

text
by Inflection · 8,000 ctx

Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to p...

OpenAI: GPT-3.5 Turbo

text
by OpenAI · 16,385 ctx

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and tradition...

OpenAI: GPT-3.5 Turbo 16k

text
by OpenAI · 16,385 ctx

This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single reque...

OpenAI: GPT-3.5 Turbo Instruct

text
by OpenAI · 4,095 ctx

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Se...

OpenAI: GPT-3.5 Turbo (older v0613)

text
by OpenAI · 4,095 ctx

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and tradition...

OpenAI: GPT-4

text
by OpenAI · 8,191 ctx

OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy tha...

OpenAI: GPT-4.1

multimodal
by OpenAI · 1,047,576 ctx

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-contex...

OpenAI: GPT-4.1 Mini

multimodal
by OpenAI · 1,047,576 ctx

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 ...

OpenAI: GPT-4.1 Nano

multimodal
by OpenAI · 1,047,576 ctx

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performa...

OpenAI: GPT-4o (2024-05-13)

multimodal
by OpenAI · 128,000 ctx

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligen...

OpenAI: GPT-4o (2024-08-06)

multimodal
by OpenAI · 128,000 ctx

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respo...

OpenAI: GPT-4o (2024-11-20)

multimodal
by OpenAI · 128,000 ctx

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improv...

OpenAI: GPT-4 (older v0314)

text
by OpenAI · 8,191 ctx

GPT-4-0314 is the first version of GPT-4 released, with a context length of 8,192 tokens, and was supported until June 14. Training data:...

OpenAI: GPT-4o-mini (2024-07-18)

multimodal
by OpenAI · 128,000 ctx

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. ...

OpenAI: GPT-4o-mini Search Preview

text
by OpenAI · 128,000 ctx

GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search ...

OpenAI: GPT-4o Search Preview

text
by OpenAI · 128,000 ctx

GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

OpenAI: GPT-4 Turbo (older v1106)

text
by OpenAI · 128,000 ctx

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to ...

OpenAI: GPT-4 Turbo Preview

text
by OpenAI · 128,000 ctx

The preview GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Traini...

OpenAI: GPT-5.1

multimodal
by OpenAI · 400,000 ctx

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adheren...

OpenAI: GPT-5.1 Chat

multimodal
by OpenAI · 128,000 ctx

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong genera...

OpenAI: GPT-5.1-Codex

multimodal
by OpenAI · 400,000 ctx

GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both intera...

OpenAI: GPT-5.1-Codex-Max

multimodal
by OpenAI · 400,000 ctx

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is base...

OpenAI: GPT-5.1-Codex-Mini

multimodal
by OpenAI · 400,000 ctx

GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex

OpenAI: GPT-5.2

multimodal
by OpenAI · 400,000 ctx

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context perfomance compared to GPT-5.1...

OpenAI: GPT-5.2 Chat

multimodal
by OpenAI · 128,000 ctx

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-latency chat while retaining strong gener...

OpenAI: GPT-5.2-Codex

multimodal
by OpenAI · 400,000 ctx

GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both in...

OpenAI: GPT-5.2 Pro

multimodal
by OpenAI · 400,000 ctx

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. I...

OpenAI: GPT-5.3 Chat

multimodal
by OpenAI · 128,000 ctx

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful...

OpenAI: GPT-5.3-Codex

multimodal
by OpenAI · 400,000 ctx

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex wi...

OpenAI: GPT-5.4

multimodal
by OpenAI · 1,050,000 ctx

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window ...

OpenAI: GPT-5.4 Image 2

multimodal
by OpenAI · 272,000 ctx

[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabiliti...

OpenAI: GPT-5.4 Mini

multimodal
by OpenAI · 400,000 ctx

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It suppor...

OpenAI: GPT-5.4 Nano

multimodal
by OpenAI · 400,000 ctx

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks...

OpenAI: GPT-5.4 Pro

multimodal
by OpenAI · 1,050,000 ctx

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex,...

OpenAI: GPT-5.5

multimodal
by OpenAI · 1,050,000 ctx

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher relia...

OpenAI: GPT-5.5 Pro

multimodal
by OpenAI · 1,050,000 ctx

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a ...

OpenAI: GPT-5 Chat

multimodal
by OpenAI · 128,000 ctx

GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations for enterprise applications.

OpenAI: GPT-5 Codex

multimodal
by OpenAI · 400,000 ctx

GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactiv...

OpenAI: GPT-5 Image

multimodal
by OpenAI · 400,000 ctx

[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It o...

OpenAI: GPT-5 Image Mini

multimodal
by OpenAI · 400,000 ctx

GPT-5 Image Mini combines OpenAI's advanced language capabilities, powered by [GPT-5 Mini](https://openrouter.ai/openai/gpt-5-mini), with...

OpenAI: GPT-5 Mini

multimodal
by OpenAI · 400,000 ctx

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following a...

OpenAI: GPT-5 Nano

multimodal
by OpenAI · 400,000 ctx

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low late...

OpenAI: GPT-5 Pro

multimodal
by OpenAI · 400,000 ctx

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized f...

OpenAI: GPT Chat Latest

multimodal
by OpenAI · 400,000 ctx

GPT Chat Latest points to OpenAI's stable API alias `chat-latest` that always resolves to the latest Instant chat model used in ChatGPT. ...

OpenAI GPT Latest

multimodal
by OpenAI · 1,050,000 ctx

This model always redirects to the latest model in the OpenAI GPT family.

OpenAI GPT Mini Latest

multimodal
by OpenAI · 400,000 ctx

This model always redirects to the latest model in the OpenAI GPT Mini family.

OpenAI: gpt-oss-safeguard-20b

20B
by OpenAI · 131,072 ctx

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts ...

OpenAI: o1

multimodal
by OpenAI · 200,000 ctx

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is t...

OpenAI: o1-pro

multimodal
by OpenAI · 200,000 ctx

The o1 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro mod...

OpenAI: o3

multimodal
by OpenAI · 200,000 ctx

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It ...

OpenAI: o3 Deep Research

multimodal
by OpenAI · 200,000 ctx

o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex, multi-step research tasks. Note: This model a...

OpenAI: o3 Mini

text
by OpenAI · 200,000 ctx

OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and...

OpenAI: o3 Mini High

text
by OpenAI · 200,000 ctx

OpenAI o3-mini-high is the same model as [o3-mini](/openai/o3-mini) with reasoning_effort set to high. o3-mini is a cost-efficient langua...

OpenAI: o3 Pro

multimodal
by OpenAI · 200,000 ctx

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro mode...

OpenAI: o4 Mini

multimodal
by OpenAI · 200,000 ctx

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multim...

OpenAI: o4 Mini Deep Research

multimodal
by OpenAI · 200,000 ctx

o4-mini-deep-research is OpenAI's faster, more affordable deep research model—ideal for tackling complex, multi-step research tasks. Not...

OpenAI: o4 Mini High

multimodal
by OpenAI · 200,000 ctx

OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort set to high. OpenAI o4-mini is a compact reason...

Owl Alpha

text
by Openrouter · 1,048,756 ctx

Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with...

Pareto Code Router

text
by Openrouter · 2,000,000 ctx

The Pareto Router maintains a tiered shortlist of strong coding models, ranked by [Artificial Analysis](https://artificialanalysis.ai/) c...

Perplexity: Sonar

multimodal
by Perplexity · 127,072 ctx

Sonar is lightweight, affordable, fast, and simple to use — now featuring citations and the ability to customize sources. It is designed ...

Perplexity: Sonar Deep Research

text
by Perplexity · 128,000 ctx

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It aut...

Perplexity: Sonar Pro

multimodal
by Perplexity · 200,000 ctx

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing...

Perplexity: Sonar Pro Search

multimodal
by Perplexity · 200,000 ctx

Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity's most advanced agentic search system. It is d...

Perplexity: Sonar Reasoning Pro

multimodal
by Perplexity · 128,000 ctx

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing...

xAI: Grok 4.20

multimodal
by xAI · 2,000,000 ctx

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest halluci...

xAI: Grok 4.20 Multi-Agent

multimodal
by xAI · 2,000,000 ctx

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in paral...

xAI: Grok 4.3

multimodal
by xAI · 1,000,000 ctx

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instructi...

xAI: Grok Build 0.1

multimodal
by xAI · 256,000 ctx

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engineering workflows. It supports text and image inp...