Models v4
Comparison Table
Scan the full model inventory in a sortable, spreadsheet-like view.
| Model | Source | Description | Modalities | Link |
|---|---|---|---|---|
| xAI: Grok 4.1 Fast | OpenRouter | Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using the `reasoning` `enabled` pa… | Text, Image | Open |
| xAI: Grok 4 Fast | OpenRouter | Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model on xAI's [news post](http://x.ai/news… | Text, Image | Open |
| Auto Router | OpenRouter | Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used, visit [Activity](/activity), or read the `model` att… | Text | Open |
| Google: Gemini 3 Flash Preview | OpenRouter | Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool use performance with substantially… | Text, Image, Audio | Open |
| Google: Gemini 3 Pro Preview | OpenRouter | Gemini 3 Pro is Google’s flagship frontier model for high-precision multimodal reasoning, combining strong performance across text, image, video, audio, and code with a 1M-token context window. Reasoning Details must be… | Text, Image, Audio | Open |
| Google: Gemini 2.5 Flash Preview 09-2025 | OpenRouter | Gemini 2.5 Flash Preview September 2025 Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" c… | Image, Text, Audio | Open |
| Google: Gemini 2.5 Flash Lite Preview 09-2025 | OpenRouter | Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across… | Text, Image, Audio | Open |
| Google: Gemini 2.5 Flash Lite | OpenRouter | Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across… | Text, Image, Audio | Open |
| Google: Gemini 2.5 Flash | OpenRouter | Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provid… | Text, Image, Audio | Open |
| Google: Gemini 2.5 Pro | OpenRouter | Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced… | Text, Image, Audio | Open |
| Google: Gemini 2.5 Pro Preview 06-05 | OpenRouter | Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced… | Text, Image, Audio | Open |
| Google: Gemini 2.5 Pro Preview 05-06 | OpenRouter | Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced… | Text, Image, Audio | Open |
| Meta: Llama 4 Maverick | OpenRouter | Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward pass (400B tota… | Text, Image | Open |
| Google: Gemini 2.0 Flash Lite | OpenRouter | Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/… | Text, Image, Audio | Open |
| Google: Gemini 2.0 Flash | OpenRouter | Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemin… | Text, Image, Audio | Open |
| Google: Gemini 2.0 Flash Experimental (free) | OpenRouter | Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemin… | Text, Image | Open |
| OpenAI: GPT-4.1 | OpenRouter | GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o a… | Image, Text | Open |
| OpenAI: GPT-4.1 Mini | OpenRouter | GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on… | Image, Text | Open |
| OpenAI: GPT-4.1 Nano | OpenRouter | For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on M… | Image, Text | Open |
| MiniMax: MiniMax-01 | OpenRouter | MiniMax-01 combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context of up… | Text, Image | Open |
| Amazon: Nova 2 Lite | OpenRouter | Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing documents, extracting… | Text, Image, Video | Open |
| Amazon: Nova Premier 1.0 | OpenRouter | Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models. | Text, Image | Open |
| Anthropic: Claude Sonnet 4.5 | OpenRouter | Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with i… | Text, Image | Open |
| Qwen: Qwen Plus 0728 | OpenRouter | Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination. | Text | Open |
| Qwen: Qwen Plus 0728 (thinking) | OpenRouter | Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination. | Text | Open |
| MiniMax: MiniMax M1 | OpenRouter | MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention… | Text | Open |
| Anthropic: Claude Sonnet 4 | OpenRouter | Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on… | Image, Text | Open |
| Qwen: Qwen-Turbo | OpenRouter | Qwen-Turbo, based on Qwen2.5, is a 1M context model that provides fast speed and low cost, suitable for simple tasks. | Text | Open |
| OpenAI: GPT-5.2 Pro | OpenRouter | GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning, instruction… | Image, Text | Open |
| OpenAI: GPT-5.2 | OpenRouter | GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context performance compared to GPT-5.1. It uses adaptive reasoning to allocate computation dynamically, responding quick… | Text, Image | Open |
| OpenAI: GPT-5.1-Codex-Max | OpenRouter | GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic workflow… | Text, Image | Open |
| OpenAI: GPT-5.1 | OpenRouter | GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive re… | Image, Text | Open |
| OpenAI: GPT-5.1-Codex | OpenRouter | GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering… | Text, Image | Open |
| OpenAI: GPT-5.1-Codex-Mini | OpenRouter | GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex. | Image, Text | Open |
| OpenAI: GPT-5 Image Mini | OpenRouter | GPT-5 Image Mini combines OpenAI's advanced language capabilities, powered by [GPT-5 Mini](https://openrouter.ai/openai/gpt-5-mini), with GPT Image 1 Mini for efficient image generation. This natively multimodal model f… | Text, Image | Open |
| OpenAI: GPT-5 Image | OpenRouter | [GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while inc… | Image, Text | Open |
| OpenAI: GPT-5 Pro | OpenRouter | GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and a… | Image, Text | Open |
| OpenAI: GPT-5 Codex | OpenRouter | GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering task… | Text, Image | Open |
| OpenAI: GPT-5 | OpenRouter | GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accur… | Text, Image | Open |
| OpenAI: GPT-5 Mini | OpenRouter | GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following and safety-tuning benefits as GPT-5, but with reduced latency and cost. GPT-5 Mini… | Text, Image | Open |
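
The sortable, spreadsheet-like view can be approximated offline once a few rows are copied into records. The following is a minimal sketch under stated assumptions: `ModelRow`, `split_modalities`, and `with_modality` are hypothetical names chosen for illustration and are not part of any OpenRouter API. It also assumes the modality tags appear as capitalized words run together (for example `TextImageAudio`), as they do in this listing.

```python
import re
from dataclasses import dataclass


def split_modalities(tag: str) -> tuple[str, ...]:
    """Split a fused tag such as 'TextImageAudio' into its capitalized words."""
    return tuple(re.findall(r"[A-Z][a-z]+", tag))


@dataclass(frozen=True)
class ModelRow:
    # Hypothetical record layout: only the model name and its modality tags.
    name: str
    modalities: tuple[str, ...]


# A handful of rows taken from the table; the fused tags are split on load.
ROWS = [
    ModelRow("xAI: Grok 4.1 Fast", split_modalities("TextImage")),
    ModelRow("Auto Router", split_modalities("Text")),
    ModelRow("Google: Gemini 3 Flash Preview", split_modalities("TextImageAudio")),
    ModelRow("Amazon: Nova 2 Lite", split_modalities("TextImageVideo")),
    ModelRow("Qwen: Qwen-Turbo", split_modalities("Text")),
]


def with_modality(rows, modality):
    """Return the rows that list `modality`, sorted by model name."""
    matching = (row for row in rows if modality in row.modalities)
    return sorted(matching, key=lambda row: row.name)


if __name__ == "__main__":
    for row in with_modality(ROWS, "Image"):
        print(row.name, "|", ", ".join(row.modalities))
```

Sorting here uses plain string comparison, so names starting with a lowercase letter (such as `xAI`) sort after capitalized ones; pass `key=lambda row: row.name.lower()` instead if a case-insensitive order is preferred.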
