Our highly versatile and intelligent model, optimizing speed and capabilities across text, audio, and vision. Ideal for complex reasoning, coding tasks, and real-time conversation applications.

Context: 128k

Top Benchmark: HUMANEVAL 90.2%

Input Price / M: $2.50

Output Price / M: $10.00

OpenAI

GPT-4o Mini

active

OpenAI's most cost-efficient and lightweight model, designed for high-frequency, low-latency tasks. Excels at simple text processing, routing, and basic structured outputs.

Context: 128k

Top Benchmark: HELLASWAG 84.7%

Input Price / M: $0.15

Output Price / M: $0.60

OpenAI

OpenAI o1

active

A specialized reasoning model trained with reinforcement learning to think before responding. Exceptional at complex math, physics, coding, and logical troubleshooting.

Context: 200k

Top Benchmark: MATH 94.8%

Input Price / M: $15.00

Output Price / M: $60.00

OpenAI

OpenAI o3-mini

active

OpenAI's latest reasoning model, providing advanced math and coding capabilities at the speed of a mini model. Optimized for low-cost, high-speed scientific reasoning.

Context: 200k

Top Benchmark: MATH 90.2%

Input Price / M: $1.10

Output Price / M: $4.40

Anthropic

Claude 4.6 Sonnet

active

Anthropic's state-of-the-art model offering the best balance of speed, capability, and cost-effectiveness. Highly reliable for software engineering, computer-use agents, and structured data analysis.

Context: 1,000k

Top Benchmark: HELLASWAG 96%

Input Price / M: $3.00

Output Price / M: $15.00

Anthropic

Claude 4.5 Haiku

active

Anthropic's fastest and most cost-effective model, offering high speed and advanced multi-turn conversation logic. Optimized for lightweight user-facing applications and agent orchestration.

Context: 200k

Top Benchmark: HELLASWAG 93.5%

Input Price / M: $1.00

Output Price / M: $5.00

Google

Gemini 3.5 Flash

active

Google's latest high-speed, high-efficiency model, featuring a large context window and multimodality.

Context: 1,048.576k

Top Benchmark: HELLASWAG 97.8%

Input Price / M: $1.50

Output Price / M: $9.00

Google

Gemini 3.1 Pro

active

Google's premier multimodal model featuring a massive 2-million-token context window. Engineered for deep code analysis, video indexing, and long-context reasoning.

Context: 2,000k

Top Benchmark: HELLASWAG 98.4%

Input Price / M: $2.00

Output Price / M: $12.00

Google

Gemini 3.1 Flash

active

Google's high-speed, cost-efficient model with a 1-million-token context window, optimized for high-volume content synthesis, classification, and routing.

Context: 1,000k

Top Benchmark: HELLASWAG 95.2%

Input Price / M: $0.25

Output Price / M: $1.50

Google

Gemini 2.5 Pro

active

Previous-generation professional model with a 2-million-token context window, optimized for large-file processing and code reasoning.

Context: 2,000k

Top Benchmark: HELLASWAG 96.2%

Input Price / M: $1.25

Output Price / M: $10.00

Meta

Llama 4 Maverick

active

Meta's next-generation open-weights model. Delivers premium agentic capabilities, reasoning, and tool-call compliance for local or self-hosted enterprise stacks.

Context: 1,048.576k

Top Benchmark: HELLASWAG 97.2%

Input Price / M: $0.15

Output Price / M: $0.60

Meta

Llama 4 Scout

active

Meta's ultra-long context model for document processing and massive retrieval tasks.

Context: 10,000k

Top Benchmark: HELLASWAG 94.5%

Input Price / M: $0.10

Output Price / M: $0.30

Meta

Llama 3.3 70B Instruct

active

Meta's state-of-the-art open-weights model, providing enterprise-grade reasoning and logic. Exceptionally powerful for self-hosted customer support, text generation, and tooling workflows.

Context: 131.072k

Top Benchmark: HELLASWAG 88.5%

Input Price / M: $0.10

Output Price / M: $0.32

Meta

Llama 3.2 11B Vision

active

Meta's lightweight open-weights vision model, optimized for mobile devices and local deployments. Capable of visual understanding, chart reading, and fast text generation.

Context: 131.072k

Top Benchmark: HELLASWAG 82%

Input Price / M: $0.34

Output Price / M: $0.34

Mistral

Mistral Large 3

active

Mistral's flagship commercial model, boasting multilingual support and advanced coding and math skills. Designed for complex reasoning and enterprise tasks that require high compliance.

Context: 262.144k

Top Benchmark: HELLASWAG 90%

Input Price / M: $0.50

Output Price / M: $1.50

Mistral

Mistral Small 3

active

A fast, reliable model tailored for moderate-complexity tasks. Excels at translation, text summarization, and structured JSON parsing at an affordable price point.

Context: 128k

Top Benchmark: HELLASWAG 85%

Input Price / M: $0.07

Output Price / M: $0.20

Cohere

Command R+

active

Cohere's enterprise-optimized model built for advanced Retrieval-Augmented Generation (RAG) and multi-step tool use. Highly effective for multilingual business processes.

Context: 128k

Top Benchmark: HELLASWAG 82.5%

Input Price / M: $2.50

Output Price / M: $10.00

Cohere

Command R

active

Lightweight model engineered for productivity applications and high-speed enterprise integrations. Optimizes RAG performance and API tool integrations.

Context: 128k

Top Benchmark: HELLASWAG 78%

Input Price / M: $0.15

Output Price / M: $0.60

xAI

Grok 4.20

active

xAI's flagship reasoning model with real-time X platform data integration. Exceptionally strong in physics, mathematics, and advanced code synthesis.

Context: 2,000k

Top Benchmark: HELLASWAG 98.8%

Input Price / M: $1.25

Output Price / M: $2.50

xAI

Grok 4.3

active

xAI's high-performance model featuring world-class coding capabilities, real-time web access, and mathematical proof-solving.

Context: 1,000k

Top Benchmark: HELLASWAG 97.5%

Input Price / M: $1.25

Output Price / M: $2.50

DeepSeek

DeepSeek V4 Pro

active

A state-of-the-art Mixture of Experts (MoE) model featuring 671B parameters. Offers performance comparable to top-tier commercial models at a fraction of the inference cost.

Context: 1,048.576k

Top Benchmark: HELLASWAG 98.6%

Input Price / M: $0.43

Output Price / M: $0.87

DeepSeek

DeepSeek V4 Flash

active

DeepSeek's latest lightweight and ultra-cost-efficient model for fast, high-frequency tasks.

Context: 1,048.576k

Top Benchmark: HELLASWAG 95.5%

Input Price / M: $0.10

Output Price / M: $0.20

DeepSeek

DeepSeek R1

active

A premier reasoning model employing large-scale reinforcement learning. Displays specialized math, coding, and logical validation capabilities comparable to OpenAI's o1.

Context: 163.84k

Top Benchmark: HELLASWAG 94%

Input Price / M: $0.70

Output Price / M: $2.50

Anthropic

Claude 4.7 Opus

active

Anthropic's highly capable Opus-tier reasoning model with superior performance on complex multi-step tasks, autonomous coding agents, and scientific analysis.

Context: 1,000k

Top Benchmark: HELLASWAG 98.8%

Input Price / M: $5.00

Output Price / M: $25.00

Anthropic

Claude 4.6 Opus

active

The Opus-tier variant of Claude 4.6, designed for the most demanding enterprise agentic workflows requiring deep analytical reasoning and long-context synthesis.

Context: 1,000k

Top Benchmark: HELLASWAG 97.5%

Input Price / M: $5.00

Output Price / M: $25.00

Anthropic

Claude 4.5 Sonnet

active

High-performance Sonnet model excelling at nuanced writing, coding, and agentic tasks. Offers extended 1M-token context for large-document workflows.

Context: 1,000k

Top Benchmark: HELLASWAG 95.8%

Input Price / M: $3.00

Output Price / M: $15.00

Anthropic

Claude 4.5 Opus

active

Anthropic's flagship model from the 4.5 generation, delivering exceptional reasoning and complex problem-solving at the frontier of AI capabilities.

Context: 200k

Top Benchmark: HELLASWAG 97.2%

Input Price / M: $5.00

Output Price / M: $25.00

Anthropic

Claude 3.5 Sonnet

active

Anthropic's highly popular previous-generation model, widely praised for coding, complex reasoning, and high-quality writing.

Context: 200k

Top Benchmark: HELLASWAG 95.4%

Input Price / M: $3.00

Output Price / M: $15.00

Anthropic

Claude 3.5 Haiku

active

A fast and efficient model from the Claude 3.5 family, offering high performance for coding and text processing tasks.

Context: 200k

Top Benchmark: HELLASWAG 89.2%

Input Price / M: $0.80

Output Price / M: $4.00

Anthropic

Claude 3 Opus

active

Anthropic's powerful model from the Claude 3 family, engineered for deep analysis, research, and complex task execution.

Context: 200k

Top Benchmark: HELLASWAG 95%

Input Price / M: $15.00

Output Price / M: $75.00

Anthropic

Claude Sonnet 4

active

The next-generation Sonnet model featuring a 1M-token context window. Well-suited for enterprise coding, document understanding, and complex instruction-following.

Context: 1,000k

Top Benchmark: HELLASWAG 95%

Input Price / M: $3.00

Output Price / M: $15.00

Anthropic

Claude Opus 4

active

Anthropic's first Claude 4-generation flagship, offering deep expert-level reasoning, complex writing, and high-performance code synthesis across long contexts.

Context: 200k

Top Benchmark: HELLASWAG 96.5%

Input Price / M: $15.00

Output Price / M: $75.00
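
The Input Price / M and Output Price / M figures in this listing can be combined into a rough per-request cost estimate. The sketch below assumes that "/ M" means US dollars per one million tokens and that input and output tokens are billed separately at the listed rates. The function name `estimate_cost` and the token counts are hypothetical, chosen only for illustration.

```python
def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Estimate request cost in USD from per-million-token prices (assumed units)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count by its price per one million tokens.
    input_cost = input_tokens * input_price_per_m / 1_000_000
    output_cost = output_tokens * output_price_per_m / 1_000_000
    return input_cost + output_cost


# Hypothetical workload: 1,000,000 input tokens and 200,000 output tokens,
# priced at $2.50 input / $10.00 output per million tokens
# (a rate pair that appears in the listing).
cost = estimate_cost(1_000_000, 200_000, 2.50, 10.00)
print(f"${cost:.2f}")  # $4.50
```

Providers may apply tiered, batch, or cached-token pricing that this sketch does not model, so treat the result as an upper-level estimate rather than a bill.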
