INTELLIGENCE HUB

AI Model Intelligence Hub

Comprehensive directory of production-ready large language models. Track specifications, performance benchmarks, and deployment status.

Models Tracked40
Providers Covered8
Last UpdatedJune 16, 2026
OpenAI

GPT-5.5 Pro

active

Our flagship enterprise-grade model. Unparalleled capabilities in complex programming, scientific logic, and multi-modal understanding.

Context1,050k
Top BenchmarkHELLASWAG: 99.4%
Input Price / M$30.00
Output Price / M$180.00
OpenAI

GPT-5.5

active

Our next-generation frontier model, optimized for highly complex multimodal reasoning, advanced mathematics, and native agentic planning.

Context1,050k
Top BenchmarkHELLASWAG: 99%
Input Price / M$5.00
Output Price / M$30.00
OpenAI

GPT-5

active

Our standard next-generation model. Excellent balance of cost, speed, and intelligence.

Context400k
Top BenchmarkHELLASWAG: 98.5%
Input Price / M$1.25
Output Price / M$10.00
OpenAI

GPT-5 Mini

active

A fast, cheap, and highly capable mini model from the GPT-5 family.

Context400k
Top BenchmarkHELLASWAG: 95.5%
Input Price / M$0.25
Output Price / M$2.00
OpenAI

GPT-4o

active

Our highly versatile and intelligent model, optimizing speed and capabilities across text, audio, and vision. Ideal for complex reasoning, coding tasks, and real-time conversation applications.

Context128k
Top BenchmarkHUMANEVAL: 90.2%
Input Price / M$2.50
Output Price / M$10.00
OpenAI

GPT-4o Mini

active

Our most cost-efficient and lightweight model designed for high-frequency, low-latency tasks. Excels at simple text processing, routing, and basic structured outputs.

Context128k
Top BenchmarkHELLASWAG: 84.7%
Input Price / M$0.15
Output Price / M$0.60
OpenAI

OpenAI o1

active

A specialized reasoning model trained with reinforcement learning to think before responding. Exceptional at complex math, physics, coding, and logical troubleshooting.

Context200k
Top BenchmarkMATH: 94.8%
Input Price / M$15.00
Output Price / M$60.00
OpenAI

OpenAI o3-mini

active

Our latest reasoning model that provides advanced math and coding capabilities at the speed of a mini model. Optimized for low-cost, high-speed scientific reasoning.

Context200k
Top BenchmarkMATH: 90.2%
Input Price / M$1.10
Output Price / M$4.40
Anthropic

Claude 4.6 Sonnet

active

Our state-of-the-art model offering the best balance of speed, capability, and cost-effectiveness. Highly reliable for software engineering, computer-use agents, and structured data analysis.

Context1,000k
Top BenchmarkHELLASWAG: 96%
Input Price / M$3.00
Output Price / M$15.00
Anthropic

Claude 4.5 Haiku

active

Anthropic's fastest and most cost-effective model, offering high speed and advanced multi-turn conversation logic. Optimized for lightweight user-facing applications and agent orchestration.

Context200k
Top BenchmarkHELLASWAG: 93.5%
Input Price / M$1.00
Output Price / M$5.00
Google

Gemini 3.5 Flash

active

Google's latest high-speed, high-efficiency model, featuring a large context window and multimodality.

Context1,048.576k
Top BenchmarkHELLASWAG: 97.8%
Input Price / M$1.50
Output Price / M$9.00
Google

Gemini 3.1 Pro

active

Google's premiere multi-modal model featuring a massive 2 million token context window. Engineered for deep code analysis, video indexing, and long-context reasoning.

Context2,000k
Top BenchmarkHELLASWAG: 98.4%
Input Price / M$2.00
Output Price / M$12.00
Google

Gemini 3.1 Flash

active

Our high-speed, cost-efficient model. Features a 1 million token context window, optimized for high-volume content synthesis, classification, and routing.

Context1,000k
Top BenchmarkHELLASWAG: 95.2%
Input Price / M$0.25
Output Price / M$1.50
Google

Gemini 2.5 Pro

active

Previous-generation professional model featuring the 2 million context length, optimized for large file processing and code reasoning.

Context2,000k
Top BenchmarkHELLASWAG: 96.2%
Input Price / M$1.25
Output Price / M$10.00
Meta

Llama 4 Maverick

active

Meta's next-generation open weights model. Delivers premium agentic capabilities, reasoning, and tool call compliance for local or self-hosted enterprise stacks.

Context1,048.576k
Top BenchmarkHELLASWAG: 97.2%
Input Price / M$0.15
Output Price / M$0.60
Meta

Llama 4 Scout

active

Meta's ultra-long context model for document processing and massive retrieval tasks.

Context10,000k
Top BenchmarkHELLASWAG: 94.5%
Input Price / M$0.10
Output Price / M$0.30
Meta

Llama 3.3 70B Instruct

active

Meta's state-of-the-art open weights model, providing enterprise-grade reasoning and logic. Exceptionally powerful for self-hosted customer support, text generation, and tooling workflows.

Context131.072k
Top BenchmarkHELLASWAG: 88.5%
Input Price / M$0.10
Output Price / M$0.32
Meta

Llama 3.2 11B Vision

active

Meta's lightweight open weights vision model, optimized for mobile devices and local deployments. Capable of visual understanding, chart reading, and fast text generation.

Context131.072k
Top BenchmarkHELLASWAG: 82%
Input Price / M$0.34
Output Price / M$0.34
Mistral

Mistral Large 3

active

Mistral's flagship commercial model, boasting multilingual support and advanced coding and math skills. Designed for complex reasoning and enterprise tasks that require high compliance.

Context262.144k
Top BenchmarkHELLASWAG: 90%
Input Price / M$0.50
Output Price / M$1.50
Mistral

Mistral Small 3

active

A fast, reliable model tailored for moderate-complexity tasks. Excels at translation, text summarizing, and structured JSON parsing at an affordable price point.

Context128k
Top BenchmarkHELLASWAG: 85%
Input Price / M$0.07
Output Price / M$0.20
Cohere

Command R+

active

Cohere's enterprise-optimized model built for advanced Retrieval-Augmented Generation (RAG) and multi-step tool use. Highly effective for multilingual business processes.

Context128k
Top BenchmarkHELLASWAG: 82.5%
Input Price / M$2.50
Output Price / M$10.00
Cohere

Command R

active

Lightweight model engineered for productivity applications and high-speed enterprise integrations. Optimizes RAG performance and API tool integrations.

Context128k
Top BenchmarkHELLASWAG: 78%
Input Price / M$0.15
Output Price / M$0.60
xAI

Grok 4.20

active

xAI's flagship reasoning model with real-time X platform data integration. Exceptionally strong in physics, mathematics, and advanced code synthesis.

Context2,000k
Top BenchmarkHELLASWAG: 98.8%
Input Price / M$1.25
Output Price / M$2.50
xAI

Grok 4.3

active

xAI's high-performance model featuring world-class coding capabilities, real-time web access, and mathematical proof-solving.

Context1,000k
Top BenchmarkHELLASWAG: 97.5%
Input Price / M$1.25
Output Price / M$2.50
DeepSeek

DeepSeek V4 Pro

active

A state-of-the-art Mixture of Experts (MoE) model featuring 671B parameters. Offers performance comparable to top-tier commercial models at a fraction of the inference cost.

Context1,048.576k
Top BenchmarkHELLASWAG: 98.6%
Input Price / M$0.43
Output Price / M$0.87
DeepSeek

DeepSeek V4 Flash

active

DeepSeek's latest lightweight and ultra-cost-efficient model for fast, high-frequency tasks.

Context1,048.576k
Top BenchmarkHELLASWAG: 95.5%
Input Price / M$0.10
Output Price / M$0.20
DeepSeek

DeepSeek R1

active

A premier reasoning model employing large-scale reinforcement learning. Displays specialized math, coding, and logical validation capabilities comparable to OpenAI's o1.

Context163.84k
Top BenchmarkHELLASWAG: 94%
Input Price / M$0.70
Output Price / M$2.50
Anthropic

Claude 4.7 Opus

active

Anthropic's highly capable Opus-tier reasoning model with superior performance on complex multi-step tasks, autonomous coding agents, and scientific analysis.

Context1,000k
Top BenchmarkHELLASWAG: 98.8%
Input Price / M$5.00
Output Price / M$25.00
Anthropic

Claude 4.6 Opus

active

The Opus-tier variant of Claude 4.6, designed for the most demanding enterprise agentic workflows requiring deep analytical reasoning and long-context synthesis.

Context1,000k
Top BenchmarkHELLASWAG: 97.5%
Input Price / M$5.00
Output Price / M$25.00
Anthropic

Claude 4.5 Sonnet

active

High-performance Sonnet model excelling at nuanced writing, coding, and agentic tasks. Offers extended 1M-token context for large-document workflows.

Context1,000k
Top BenchmarkHELLASWAG: 95.8%
Input Price / M$3.00
Output Price / M$15.00
Anthropic

Claude 4.5 Opus

active

Anthropic's flagship model from the 4.5 generation, delivering exceptional reasoning and complex problem-solving at the frontier of AI capabilities.

Context200k
Top BenchmarkHELLASWAG: 97.2%
Input Price / M$5.00
Output Price / M$25.00
Anthropic

Claude 3.5 Sonnet

active

Anthropic's highly popular previous-generation model, widely praised for coding, complex reasoning, and high-quality writing.

Context200k
Top BenchmarkHELLASWAG: 95.4%
Input Price / M$3.00
Output Price / M$15.00
Anthropic

Claude 3.5 Haiku

active

A fast and efficient model from the Claude 3.5 family, offering high performance for coding and text processing tasks.

Context200k
Top BenchmarkHELLASWAG: 89.2%
Input Price / M$0.80
Output Price / M$4.00
Anthropic

Claude 3 Opus

active

Anthropic's powerful model from the Claude 3 family, engineered for deep analysis, research, and complex task execution.

Context200k
Top BenchmarkHELLASWAG: 95%
Input Price / M$15.00
Output Price / M$75.00
Anthropic

Claude Sonnet 4

active

The next-generation Sonnet model featuring a 1M-token context window. Well-suited for enterprise coding, document understanding, and complex instruction-following.

Context1,000k
Top BenchmarkHELLASWAG: 95%
Input Price / M$3.00
Output Price / M$15.00
Anthropic

Claude Opus 4

active

Anthropic's first Claude 4-generation flagship, offering deep expert-level reasoning, complex writing, and high-performance code synthesis across long contexts.

Context200k
Top BenchmarkHELLASWAG: 96.5%
Input Price / M$15.00
Output Price / M$75.00

Missing a model you use in production? Suggest a model →