AgMoDB
AgMoDB by @mistakeknot

Find the right AI model

Select your use case and we'll rank the best options for your needs.

716 models tracked across 187 benchmarks. Updated daily.

Coding

Software development & debugging

Research / Data Analysis

Academic research & data science

Creative Writing

Stories, marketing & content

General Assistant

Chat, Q&A & summarization

Agentic Tasks

Tool use & multi-step workflows

Not sure / Explore all

See the best across all categories



Best Models by Category


Top-rated for Coding

Ranked by AgMoBench overall score (observed data only)

1. Gemini 3.1 Pro Preview
Google | $4.50/M | AgMoBench: 77.6
Also top for: Research, Creative Writing, General Use, Agentic Tasks

2. Claude Opus 4.6 (Non-reasoning, High Effort)
Anthropic | $10.00/M | AgMoBench: 75.0
Also top for: Research, Creative Writing, General Use, Agentic Tasks

3. Claude Sonnet 4.6 (Non-reasoning, High Effort)
Anthropic | $6.00/M | AgMoBench: 74.0
Also top for: Research, Creative Writing, General Use, Agentic Tasks

Top-rated for Research

Ranked by AgMoBench overall score (observed data only)

1. Gemini 3.1 Pro Preview
Google | $4.50/M | AgMoBench: 77.6
Also top for: Coding, Creative Writing, General Use, Agentic Tasks

2. Claude Opus 4.6 (Non-reasoning, High Effort)
Anthropic | $10.00/M | AgMoBench: 75.0
Also top for: Coding, Creative Writing, General Use, Agentic Tasks

3. Claude Sonnet 4.6 (Non-reasoning, High Effort)
Anthropic | $6.00/M | AgMoBench: 74.0
Also top for: Coding, Creative Writing, General Use, Agentic Tasks

Top-rated for Creative Writing

Ranked by AgMoBench overall score (observed data only)

1. Gemini 3.1 Pro Preview
Google | $4.50/M | AgMoBench: 77.6
Also top for: Coding, Research, General Use, Agentic Tasks

2. Claude Opus 4.6 (Non-reasoning, High Effort)
Anthropic | $10.00/M | AgMoBench: 75.0
Also top for: Coding, Research, General Use, Agentic Tasks

3. Claude Sonnet 4.6 (Non-reasoning, High Effort)
Anthropic | $6.00/M | AgMoBench: 74.0
Also top for: Coding, Research, General Use, Agentic Tasks

Top-rated for General Use

Ranked by AgMoBench overall score (observed data only)

1. Gemini 3.1 Pro Preview
Google | $4.50/M | AgMoBench: 77.6
Also top for: Coding, Research, Creative Writing, Agentic Tasks

2. Claude Opus 4.6 (Non-reasoning, High Effort)
Anthropic | $10.00/M | AgMoBench: 75.0
Also top for: Coding, Research, Creative Writing, Agentic Tasks

3. Claude Sonnet 4.6 (Non-reasoning, High Effort)
Anthropic | $6.00/M | AgMoBench: 74.0
Also top for: Coding, Research, Creative Writing, Agentic Tasks

Top-rated for Agentic Tasks

Ranked by AgMoBench overall score (observed data only)

1. Gemini 3.1 Pro Preview
Google | $4.50/M | AgMoBench: 77.6
Also top for: Coding, Research, Creative Writing, General Use

2. Claude Opus 4.6 (Non-reasoning, High Effort)
Anthropic | $10.00/M | AgMoBench: 75.0
Also top for: Coding, Research, Creative Writing, General Use

3. Claude Sonnet 4.6 (Non-reasoning, High Effort)
Anthropic | $6.00/M | AgMoBench: 74.0
Also top for: Coding, Research, Creative Writing, General Use

Best Value

Best quality-to-price ratio (observed data only)

1. gpt-oss-20B (high)
OpenAI | $0.09/M | AgMoBench: 35.3

2. Gemma 3n E4B Instruct
Google | $0.03/M | AgMoBench: 8.8

3. gpt-oss-20B (low)
OpenAI | $0.09/M | AgMoBench: 29.1
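
For readers who want to compare value themselves, the following is a minimal sketch of one possible quality-to-price measure: AgMoBench score divided by the listed price per million tokens. The formula AgMoDB actually uses is not stated on this page, and this naive ratio does not reproduce the listed order of ranks 2 and 3, so treat it as illustrative only. The names `Model`, `value_ratio`, and `rank_by_value` are hypothetical; the prices and scores are the three entries listed above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    price_per_m: float  # USD per million tokens, as listed on the page
    score: float        # AgMoBench overall score, as listed on the page


def value_ratio(model: Model) -> float:
    """Naive quality-to-price ratio (an assumption, not the site's formula)."""
    if model.price_per_m <= 0:
        raise ValueError(f"no usable price for {model.name}")
    return model.score / model.price_per_m


def rank_by_value(models: list[Model]) -> list[Model]:
    """Sort by descending ratio, skipping entries without a positive price."""
    priced = [m for m in models if m.price_per_m > 0]
    return sorted(priced, key=value_ratio, reverse=True)


# The three "Best Value" entries shown above.
MODELS = [
    Model("gpt-oss-20B (high)", 0.09, 35.3),
    Model("Gemma 3n E4B Instruct", 0.03, 8.8),
    Model("gpt-oss-20B (low)", 0.09, 29.1),
]

ranked = rank_by_value(MODELS)

if __name__ == "__main__":
    for position, model in enumerate(ranked, start=1):
        print(f"{position}. {model.name}: {value_ratio(model):.1f} points per $/M")
```

Entries listed at $0.00/M are skipped rather than treated as infinitely good value, on the assumption that a zero price more likely means missing pricing data than a free model.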

Fastest

Highest output tokens per second

1. Mercury 2
Inception | $0.38/M | Speed: 927 tok/s

2. Granite 4.0 H Small
IBM | $0.11/M | Speed: 546 tok/s

3. Gemini 2.5 Flash-Lite (Non-reasoning)
Google | $0.17/M | Speed: 374 tok/s

Recently Updated


Upstage

Solar Pro 3

$0.00/M | 0 tok/s

Google

Gemma 4 E4B (Non-reasoning)

$0.00/M | 0 tok/s

Google

Gemma 4 E4B

21.1
$0.00/M | 0 tok/s

Google

Gemma 4 31B (Reasoning)

$0.00/M | 35 tok/s

Google

Gemma 4 31B (Non-reasoning)

$0.00/M | 0 tok/s

Google

Gemma 4 26B A4B (Reasoning)

$0.20/M | 0 tok/s