Only curated links

AI Coding Benchmark Directory

A handpicked collection of benchmark sites for comparing AI models, coding agents and real-world performance.

Benchmark

AI Coding Models

LLM Benchmarks 2026 - Compare AI Benchmarks and Tests

Explore LLM benchmarks and AI benchmarks to compare models across reasoning, coding, math, and more independently verified.

Visit llm-stats.com ->

openrouter.ai

LLM Rankings | OpenRouter

LLM rankings and leaderboard based on real usage data from millions of users. See which AI models developers actually use.

Visit openrouter.ai ->

artificialanalysis.ai

Coding Index | Artificial Analysis

Compare AI model performance on Coding Index. Evaluates models' ability to solve programming problems, including those requiring scientific and research domain knowledge.

Visit artificialanalysis.ai ->

Benchmark

AI Coding Agents

artificialanalysis.ai

Coding Agents Comparison: Cursor, Claude Code, GitHub Copilot, and more

Comprehensive comparison of AI coding agents including Cursor, GitHub Copilot, Cline, Continue, and more. Compare IDE extensions, proprietary IDEs, CLI tools, and cloud platforms to find the best coding assistant for your development workflow.

Visit artificialanalysis.ai ->

prarena.ai

PR Arena - AI Coding Agent Leaderboard

Explore benchmark and evaluation details from prarena.ai in a focused external resource.

Visit prarena.ai ->