AI Leaderboard · Proprietary

Claude Opus 4.6 benchmarks, pricing, and ranking.

Rank #8AskClash overall score: 63.2

$5.00 / $25.0Input and output token price, when published. Context: 1M.

API/OAuthBilling and access path cached for this model row.

Claude Opus 4.6 benchmark snapshot

AskClash combines public benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall63.2

Benchmark cells13

Context1M

CreatorAnthropic

Public benchmark scores

Only benchmark columns with cached public values are shown here. Missing cells remain blank in the live table.

HLE

53.0 score

GPQA

91.3 score

MATH-500

98.0 score

SWE-bench

80.8 score

SWE-Pro

11.8 score

Terminal-Bench

70.2 score

OSWorld

72.7 score

MMMU-Pro

77.3 score

ARC-AGI 2

68.8 score

Tau2

84.8 score

Related model comparisons

Use these links to compare nearby frontier and open-weight models from the same AI leaderboard data.

GPT-5.5 GPT-5.5 xHigh Claude Opus 4.7 (Adaptive)GPT-5.4 Gemini 3.5 Flash Claude Opus 4.7 Qwen3.7 Max

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, or market context around this model.

[AINews] Anthropic Claude Opus 4.7 - literally one step better than 4.6 in every dimension

Latent Space

Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP Source: arXiv Logic / Formal Methods URL: https://arxiv.org/abs/2603.20405

Opus 4.6, Codex 5.3, and the post-benchmark era

Nathan Lambert - Interconnects

[AINews] Moonshot Kimi K2.6: the world's leading Open Model refreshes to catch up to Opus 4.6 (ahead of DeepSeek v4?)

Latent Space

Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash AI Leaderboard cache and linked from the live leaderboard.