How does Claude Opus 4.6 compare with other LLMs?

AskClash compares Claude Opus 4.6 against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for Claude Opus 4.6?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Proprietary

Claude Opus 4.6 benchmarks, pricing, and LLM comparison.

Compare Claude Opus 4.6 with GPT, other Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.


Rank #9 · AskClash overall score: 62.7

$5.00 / $25.00 · Input and output token price, when published. Context: 1M.

API/OAuth · Billing and access path cached for this model row.
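As a rough illustration of how the listed input and output prices translate into a per-request cost, here is a minimal sketch. It assumes the $5.00 and $25.00 figures are quoted in USD per one million tokens, which the page does not state; the function name `request_cost_usd` and the token counts are hypothetical.

```python
def request_cost_usd(input_tokens, output_tokens,
                     input_price=5.00, output_price=25.00,
                     tokens_per_unit=1_000_000):
    """Estimate the cost of one request in USD.

    Assumption: prices are per `tokens_per_unit` tokens (one million here).
    The page lists the prices but not the unit, so adjust if it differs.
    """
    total = input_tokens * input_price + output_tokens * output_price
    return total / tokens_per_unit


# Hypothetical request: 200,000 input tokens and 10,000 output tokens.
example_cost = request_cost_usd(200_000, 10_000)
print(f"Estimated cost: ${example_cost:.2f}")
```

Under that per-million assumption, output tokens cost five times as much as input tokens, so long generations tend to drive the bill more than long prompts do.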

Claude Opus 4.6 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
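The exact AskClash formula is not published on this page, so the following is only a sketch of one way a weighted percentile score with a coverage penalty could work. The weights, the percentile definition, the linear coverage penalty, and the names `percentile_rank` and `overall_score` are all assumptions made for illustration, and the data is invented.

```python
from bisect import bisect_right


def percentile_rank(value, population):
    """Share of population scores at or below value, on a 0-100 scale."""
    ordered = sorted(population)
    return 100.0 * bisect_right(ordered, value) / len(ordered)


def overall_score(model_cells, leaderboard_cells, weights):
    """Weighted mean percentile over covered benchmarks, scaled by coverage.

    model_cells: {benchmark: score} for one model (missing key = no data)
    leaderboard_cells: {benchmark: [scores of all models with that cell]}
    weights: {benchmark: weight}; hypothetical values, not AskClash's own
    """
    total_weight = sum(weights.values())
    covered_weight = 0.0
    weighted_sum = 0.0
    for benchmark, weight in weights.items():
        if benchmark not in model_cells:
            continue  # a missing cell adds nothing and lowers coverage
        pct = percentile_rank(model_cells[benchmark], leaderboard_cells[benchmark])
        weighted_sum += weight * pct
        covered_weight += weight
    if covered_weight == 0:
        return 0.0
    mean_percentile = weighted_sum / covered_weight
    coverage = covered_weight / total_weight  # assumed linear penalty
    return mean_percentile * coverage


# Toy data, invented for illustration only.
weights = {"GPQA": 1.0, "SWE-bench": 1.0}
leaderboard = {"GPQA": [50.0, 70.0, 90.0], "SWE-bench": [40.0, 60.0, 80.0]}
well_measured = overall_score({"GPQA": 70.0, "SWE-bench": 60.0}, leaderboard, weights)
narrow_row = overall_score({"GPQA": 90.0}, leaderboard, weights)
print(well_measured, narrow_row)
```

In this toy setup the model with a single top score ends up below the model with two mid-range scores, which is the behavior the description attributes to the coverage penalty.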

Overall: 62.7

Benchmark cells: 13

Context: 1M

Creator: Anthropic

Claude Opus 4.6 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE: 53.0
GPQA: 91.3
MATH-500: 98.0
SWE-bench: 80.8
SWE-Pro: 11.8
Terminal-Bench: 70.2
OSWorld: 72.7
MMMU-Pro: 77.3
ARC-AGI-2: 68.8
Tau2: 84.8

Claude Opus 4.6 vs other AI models

Use these comparison links to evaluate Claude Opus 4.6 against nearby LLMs by benchmark score, price, context window, and provider.

Claude Opus 4.6 vs GPT-5.5 xHigh
Claude Opus 4.6 vs GPT-5.5
Claude Opus 4.6 vs Claude Opus 4.7 (Adaptive)
Claude Opus 4.6 vs Gemini 3.5 Flash
Claude Opus 4.6 vs GPT-5.4
Claude Opus 4.6 vs Claude Mythos Preview
Claude Opus 4.6 vs Claude Opus 4.7
Claude Opus 4.6 vs Qwen3.7 Max

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.