How does Claude Opus 4.6 (Adaptive) compare with other LLMs?

AskClash compares Claude Opus 4.6 (Adaptive) against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for Claude Opus 4.6 (Adaptive)?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Proprietary

Claude Opus 4.6 (Adaptive) benchmarks, pricing, and LLM comparison.

Compare Claude Opus 4.6 (Adaptive) vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Open live LLM leaderboard Open in app

Rank #21AskClash overall score: 47.6

$5.00 / $25.0Input and output token price, when published. Context: 1M.

API/OAuthBilling and access path cached for this model row.

Claude Opus 4.6 (Adaptive) benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall47.6

Benchmark cells9

Context1M

CreatorAnthropic

Claude Opus 4.6 (Adaptive) public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE

40.0 score

GPQA

84.0 score

MATH-500

89.2 score

SWE-bench

80.8 score

Terminal-Bench

65.4 score

OSWorld

72.7 score

ARC-AGI 2

68.8 score

Tau2

92.1 score

Claude Opus 4.6 (Adaptive) vs other AI models

Use these comparison links to evaluate Claude Opus 4.6 (Adaptive) against nearby LLMs by benchmark score, price, context window, and provider.

Claude Opus 4.6 (Adaptive) vs GPT-5.5 xHigh Claude Opus 4.6 (Adaptive) vs GPT-5.5 Claude Opus 4.6 (Adaptive) vs Claude Opus 4.7 (Adaptive)Claude Opus 4.6 (Adaptive) vs Gemini 3.5 Flash Claude Opus 4.6 (Adaptive) vs GPT-5.4 Claude Opus 4.6 (Adaptive) vs Claude Mythos Preview Claude Opus 4.6 (Adaptive) vs Claude Opus 4.7 Claude Opus 4.6 (Adaptive) vs Qwen3.7 Max

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP Source: arXiv Logic / Formal Methods URL: https://arxiv.org/abs/2603.20405

Opus 4.6, Codex 5.3, and the post-benchmark era

Nathan Lambert - Interconnects

[AINews] Moonshot Kimi K2.6: the world's leading Open Model refreshes to catch up to Opus 4.6 (ahead of DeepSeek v4?)

Latent Space

Claude Token Counter, now with model comparisons

Claude Token Counter, now with model comparisons Simon Willison’s Weblog Subscribe Sponsored by: Honeycomb — AI agents behave unpredictably. Get the context you need to debug what actually happened. Read the blog 20th April 2026 - Link Blog Claude Token Counter, now with model comparisons . I upgraded my Claude Token Counter tool to add the ability to run the same count against different models in order to compare them. As far as I can tell Claude Opus 4.7 is the first model to change the tokeni

Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.