How do Composer 2.5 and Claude Opus 4.6 (Adaptive) compare on coding benchmarks?

The comparison table shows SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, LiveCodeBench, and coding agent index scores for both Composer 2.5 and Claude Opus 4.6 (Adaptive) when publicly disclosed.

LLM Comparison

Composer 2.5 vs Claude Opus 4.6 (Adaptive): benchmark scores, pricing & comparison.

Q: Which is better: Composer 2.5 or Claude Opus 4.6 (Adaptive)?

AskClash compares Composer 2.5 and Claude Opus 4.6 (Adaptive) side by side across SWE-bench, GPQA, HLE, Terminal-Bench, coding agent scores, token pricing, and context window so you can see which model wins on each benchmark.

Side-by-side Composer 2.5 vs Claude Opus 4.6 (Adaptive) comparison across SWE-bench, GPQA, HLE, Terminal-Bench, coding agent scores, token pricing, context window, and AskClash RWT. Green marks the winner on each benchmark.

Open live leaderboard Composer 2.5 model page Claude Opus 4.6 (Adaptive) model page

Rank #10 vs #11AskClash overall scores 67.0 vs 66.8.

Pricing $0.50/$2.50 vs $5.00/$25.0Input and output token prices per 1M tokens when published.

Proprietary vs ProprietaryCursor vs Anthropic.

Composer 2.5 vs Claude Opus 4.6 (Adaptive) benchmark comparison

Green cells highlight the winning model for each metric. Scores are cached from the AskClash LLM leaderboard snapshot.

Metric	Composer 2.5	Claude Opus 4.6 (Adaptive)
Overall Score	67.0	66.8
Leaderboard Rank	#10	#11
RWT	8.5	8.0
Coding Agent Index	51.8	71.1
HLE	—	53.0
GPQA	—	91.3
MATH-500	—	99.8
SWE-bench	—	80.8
SWE-Pro	47.0	—
SWE-Atlas	72.0	—
Terminal-Bench	69.3	65.4
OSWorld	—	72.7
MCP Atlas	—	59.5
Finance Agent	—	60.7
CharXiv	—	77.4
MMMU-Pro	—	77.3
ARC-AGI 2	—	68.8
Tau2	—	99.3
MRCR	—	76.0
Input Price (per 1M tokens)	$0.50	$5.00
Output Price (per 1M tokens)	$2.50	$25.0
Context Window	200K	1M
Benchmark Cells	4	14

More Composer 2.5 and Claude Opus 4.6 (Adaptive) comparisons

Explore how Composer 2.5 and Claude Opus 4.6 (Adaptive) stack up against other top-ranked LLMs.

Claude Mythos/Fable 5 vs Composer 2.5 Claude Mythos/Fable 5 vs Claude Opus 4.6 (Adaptive)Claude Opus 4.8 (Adaptive) vs Composer 2.5 Claude Opus 4.8 (Adaptive) vs Claude Opus 4.6 (Adaptive)GPT-5.5 xHigh vs Composer 2.5 GPT-5.5 xHigh vs Claude Opus 4.6 (Adaptive)GLM-5.2 vs Composer 2.5 GLM-5.2 vs Claude Opus 4.6 (Adaptive)Claude Opus 4.7 (Adaptive) vs Composer 2.5 Claude Opus 4.7 (Adaptive) vs Claude Opus 4.6 (Adaptive)

How to read this comparison

Benchmark scores

Higher is better for all benchmark scores (SWE-bench, GPQA, HLE, Terminal-Bench, etc.). Green marks the model with the higher score.

Token pricing

Lower is better for input and output prices. Green marks the cheaper model per 1M tokens.

Coverage matters

Models with fewer disclosed benchmark cells may have inflated percentile scores. Check the benchmark cell count for context.

This comparison page is generated from the AskClash LLM leaderboard cache. Open the live leaderboard for real-time scores and interactive filtering.