LLM Comparison

Claude Opus 4.8 (Adaptive) vs Gemini 3.5 Flash High: benchmark scores, pricing & comparison.

Side-by-side Claude Opus 4.8 (Adaptive) vs Gemini 3.5 Flash High comparison across SWE-bench, GPQA, HLE, Terminal-Bench, coding agent scores, token pricing, context window, and AskClash RWT. Green marks the winner on each benchmark.

Rank #2 vs #15AskClash overall scores 89.5 vs 59.5.
Pricing $5.00/$25.0 vs $1.50/$9.00Input and output token prices per 1M tokens when published.
Proprietary vs ProprietaryAnthropic vs Google.

Claude Opus 4.8 (Adaptive) vs Gemini 3.5 Flash High benchmark comparison

Green cells highlight the winning model for each metric. Scores are cached from the AskClash LLM leaderboard snapshot.

MetricClaude Opus 4.8 (Adaptive)Gemini 3.5 Flash High
Overall Score89.559.5
Leaderboard Rank#2#15
RWT9.57.0
HLE57.940.2
GPQA93.692.2
IFEval62.276.3
SWE-bench88.6
SWE-Pro69.255.1
Terminal-Bench74.676.2
OSWorld83.478.4
MCP Atlas82.283.6
Finance Agent53.957.9
CharXiv89.984.2
MMMU-Pro83.6
ARC-AGI 272.1
Tau294.495.3
MRCR77.3
Input Price (per 1M tokens)$5.00$1.50
Output Price (per 1M tokens)$25.0$9.00
Context Window1M1M
Benchmark Cells1314

More Claude Opus 4.8 (Adaptive) and Gemini 3.5 Flash High comparisons

Explore how Claude Opus 4.8 (Adaptive) and Gemini 3.5 Flash High stack up against other top-ranked LLMs.

How to read this comparison

Benchmark scores

Higher is better for all benchmark scores (SWE-bench, GPQA, HLE, Terminal-Bench, etc.). Green marks the model with the higher score.

Token pricing

Lower is better for input and output prices. Green marks the cheaper model per 1M tokens.

Coverage matters

Models with fewer disclosed benchmark cells may have inflated percentile scores. Check the benchmark cell count for context.

This comparison page is generated from the AskClash LLM leaderboard cache. Open the live leaderboard for real-time scores and interactive filtering.