How does GPT-5.5 compare with other LLMs?

AskClash compares GPT-5.5 against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for GPT-5.5?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Proprietary

GPT-5.5 benchmarks, pricing, and LLM comparison.

Compare GPT-5.5 vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Open live LLM leaderboard Open in app

Rank #2AskClash overall score: 91.4

$5.00 / $30.0Input and output token price, when published. Context: 1M.

API/OAuthBilling and access path cached for this model row.

GPT-5.5 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall91.4

Benchmark cells12

Context1M

CreatorOpenAI

GPT-5.5 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE

52.2 score

GPQA

93.6 score

SWE-Pro

58.6 score

Terminal-Bench

84.1 score

OSWorld

78.7 score

MCP Atlas

75.3 score

Finance Agent

51.8 score

MMMU-Pro

83.2 score

ARC-AGI 2

85.0 score

Tau2

98.0 score

GPT-5.5 vs other AI models

Use these comparison links to evaluate GPT-5.5 against nearby LLMs by benchmark score, price, context window, and provider.

GPT-5.5 vs GPT-5.5 xHigh GPT-5.5 vs Claude Opus 4.7 (Adaptive)GPT-5.5 vs Gemini 3.5 Flash GPT-5.5 vs GPT-5.4 GPT-5.5 vs Claude Mythos Preview GPT-5.5 vs Claude Opus 4.7 GPT-5.5 vs Qwen3.7 Max GPT-5.5 vs Claude Opus 4.6

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

OpenAI locks GPT-5.5-Cyber behind velvet rope despite slamming Anthropic for doing exactly that

OpenAI locks GPT-5.5-Cyber behind velvet rope • The Register The Register Home Page Search Search The Register Navigation Topics Security All Security Cyber-crime Patches Research CSO Off-Prem All Off-Prem Edge + IoT Channel PaaS + IaaS SaaS On-Prem All On-Prem Systems Storage Networks HPC Personal Tech Cx0 Public Sector Software All Software AI + ML Applications Databases DevOps OSes Virtualization Offbeat All Offbeat Columnists Science Geek's Guide BOFH Legal Bootnotes Site News About Us More

Our evaluation of OpenAI's GPT-5.5 cyber capabilities

Our evaluation of OpenAI's GPT-5.5 cyber capabilities Simon Willison’s Weblog Subscribe 30th April 2026 - Link Blog Our evaluation of OpenAI's GPT-5.5 cyber capabilities . The UK's AI Security Institute previously evaluated Claude Mythos : now they've evaluated GPT-5.5 for finding security vulnerability and found it to be comparable to Mythos, but unlike Mythos it's generally available right now. Posted 30th April 2026 at 11:03 pm Recent articles LLM 0.32a0 is a major backwards-compatible refact

[AINews] GPT 5.5 and OpenAI Codex Superapp

Latent Space

[AINews] OpenAI GPT-next disproves 80 year old Erdős planar unit distance problem for under $1000

Latent Space

Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.