LLM Leaderboard · Proprietary

GPT-5.5 xHigh benchmarks, pricing, and LLM comparison.

Compare GPT-5.5 xHigh vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #1AskClash overall score: 91.4
$5.00 / $30.0Input and output token price, when published. Context: 1M.
API/OAuthBilling and access path cached for this model row.

GPT-5.5 xHigh benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall91.4
Benchmark cells14
Context1M
CreatorOpenAI

GPT-5.5 xHigh public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE

52.2 score

GPQA

93.6 score

SWE-bench

88.7 score

SWE-Pro

58.6 score

Terminal-Bench

84.1 score

OSWorld

78.7 score

MCP Atlas

75.3 score

Finance Agent

51.8 score

MMMU-Pro

83.2 score

ARC-AGI 2

85.0 score

Tau2

98.0 score

GPT-5.5 xHigh vs other AI models

Use these comparison links to evaluate GPT-5.5 xHigh against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

OpenAI locks GPT-5.5-Cyber behind velvet rope despite slamming Anthropic for doing exactly that

OpenAI locks GPT-5.5-Cyber behind velvet rope • The Register The Register Home Page Search Search The Register Navigation Topics Security All Security Cyber-crime Patches Research CSO Off-Prem All Off-Prem Edge + IoT Channel PaaS + IaaS SaaS On-Prem All On-Prem Systems Storage Networks HPC Personal Tech Cx0 Public Sector Software All Software AI + ML Applications Databases DevOps OSes Virtualization Offbeat All Offbeat Columnists Science Geek's Guide BOFH Legal Bootnotes Site News About Us More

Our evaluation of OpenAI's GPT-5.5 cyber capabilities

Our evaluation of OpenAI's GPT-5.5 cyber capabilities Simon Willison’s Weblog Subscribe 30th April 2026 - Link Blog Our evaluation of OpenAI's GPT-5.5 cyber capabilities . The UK's AI Security Institute previously evaluated Claude Mythos : now they've evaluated GPT-5.5 for finding security vulnerability and found it to be comparable to Mythos, but unlike Mythos it's generally available right now. Posted 30th April 2026 at 11:03 pm Recent articles LLM 0.32a0 is a major backwards-compatible refact

GPT-5.5 may burn fewer tokens, but it always burns more cash

GPT-5.5 may burn fewer tokens, but it always burns more cash .bg-secondary.op-bg_20 .bg-secondary.op-bg_40 .bg-secondary.op-bg_60 .bg-secondary.op-bg_80 .bg-tertiary.op-bg_20 .bg-tertiary.op-bg_40 .bg-tertiary.op-bg_60 .bg-tertiary.op-bg_80 .bg-quaternary.op-bg_20 .bg-quaternary.op-bg_40 .bg-quaternary.op-bg_60 .bg-quaternary.op-bg_80 .bg-quinary.op-bg_20 .bg-quinary.op-bg_40 .bg-quinary.op-bg_60 .bg-quinary.op-bg_80 .bg-senary.op-bg_20 .bg-senary.op-bg_40 .bg-senary.op-bg_60 .bg-senary.op-bg_80

GPT-5.5 prompting guide

GPT-5.5 prompting guide Simon Willison’s Weblog Subscribe Sponsored by: Sonar — Now with SAST + SCA for secure, dependency-aware Agentic Engineering. SonarQube Advanced Security 25th April 2026 - Link Blog GPT-5.5 prompting guide . Now that GPT-5.5 is available in the API , OpenAI have released a wealth of useful tips on how best to prompt the new model. Here's a neat trick they recommend for applications that might spend considerable time thinking before returning a user-visible response: Befor

Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.