How does GPT-5.2 compare with other LLMs?

AskClash compares GPT-5.2 against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for GPT-5.2?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Proprietary

GPT-5.2 benchmarks, pricing, and LLM comparison.

Compare GPT-5.2 vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #19. AskClash overall score: 49.2

$1.75 / $14.0: input and output token price, when published. Context: 400K.

API/OAuth: billing and access path cached for this model row.
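As a rough illustration of how the two prices translate into a per-request cost, here is a minimal Python sketch. The page does not state the pricing unit; the sketch assumes the figures are US dollars per one million tokens, and the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Prices as listed on the page. ASSUMPTION: the unit is USD per 1M tokens;
# the page itself does not state the unit.
INPUT_PRICE_PER_M = Decimal("1.75")
OUTPUT_PRICE_PER_M = Decimal("14.0")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request under the assumed pricing unit."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_UNIT * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_UNIT * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
example_cost = estimate_cost(10_000, 2_000)
```

Under that assumption, output tokens cost eight times as much as input tokens, so long completions tend to dominate the bill even when prompts are large.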

GPT-5.2 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall: 49.2

Benchmark cells: 10

Context: 400K

Creator: OpenAI
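The scoring idea described above (a weighted percentile across benchmark cells, scaled down when coverage is missing) can be sketched in Python. This is not AskClash's actual formula, which the page does not publish. The percentile definition, the equal weights, the linear coverage penalty, and the model and benchmark names are all assumptions chosen for illustration.

```python
from typing import Dict, List

# model -> benchmark -> score. Illustrative, made-up rows, not real leaderboard data.
scores: Dict[str, Dict[str, float]] = {
    "model_a": {"bench_x": 90.0, "bench_y": 80.0},
    "model_b": {"bench_x": 95.0},  # narrow row: only one benchmark cell
    "model_c": {"bench_x": 70.0, "bench_y": 60.0},
}

# Hypothetical benchmark weights.
weights: Dict[str, float] = {"bench_x": 1.0, "bench_y": 1.0}


def percentile(value: float, population: List[float]) -> float:
    """Percentage of population scores that are less than or equal to value."""
    at_or_below = sum(1 for v in population if v <= value)
    return 100.0 * at_or_below / len(population)


def overall_score(model: str,
                  scores: Dict[str, Dict[str, float]],
                  weights: Dict[str, float]) -> float:
    """Weighted mean percentile, multiplied by the share of weight covered."""
    weighted_sum = 0.0
    covered_weight = 0.0
    for bench, weight in weights.items():
        value = scores[model].get(bench)
        if value is None:
            continue  # missing cell: contributes nothing and lowers coverage
        population = [row[bench] for row in scores.values()
                      if row.get(bench) is not None]
        weighted_sum += weight * percentile(value, population)
        covered_weight += weight
    if covered_weight == 0:
        return 0.0
    mean_percentile = weighted_sum / covered_weight
    coverage = covered_weight / sum(weights.values())  # assumed linear penalty
    return mean_percentile * coverage
```

In this toy data, `model_b` tops the only benchmark it reports but covers half of the total weight, so its score is halved and it ranks below `model_a`. That is the behavior the description attributes to the coverage penalty.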

GPT-5.2 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

GPQA: 92.4
MATH-500: 98.0
SWE-bench: 80.0
SWE-Pro: 55.6
Terminal-Bench: 62.2
OSWorld: 47.3
CharXiv: 82.1
MMMU-Pro: 79.5
ARC-AGI-2: 52.9
Tau2: 84.8

GPT-5.2 vs other AI models

Use these comparison links to evaluate GPT-5.2 against nearby LLMs by benchmark score, price, context window, and provider.

GPT-5.2 vs GPT-5.5 xHigh
GPT-5.2 vs GPT-5.5
GPT-5.2 vs Claude Opus 4.7 (Adaptive)
GPT-5.2 vs Gemini 3.5 Flash
GPT-5.2 vs GPT-5.4
GPT-5.2 vs Claude Mythos Preview
GPT-5.2 vs Claude Opus 4.7
GPT-5.2 vs Qwen3.7 Max

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

OpenAI locks GPT-5.5-Cyber behind velvet rope despite slamming Anthropic for doing exactly that

The Register

Our evaluation of OpenAI's GPT-5.5 cyber capabilities

Simon Willison’s Weblog, 30th April 2026. The UK's AI Security Institute previously evaluated Claude Mythos; now they've evaluated GPT-5.5 for finding security vulnerabilities and found it to be comparable to Mythos, but unlike Mythos it's generally available right now.

[AINews] OpenAI GPT-next disproves 80 year old Erdős planar unit distance problem for under $1000

Latent Space

[AINews] GPT 5.5 and OpenAI Codex Superapp

Latent Space

Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.