How does GPT-5.4 xHigh compare with other LLMs?

AskClash compares GPT-5.4 xHigh against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for GPT-5.4 xHigh?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Proprietary

GPT-5.4 xHigh benchmarks, pricing, and LLM comparison.

Compare GPT-5.4 xHigh vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.


Rank #7. AskClash overall score: 73.5

$2.50 / $15.0: Input and output token price, when published. Context: 1.05M.

API/OAuth: Billing and access path cached for this model row.
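As a rough illustration of how the listed token prices translate into a per-request cost, here is a minimal sketch. It assumes the $2.50 and $15.0 figures are USD per one million input and output tokens respectively; the page does not state the unit, so treat that as an assumption. The function name `estimate_cost_usd` is hypothetical.

```python
# Assumption: prices are USD per 1,000,000 tokens (the unit is not stated on the page).
INPUT_USD_PER_MTOK = 2.50
OUTPUT_USD_PER_MTOK = 15.0
TOKENS_PER_MTOK = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its input and output token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / TOKENS_PER_MTOK * INPUT_USD_PER_MTOK
    output_cost = output_tokens / TOKENS_PER_MTOK * OUTPUT_USD_PER_MTOK
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 200k tokens of prompt, 10k tokens of completion.
    print(f"{estimate_cost_usd(200_000, 10_000):.2f} USD")
```

Under this assumption, output tokens cost six times as much as input tokens, so long completions tend to dominate the bill even when the prompt is large.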

GPT-5.4 xHigh benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
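The exact AskClash formula is not published on this page, so the following is only a sketch of one way a weighted percentile score with a coverage penalty could work. The mid-rank percentile, the linear penalty (multiplying by the covered share of total weight), and all names (`percentile_rank`, `overall_score`, the benchmark weights) are assumptions made for illustration.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(value: float, population: list[float]) -> float:
    """Mid-rank percentile (0-100) of value within population."""
    ordered = sorted(population)
    below = bisect_left(ordered, value)
    at_or_below = bisect_right(ordered, value)
    return 100.0 * (below + at_or_below) / (2 * len(ordered))


def overall_score(
    model_cells: dict[str, float],
    field_cells: dict[str, list[float]],
    weights: dict[str, float],
) -> float:
    """Weighted mean percentile over covered benchmarks, scaled by coverage.

    model_cells: benchmark -> this model's score (missing key = no cell)
    field_cells: benchmark -> scores of all ranked models on that benchmark
    weights:     benchmark -> relative weight (hypothetical values)
    """
    total_weight = sum(weights.values())
    covered_weight = 0.0
    weighted_sum = 0.0
    for bench, weight in weights.items():
        if bench not in model_cells:
            continue  # missing cell: contributes nothing and lowers coverage
        covered_weight += weight
        weighted_sum += weight * percentile_rank(model_cells[bench], field_cells[bench])
    if covered_weight == 0:
        return 0.0
    mean_percentile = weighted_sum / covered_weight
    coverage = covered_weight / total_weight  # assumed linear penalty
    return mean_percentile * coverage


if __name__ == "__main__":
    weights = {"A": 1.0, "B": 1.0}
    field = {"A": [10, 20, 30, 40], "B": [1, 2, 3, 4]}
    narrow = {"A": 40}           # top score, but only one benchmark cell
    broad = {"A": 30, "B": 3}    # good scores on both benchmarks
    print(overall_score(narrow, field, weights))
    print(overall_score(broad, field, weights))
```

In this toy setup the narrowly measured model leads on its single benchmark but ends up below the model with full coverage, which is the behaviour the penalty is meant to produce.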

Overall: 73.5

Benchmark cells: 14

Context: 1.05M

Creator: OpenAI

GPT-5.4 xHigh public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

RWT: 8.0
Coding Agent Index: 71.1
HLE: 52.1
GPQA: 92.8
SWE-Pro: 57.7
Terminal-Bench: 75.1
OSWorld: 75.0
MCP Atlas: 70.6
CharXiv: 82.8
MMMU-Pro: 81.2
ARC-AGI-2: 73.3
Tau2: 98.9
MRCR: 97.3

GPT-5.4 xHigh vs other AI models

Use these comparison links to evaluate GPT-5.4 xHigh against nearby LLMs by benchmark score, price, context window, and provider.

Claude Mythos/Fable 5 vs GPT-5.4 xHigh
Claude Opus 4.8 (Adaptive) vs GPT-5.4 xHigh
GPT-5.5 xHigh vs GPT-5.4 xHigh
GLM-5.2 vs GPT-5.4 xHigh
Claude Opus 4.7 (Adaptive) vs GPT-5.4 xHigh
Qwen3.7 Max vs GPT-5.4 xHigh
GPT-5.4 xHigh vs Kimi K2.7 Code
GPT-5.4 xHigh vs MiniMax-M3

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

LangChain langchain==1.3.10 Release Notes

fix(langchain): detect provider strategy for dated `gpt-5.2`/`gpt-5.4` snapshots (#38222)
test(core,langchain): update tests for explicit deserialization allowlists (#38118)

Trusted access for the next era of cyber defense

Simon Willison’s Weblog, link blog, 14th April 2026. OpenAI's answer to Claude Mythos appears to be a new model called GPT-5.4-Cyber: In preparation for increasingly more capable models from OpenAI over the next few months, we are fine-tuning our mo

Quoting Romain Huet

Simon Willison’s Weblog, 25th April 2026. "Since GPT-5.4, we’ve unified Codex and the main model into a single system, so there’s no separate coding line anymore. GPT-5.5 takes this further, with strong gains in agentic coding, computer use, and any task on a computer." Attributed to Romain Huet, confirming OpenAI won't release a GPT-5.5-Codex model.

datasette-llm 0.1a4

Simon Willison’s Weblog, 31st March 2026. Release: datasette-llm 0.1a4, an LLM integration plugin for other plugins to depend on. Ability to configure different API keys for models based on their purpose - for example, set it up so enrichments always use gpt-5.4-mini with an API key dedicated to that purpose. #4 I released llm-echo 0.3 to provide an API key testing ut

Last cached leaderboard date: June 18, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.