LLM Leaderboard · Proprietary

GPT-5.4 xHigh benchmarks, pricing, and LLM comparison.

Compare GPT-5.4 xHigh vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #7. AskClash overall score: 73.5
$2.50 / $15.0. Input and output token price, when published. Context: 1.05M.
API/OAuth. Billing and access path cached for this model row.
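
As a rough illustration of how the listed prices translate into a per-request cost, here is a minimal sketch. The page does not state the pricing unit; the code assumes the common convention of US dollars per 1M tokens, and the function name `estimate_cost_usd` and its parameters are hypothetical.

```python
def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price: float = 2.50,   # listed input price (unit assumed: USD per 1M tokens)
    output_price: float = 15.0,  # listed output price (same assumed unit)
    tokens_per_unit: int = 1_000_000,  # assumption, not stated on the page
) -> float:
    """Estimate the cost of one request from token counts and listed prices."""
    total = input_tokens * input_price + output_tokens * output_price
    return total / tokens_per_unit


if __name__ == "__main__":
    # Hypothetical request: 200k input tokens and 10k output tokens.
    print(f"${estimate_cost_usd(200_000, 10_000):.2f}")
```

If the provider bills in a different unit, only `tokens_per_unit` needs to change.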

GPT-5.4 xHigh benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
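
The exact AskClash formula is not published on this page. The sketch below only illustrates the general idea described above: convert each benchmark cell to a percentile against other models, take a weighted mean, and scale it down when coverage is incomplete. The weights, the square-root coverage penalty, and the names `percentile_rank` and `overall_score` are all assumptions made for illustration.

```python
from bisect import bisect_right


def percentile_rank(value: float, population: list[float]) -> float:
    """Percentage of population scores that are <= value (0 to 100)."""
    ordered = sorted(population)
    return 100.0 * bisect_right(ordered, value) / len(ordered)


def overall_score(
    model_cells: dict[str, float],
    all_cells: dict[str, list[float]],
    weights: dict[str, float],
    coverage_power: float = 0.5,  # hypothetical penalty strength
) -> float:
    """Weighted mean of per-benchmark percentiles, scaled by a coverage penalty."""
    total_weight = sum(weights.values())
    covered_weight = 0.0
    weighted_sum = 0.0
    for bench, weight in weights.items():
        if bench not in model_cells:
            continue  # a missing cell adds nothing and lowers coverage
        pct = percentile_rank(model_cells[bench], all_cells[bench])
        weighted_sum += weight * pct
        covered_weight += weight
    if covered_weight == 0:
        return 0.0
    mean_pct = weighted_sum / covered_weight
    coverage = covered_weight / total_weight  # fraction of weight measured
    return mean_pct * coverage ** coverage_power


if __name__ == "__main__":
    # Made-up data, not leaderboard values.
    weights = {"bench_a": 1.0, "bench_b": 1.0}
    all_cells = {"bench_a": [10, 20, 30, 40], "bench_b": [1, 2, 3, 4]}
    full = overall_score({"bench_a": 40, "bench_b": 4}, all_cells, weights)
    narrow = overall_score({"bench_a": 40}, all_cells, weights)
    print(full, narrow)  # the narrow row scores lower despite a perfect cell
```

With this kind of penalty, a model measured on only one benchmark cannot outrank a fully measured model with the same percentiles, which matches the stated goal that narrow rows should not dominate.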

Overall: 73.5
Benchmark cells: 14
Context: 1.05M
Creator: OpenAI

GPT-5.4 xHigh public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

RWT: 8.0
Coding Agent Index: 71.1
HLE: 52.1
GPQA: 92.8
SWE-Pro: 57.7
Terminal-Bench: 75.1
OSWorld: 75.0
MCP Atlas: 70.6
CharXiv: 82.8
MMMU-Pro: 81.2
ARC-AGI-2: 73.3
Tau2: 98.9
MRCR: 97.3

GPT-5.4 xHigh vs other AI models

Use these comparison links to evaluate GPT-5.4 xHigh against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

LangChain langchain==1.3.10 Release Notes

fix(langchain): detect provider strategy for dated `gpt-5.2`/`gpt-5.4` snapshots (#38222)
test(core,langchain): update tests for explicit deserialization allowlists (#38118)

Trusted access for the next era of cyber defense

Simon Willison’s Weblog, 14th April 2026 (Link Blog). OpenAI's answer to Claude Mythos appears to be a new model called GPT-5.4-Cyber: In preparation for increasingly more capable models from OpenAI over the next few months, we are fine-tuning our mo

Quoting Romain Huet

Simon Willison’s Weblog, 25th April 2026. Since GPT-5.4, we’ve unified Codex and the main model into a single system, so there’s no separate coding line anymore. GPT-5.5 takes this further, with strong gains in agentic coding, computer use, and any task on a computer. (Romain Huet, confirming OpenAI won't release a GPT-5.5-Codex model.)

datasette-llm 0.1a4

Simon Willison’s Weblog, 31st March 2026. Release: datasette-llm 0.1a4, an LLM integration plugin for other plugins to depend on. Ability to configure different API keys for models based on their purpose - for example, set it up so enrichments always use gpt-5.4-mini with an API key dedicated to that purpose. #4 I released llm-echo 0.3 to provide an API key testing ut

Last cached leaderboard date: June 18, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.