How does Grok 4 compare with other LLMs?

AskClash compares Grok 4 against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for Grok 4?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Proprietary

Grok 4 benchmarks, pricing, and LLM comparison.

Compare Grok 4 vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.


Rank #39. AskClash overall score: 32.8

$1.25 / $2.50: input and output token price, when published. Context: 1M.
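As a rough illustration of how the listed prices translate into a request cost, here is a minimal sketch. The page does not state the pricing unit, so the code assumes the common convention of US dollars per one million tokens. The function name `estimate_cost_usd` and its parameters are hypothetical and not part of any AskClash or xAI API.

```python
def estimate_cost_usd(input_tokens, output_tokens,
                      input_price=1.25, output_price=2.50,
                      tokens_per_unit=1_000_000):
    """Estimate the cost of one request in USD.

    Assumption: prices are quoted per `tokens_per_unit` tokens
    (per million by default). The source page does not confirm the unit.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * input_price + output_tokens * output_price
    return total / tokens_per_unit


# Example: 200,000 input tokens and 50,000 output tokens.
print(estimate_cost_usd(200_000, 50_000))  # 0.375
```

If the prices turn out to be quoted per thousand tokens instead, passing `tokens_per_unit=1_000` adjusts the estimate without changing the rest of the calculation.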

API/OAuth: billing and access path cached for this model row.

Grok 4 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
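The exact AskClash formula is not published on this page, so the following is only a sketch of one way a weighted percentile score with a missing-coverage penalty could work. It assumes mid-rank percentiles and a penalty equal to the fraction of total benchmark weight the model actually covers. The function names, weights, and field scores are all hypothetical illustration values; only the Grok 4 HLE and GPQA scores come from this page.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(value, population):
    """Mid-rank percentile (0-100) of `value` within `population`."""
    ordered = sorted(population)
    below = bisect_left(ordered, value)
    equal = bisect_right(ordered, value) - below
    return 100.0 * (below + 0.5 * equal) / len(ordered)


def overall_score(model_scores, field_scores, weights):
    """Weighted mean percentile, scaled down by missing coverage.

    Assumption: the penalty is the share of total weight the model
    has scores for. This is an illustrative choice, not AskClash's
    confirmed method.
    """
    total_weight = sum(weights.values())
    covered_weight = 0.0
    weighted_sum = 0.0
    for bench, weight in weights.items():
        if bench not in model_scores:
            continue  # missing cell: contributes nothing, lowers coverage
        pct = percentile_rank(model_scores[bench], field_scores[bench])
        weighted_sum += weight * pct
        covered_weight += weight
    if covered_weight == 0:
        return 0.0
    mean_percentile = weighted_sum / covered_weight
    coverage = covered_weight / total_weight
    return mean_percentile * coverage


# Hypothetical weights and field scores, for illustration only.
weights = {"HLE": 1, "GPQA": 1, "SWE-bench": 2}
field_scores = {
    "HLE": [10.0, 20.0, 26.9, 40.0],
    "GPQA": [70.0, 80.0, 87.7, 90.0],
    "SWE-bench": [50.0, 60.0, 70.0, 80.0],
}
model_scores = {"HLE": 26.9, "GPQA": 87.7}  # no SWE-bench cell

print(overall_score(model_scores, field_scores, weights))  # 31.25
```

In this toy setup the model sits at the 62.5th percentile on both covered benchmarks, but covers only half of the total weight, so the score is halved. That is the effect the description attributes to the penalty: a narrow row cannot outrank a better-measured model on a few strong cells alone.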

Overall: 32.8

Benchmark cells: 7

Context: 1M

Creator: xAI

Grok 4 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE: 26.9
GPQA: 87.7
MATH-500: 99.0
IFEval: 53.7
LiveCodeBench: 81.9
ARC-AGI-2: 15.9
Tau2: 74.9

Grok 4 vs other AI models

Use these comparison links to evaluate Grok 4 against nearby LLMs by benchmark score, price, context window, and provider.

Grok 4 vs Claude Opus 4.8 (Adaptive)
Grok 4 vs GPT-5.5 xHigh
Grok 4 vs Claude Opus 4.7 (Adaptive)
Grok 4 vs GPT-5.5
Grok 4 vs Gemini 3.5 Flash High
Grok 4 vs Claude Mythos Preview
Grok 4 vs GPT-5.4
Grok 4 vs Kimi K2.6 Thinking

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

xAI introduces its coding agent called Grok Build


SpaceX has listed Grok's 'spicy' mode as a risk in its initial public offering: 'These modes may be more irreverent and harsher than our standard offerings'

Source: PC Gamer

Hey @meta.ai is that true? Threads is testing a Grok-like AI feature


AWS reportedly to tuck Elon Musk's Grok into Bedrock, despite zero enterprise demand


Last cached leaderboard date: May 28, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.