How does Grok 4 compare with other LLMs?

AskClash compares Grok 4 against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for Grok 4?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Proprietary

Grok 4 leaderboard — benchmarks, pricing, and comparisons.

Compare Grok 4 vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Open live LLM leaderboard Open in app

Rank #35AskClash overall score: 34.5

$1.25 / $2.50Input and output token price, when published. Context: 1M.

API/OAuthBilling and access path cached for this model row.

Grok 4 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall34.5

Benchmark cells7

Context1M

CreatorxAI

Grok 4 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE

23.9 score

GPQA

87.7 score

MATH-500

99.0 score

IFEval

53.7 score

LiveCodeBench

81.9 score

ARC-AGI 2

15.9 score

Tau2

74.9 score

Grok 4 vs other AI models

Use these comparison links to evaluate Grok 4 against nearby LLMs by benchmark score, price, context window, and provider.

Claude Mythos/Fable 5 vs Grok 4 Claude Opus 4.8 (Adaptive) vs Grok 4 GPT-5.5 xHigh vs Grok 4 GLM-5.2 vs Grok 4 Claude Opus 4.7 (Adaptive) vs Grok 4 Qwen3.7 Max vs Grok 4 GPT-5.4 xHigh vs Grok 4 Kimi K2.7 Code vs Grok 4

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Grok Is Still Hosting Sexualized Deepfakes of Famous Women

Wired

Massive Effigy of Elon Musk Raised Over Times Square to Protest Grok

Wired

Sensor Tower: ChatGPT's market share fell to 46.4% by the end of May, as Gemini rose to 27.7% and Claude to 10.3%; Grok, Meta AI, and others have less than 5% (Ivan Mehta/TechCrunch)

Techmeme

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims

TechCrunch

Last cached leaderboard date: June 18, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.