LLM Leaderboard · Proprietary

Grok 4 leaderboard — benchmarks, pricing, and comparisons.

Compare Grok 4 vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #35AskClash overall score: 34.5
$1.25 / $2.50Input and output token price, when published. Context: 1M.
API/OAuthBilling and access path cached for this model row.

Grok 4 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall34.5
Benchmark cells7
Context1M
CreatorxAI

Grok 4 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE

23.9 score

GPQA

87.7 score

MATH-500

99.0 score

IFEval

53.7 score

LiveCodeBench

81.9 score

ARC-AGI 2

15.9 score

Tau2

74.9 score

Grok 4 vs other AI models

Use these comparison links to evaluate Grok 4 against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Last cached leaderboard date: June 18, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.