LLM Leaderboard · Proprietary

Grok 4.20 benchmarks, pricing, and LLM comparison.

Compare Grok 4.20 vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #35. AskClash overall score: 42.2.
$2.00 / $6.00: input and output token price, when published. Context: 2M.
API/OAuth: billing and access path cached for this model row.
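
The input and output token prices listed above can be turned into a rough per-request cost estimate. The sketch below assumes the prices are quoted in USD per one million tokens, which this page does not state; the constant names and the `estimate_cost` helper are hypothetical and only illustrate the arithmetic.

```python
# ASSUMPTION: prices are USD per 1,000,000 tokens. The page lists
# "$2.00 / $6.00" without a unit, so adjust TOKENS_PER_UNIT if the
# provider quotes a different basis.
INPUT_PRICE = 2.00    # listed input token price
OUTPUT_PRICE = 6.00   # listed output token price
TOKENS_PER_UNIT = 1_000_000  # assumed pricing unit


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD under the assumed unit."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Hypothetical request: 120k tokens of input, 4k tokens of output.
    print(f"${estimate_cost(120_000, 4_000):.4f}")
```

Because the output price is three times the input price, long completions weigh on the total more than long prompts of the same length.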

Grok 4.20 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
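
One way to picture a coverage-penalized weighted percentile score is sketched below. This is not AskClash's published formula: the percentile definition, the weights, and the choice to multiply by the covered share of total weight are all assumptions made for illustration, and the function names are hypothetical.

```python
from bisect import bisect_right


def percentile_rank(value: float, population: list[float]) -> float:
    """Percentage (0-100) of population scores less than or equal to value."""
    ordered = sorted(population)
    return 100.0 * bisect_right(ordered, value) / len(ordered)


def overall_score(
    model_scores: dict[str, float],
    field_scores: dict[str, list[float]],
    weights: dict[str, float],
) -> float:
    """Weighted mean percentile over covered benchmarks, scaled by coverage.

    model_scores: benchmark -> score for one model (missing cells omitted)
    field_scores: benchmark -> scores of every model measured on it
    weights:      benchmark -> relative weight (assumed, not published)
    """
    total_weight = sum(weights.values())
    covered_weight = 0.0
    weighted_sum = 0.0
    for bench, weight in weights.items():
        if bench not in model_scores:
            continue  # missing cell: contributes nothing and lowers coverage
        pct = percentile_rank(model_scores[bench], field_scores[bench])
        weighted_sum += weight * pct
        covered_weight += weight
    if covered_weight == 0:
        return 0.0
    mean_percentile = weighted_sum / covered_weight
    coverage = covered_weight / total_weight  # ASSUMED penalty: linear in coverage
    return mean_percentile * coverage
```

Under these assumptions, a model that tops a single benchmark out of two equally weighted ones scores lower than a model measured on both with a mixed result, which is the behavior the paragraph above describes for narrow rows.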

Overall: 42.2
Benchmark cells: 9
Context: 2M
Creator: xAI

Grok 4.20 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE: 32.2
GPQA: 91.1
IFEval: 81.2
SWE-bench: 76.7
Terminal-Bench: 37.9
CharXiv: 60.9
MMMU-Pro: 75.2
ARC-AGI-2: 53.3
Tau2: 59.9

Grok 4.20 vs other AI models

Use these comparison links to evaluate Grok 4.20 against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

xAI introduces its coding agent called Grok Build


Grok Build 👨‍💻, Codex customizations 🤖, xAI exodus 👋

TLDR AI, 2026-05-15.

Grok 4.3 🤖, Claude security beta 🛡️, Cursor xAI analysis

TLDR AI, 2026-05-01.

Last cached leaderboard date: May 25, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.