HLE
26.5 score
Compare Gemma 4 31B vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.
AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.
26.5 score
84.3 score
75.6 score
36.4 score
80.0 score
76.9 score
86.4 score
Use these comparison links to evaluate Gemma 4 31B against nearby LLMs by benchmark score, price, context window, and provider.
Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.
Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines
Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks. * Update MLX and MLX-C with threading fixes by @dhiltgen in https://github.com/ollama/ollama/pull/15845
Release: llm-gemini 0.30 Simon Willison’s Weblog Subscribe Sponsored by: WorkOS — Production-ready APIs for auth and access control, so you can ship faster. 2nd April 2026 Release llm-gemini 0.30 — LLM plugin to access Google's Gemini family of models New models gemini-3.1-flash-lite-preview , gemma-4-26b-a4b-it and gemma-4-31b-it . See my notes on Gemma 4 . Posted 2nd April 2026 at 6:25 pm Recent articles Highlights from my conversation about agentic engineering on Lenny's Podcast - 2nd April 2

Ars Technica
Last cached leaderboard date: May 25, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.