How does Gemma 4 31B compare with other LLMs?

AskClash compares Gemma 4 31B against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for Gemma 4 31B?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.


Gemma 4 31B benchmarks, pricing, and LLM comparison.

Compare Gemma 4 31B vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.


Rank #37. AskClash overall score: 38.7

$0 / $0: input and output token price, when published. Context: 256K.

API/OAuth: billing and access path cached for this model row.

Gemma 4 31B benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
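The scoring idea described above can be sketched in a few lines. This is a minimal illustration, not AskClash's actual formula, which the page does not publish: the function names, the equal weights, the tie handling, and the choice to multiply by the covered share of weight are all assumptions, and the benchmark names and scores are made-up sample data.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(value, population):
    """Percentile (0-100) of value within population; ties count as half."""
    ordered = sorted(population)
    below = bisect_left(ordered, value)
    equal = bisect_right(ordered, value) - below
    return 100.0 * (below + 0.5 * equal) / len(ordered)


def overall_score(model_scores, leaderboard, weights):
    """Weighted mean of per-benchmark percentiles, scaled by coverage.

    model_scores: benchmark name -> this model's score (missing = not measured)
    leaderboard:  benchmark name -> scores of all models on that benchmark
    weights:      benchmark name -> relative weight (hypothetical values)
    """
    weighted_sum = 0.0
    covered_weight = 0.0
    for bench, weight in weights.items():
        if bench not in model_scores or bench not in leaderboard:
            continue  # missing cell: contributes nothing and lowers coverage
        weighted_sum += weight * percentile_rank(model_scores[bench], leaderboard[bench])
        covered_weight += weight
    if covered_weight == 0:
        return 0.0
    # Assumed penalty: scale by the share of total weight that was measured.
    coverage = covered_weight / sum(weights.values())
    return (weighted_sum / covered_weight) * coverage


# Made-up sample data, for illustration only.
leaderboard = {"bench_a": [10, 20, 30, 40], "bench_b": [50, 60, 70, 80]}
weights = {"bench_a": 1.0, "bench_b": 1.0}

narrow_row = overall_score({"bench_a": 40}, leaderboard, weights)
broad_row = overall_score({"bench_a": 30, "bench_b": 70}, leaderboard, weights)
```

With this sample data, the model that tops a single benchmark but lacks the second cell ends up below the model with solid scores on both, which is the effect the coverage penalty is meant to produce.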

Overall: 38.7

Benchmark cells: 7

Context: 256K

Creator: Google

Gemma 4 31B public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE: 26.5

GPQA: 84.3

IFEval: 75.6

Terminal-Bench: 36.4

LiveCodeBench: 80.0

MMMU-Pro: 76.9

Tau2: 86.4

Gemma 4 31B vs other AI models

Use these comparison links to evaluate Gemma 4 31B against nearby LLMs by benchmark score, price, context window, and provider.

Gemma 4 31B vs Claude Opus 4.8 (Adaptive)

Gemma 4 31B vs GPT-5.5 xHigh

Gemma 4 31B vs Claude Opus 4.7 (Adaptive)

Gemma 4 31B vs GPT-5.5

Gemma 4 31B vs Gemini 3.5 Flash High

Gemma 4 31B vs Claude Mythos Preview

Gemma 4 31B vs GPT-5.4

Gemma 4 31B vs Kimi K2.6 Thinking

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines

Ollama v0.23.1 Release Notes

Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks. * Update MLX and MLX-C with threading fixes by @dhiltgen in https://github.com/ollama/ollama/pull/15845

llm-gemini 0.30

Simon Willison’s Weblog, 2nd April 2026. Release: llm-gemini 0.30, an LLM plugin to access Google's Gemini family of models. New models: gemini-3.1-flash-lite-preview, gemma-4-26b-a4b-it and gemma-4-31b-it.

Google's Gemma 4 AI models get 3x speed boost by predicting future tokens

Ars Technica

Last cached leaderboard date: May 25, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.