How does Gemma 4 31B compare with other LLMs?

AskClash compares Gemma 4 31B against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for Gemma 4 31B?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Open Weight

Gemma 4 31B benchmarks, pricing, and LLM comparison.

Compare Gemma 4 31B vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.


Rank #23. AskClash overall score: 46.6

$0 / $0: input and output token price, when published. Context: 256K.

API/OAuth: billing and access path cached for this model row.

Gemma 4 31B benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall: 46.6
Benchmark cells: 5
Context: 256K
Creator: Google
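
The scoring idea described above (a weighted percentile score with a penalty for missing coverage) can be sketched roughly as follows. This is a minimal illustration and not AskClash's actual formula: the function names, the equal weights, the coverage-scaling penalty, and the toy scores for benchmarks "A" and "B" are all assumptions made for the example.

```python
from bisect import bisect_right


def percentile(value, population):
    """Percent of population scores at or below value (0 to 100)."""
    ranked = sorted(population)
    return 100.0 * bisect_right(ranked, value) / len(ranked)


def overall_score(cells, populations, weights):
    """Weighted mean percentile over covered benchmarks, scaled by coverage.

    cells:       benchmark -> this model's score (missing benchmarks omitted)
    populations: benchmark -> scores of all models on that benchmark
    weights:     benchmark -> weight (hypothetical values)
    """
    covered = [b for b in weights if b in cells]
    if not covered:
        return 0.0
    covered_weight = sum(weights[b] for b in covered)
    weighted = sum(
        weights[b] * percentile(cells[b], populations[b]) for b in covered
    )
    mean_percentile = weighted / covered_weight
    # Assumed penalty: scale by the share of total weight that has a cell.
    coverage = covered_weight / sum(weights.values())
    return mean_percentile * coverage


# Toy data, not real leaderboard values.
weights = {"A": 1.0, "B": 1.0}
populations = {"A": [10, 20, 30, 40], "B": [50, 60, 70, 80]}

full_row = overall_score({"A": 30, "B": 80}, populations, weights)
narrow_row = overall_score({"B": 80}, populations, weights)
print(full_row, narrow_row)  # 87.5 50.0
```

In this sketch the narrow row holds the top score on its only benchmark, yet it lands below the fully measured row because half of the total weight has no cell.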

Gemma 4 31B public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE: 26.5
GPQA: 84.3
LiveCodeBench: 80.0
MMMU-Pro: 76.9
Tau2: 86.4

Gemma 4 31B vs other AI models

Use these comparison links to evaluate Gemma 4 31B against nearby LLMs by benchmark score, price, context window, and provider.

Claude Mythos/Fable 5 vs Gemma 4 31B
Claude Opus 4.8 (Adaptive) vs Gemma 4 31B
GPT-5.5 xHigh vs Gemma 4 31B
GLM-5.2 vs Gemma 4 31B
Claude Opus 4.7 (Adaptive) vs Gemma 4 31B
Qwen3.7 Max vs Gemma 4 31B
GPT-5.4 xHigh vs Gemma 4 31B
Kimi K2.7 Code vs Gemma 4 31B

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines

Ollama v0.23.1 Release Notes

Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks.
* Update MLX and MLX-C with threading fixes by @dhiltgen in https://github.com/ollama/ollama/pull/15845

llm-gemini 0.30

Simon Willison’s Weblog, 2nd April 2026. Release: llm-gemini 0.30, an LLM plugin to access Google's Gemini family of models. New models: gemini-3.1-flash-lite-preview, gemma-4-26b-a4b-it and gemma-4-31b-it. See my notes on Gemma 4. Posted 2nd April 2026 at 6:25 pm.

Gemma 4 audio with MLX

Simon Willison’s Weblog, 12th April 2026. Thanks to a tip from Rahim Nathwani, here's a uv run recipe for transcribing an audio file on macOS using the 10.28 GB Gemma 4 E2B model with MLX and mlx-vlm:

uv run --python 3.13 --with mlx_vlm --with torchvision --with gradio \
  mlx_vlm.generate \
  --model google/gemma-4-e2b-it \
  --audio

Last cached leaderboard date: June 18, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.