LLM Leaderboard · Open Weight

Gemma 4 31B benchmarks, pricing, and LLM comparison.

Compare Gemma 4 31B vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #23AskClash overall score: 46.6
$0 / $0Input and output token price, when published. Context: 256K.
API/OAuthBilling and access path cached for this model row.

Gemma 4 31B benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall46.6
Benchmark cells5
Context256K
CreatorGoogle

Gemma 4 31B public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE

26.5 score

GPQA

84.3 score

LiveCodeBench

80.0 score

MMMU-Pro

76.9 score

Tau2

86.4 score

Gemma 4 31B vs other AI models

Use these comparison links to evaluate Gemma 4 31B against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Ollama v0.23.1 Release Notes

Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks. * Update MLX and MLX-C with threading fixes by @dhiltgen in https://github.com/ollama/ollama/pull/15845

llm-gemini 0.30

Release: llm-gemini 0.30 Simon Willison’s Weblog Subscribe Sponsored by: WorkOS — Production-ready APIs for auth and access control, so you can ship faster. 2nd April 2026 Release llm-gemini 0.30 — LLM plugin to access Google's Gemini family of models New models gemini-3.1-flash-lite-preview , gemma-4-26b-a4b-it and gemma-4-31b-it . See my notes on Gemma 4 . Posted 2nd April 2026 at 6:25 pm Recent articles Highlights from my conversation about agentic engineering on Lenny's Podcast - 2nd April 2

Gemma 4 audio with MLX

Gemma 4 audio with MLX Simon Willison’s Weblog Subscribe Sponsored by: Teleport — Connect agents to your infra in seconds with Teleport Beams. Built-in identity. Zero secrets. Get early access 12th April 2026 Thanks to a tip from Rahim Nathwani , here's a uv run recipe for transcribing an audio file on macOS using the 10.28 GB Gemma 4 E2B model with MLX and mlx-vlm : uv run --python 3.13 --with mlx_vlm --with torchvision --with gradio \ mlx_vlm.generate \ --model google/gemma-4-e2b-it \ --audio

Last cached leaderboard date: June 18, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.