LLM Leaderboard · AI model

GLM 5.1 Thinking benchmarks, pricing, and LLM comparison.

Compare GLM 5.1 Thinking vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #24AskClash overall score: 30.6
— / —Input and output token price, when published. Context: —.
APIBilling and access path cached for this model row.

GLM 5.1 Thinking benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall30.6
Benchmark cells7
Context
CreatorZhipu AI

GLM 5.1 Thinking public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE

31.0 score

GPQA

86.2 score

SWE-Pro

58.4 score

Terminal-Bench

63.5 score

MCP Atlas

71.8 score

Finance Agent

44.8 score

Tau2

70.6 score

GLM 5.1 Thinking vs other AI models

Use these comparison links to evaluate GLM 5.1 Thinking against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

SGLang — RadixAttention Inference Server v0.5.11 Release Notes

- **CUDA 13 + Torch 2.11**: Default CUDA version moves to 13.0 across SGLang, sgl-kernel, and Docker images, and PyTorch is upgraded from 2.9 to 2.11 — modernizing the build matrix and unlocking newer kernels: #21247, #24162, #24183, #23593 ([tracking issue #21498](https://github.com/sgl-project/sglang/issues/21498)) - **Day-0 / New Model Support**: Gemma 4, GLM-5.1, Qwen3.6, MiMo-V2.5 / V2.5-Pro, Ling-2.6-Flash, Mistral Medium 3.5, and Kimi-K2.6 — with cookbook recipes for tuned deployment comm

Anthropic's Fable AI brings the capabilities of its unreleased Mythos model to regular users

Anthropic's Fable AI Brings The Capabilities Of Its Unreleased Mythos Model To Regular Users Latest News AI Apps Computing Mobile Social Media EVs & Transportation Reviews Smartphones Laptops & PCs Gaming Headphones Wearables Photography Tablets Home Buying Guides Laptops Headphones Smart Home Gaming Gaming Nintendo PC PlayStation Xbox Big Tech Amazon Apple Google Meta Microsoft Samsung Entertainment TV & Movies Streaming Cybersecurity VPN Wearables Tomorrow Science Space Robotics Newsletter Abo

Import AI 454: Automating alignment research; safety study of a Chinese model; HiFloat4

Import AI 454: Automating alignment research; safety study of a Chinese model; HiFloat4 <img style="object-fit:cover;width:40px;height:40px;" src="https://substackcdn.com/image/fetch/$s_!3yYS!,w_40,h_40,c_fill,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6d17996-2bef-40a4-abe3-be72a0e8a227_258x258.png" srcset="https://substackcdn.com/image/fetch/$s_!3yYS!,w_40,h_40,c_fill,

Last cached leaderboard date: June 9, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.