How does GLM 5.1 Thinking compare with other LLMs?

AskClash compares GLM 5.1 Thinking against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for GLM 5.1 Thinking?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.


GLM 5.1 Thinking benchmarks, pricing, and LLM comparison.

Compare GLM 5.1 Thinking vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.


Rank #33. AskClash overall score: 42.5

$1.40 / $4.40: input and output token price, when published. Context: 200k.

API: billing and access path cached for this model row.
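As a rough illustration of how the listed prices translate into a per-request cost, the Python sketch below assumes the $1.40 / $4.40 figures are US dollars per one million input and output tokens. The page does not state the unit, so that reading, along with the hypothetical `estimate_cost` helper, is an assumption rather than AskClash's or the provider's billing logic.

```python
# Assumption: listed prices are USD per 1,000,000 tokens (unit not stated on the page).
INPUT_PRICE = 1.40
OUTPUT_PRICE = 4.40
TOKENS_PER_PRICE_UNIT = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request under the assumed unit."""
    total = input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Hypothetical request: 50,000 prompt tokens and 2,000 completion tokens.
    print(f"Estimated cost: ${estimate_cost(50_000, 2_000):.4f}")
```

Thinking models typically bill reasoning tokens as output, so actual costs may sit noticeably above an estimate based on visible completion length alone.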

GLM 5.1 Thinking benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall: 42.5
Benchmark cells: 7
Context: 200k
Creator: Zhipu AI
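The scoring idea described above (a weighted percentile score with a penalty for missing coverage) can be sketched in Python as follows. AskClash does not publish its exact formula on this page, so the weights, the linear coverage penalty, the `min_cells` threshold, and all function names here are illustrative assumptions, and the sample numbers are toy values rather than leaderboard data.

```python
from bisect import bisect_right


def percentile_rank(value, population):
    """Percent of population scores that are <= value, on a 0-100 scale."""
    ordered = sorted(population)
    return 100.0 * bisect_right(ordered, value) / len(ordered)


def overall_score(model_scores, leaderboard, weights, min_cells):
    """Weighted mean of per-benchmark percentiles, scaled down for thin coverage.

    model_scores: benchmark name -> this model's score (missing cells omitted)
    leaderboard:  benchmark name -> scores of all models on that benchmark
    weights:      benchmark name -> relative weight (assumed; defaults to 1.0)
    min_cells:    number of cells needed to avoid any penalty (assumed)
    """
    weighted_sum = 0.0
    total_weight = 0.0
    for bench, value in model_scores.items():
        w = weights.get(bench, 1.0)
        weighted_sum += w * percentile_rank(value, leaderboard[bench])
        total_weight += w
    if total_weight == 0:
        return 0.0
    mean_percentile = weighted_sum / total_weight
    # Assumed penalty: scale linearly by the fraction of required cells present.
    coverage = min(1.0, len(model_scores) / min_cells)
    return mean_percentile * coverage


if __name__ == "__main__":
    # Toy data only; these are not real benchmark results.
    leaderboard = {"bench_a": [10, 20, 30, 40], "bench_b": [50, 60, 70, 80]}
    model = {"bench_a": 30, "bench_b": 60}
    weights = {"bench_a": 2.0, "bench_b": 1.0}
    print(round(overall_score(model, leaderboard, weights, min_cells=4), 2))
```

Under a penalty of this kind, a model with very high scores on only a few benchmarks ends up below a model with solid scores across many, which matches the stated goal that narrow rows should not dominate better-measured ones.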

GLM 5.1 Thinking public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE: 28.0
GPQA: 86.2
IFEval: 76.3
Terminal-Bench: 63.5
MCP Atlas: 71.8
Finance Agent: 44.8
Tau2: 97.7

GLM 5.1 Thinking vs other AI models

Use these comparison links to evaluate GLM 5.1 Thinking against nearby LLMs by benchmark score, price, context window, and provider.

GLM 5.1 Thinking vs GPT-5.5 xHigh
GLM 5.1 Thinking vs Claude Opus 4.7 (Adaptive)
GLM 5.1 Thinking vs GPT-5.5
GLM 5.1 Thinking vs GPT-5.4
GLM 5.1 Thinking vs Gemini 3.5 Flash High
GLM 5.1 Thinking vs Composer 2.5
GLM 5.1 Thinking vs Claude Mythos Preview
GLM 5.1 Thinking vs Kimi K2.6 Thinking

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Latest open artifacts (#21): Open model bonanza! Gemma 4, DeepSeek V4, Kimi K2.6, MiMo 2.5, GLM-5.1 & others. On CAISI's V4 assessment.

Nathan Lambert - Interconnects

SGLang — RadixAttention Inference Server v0.5.11 Release Notes

- **CUDA 13 + Torch 2.11**: Default CUDA version moves to 13.0 across SGLang, sgl-kernel, and Docker images, and PyTorch is upgraded from 2.9 to 2.11, modernizing the build matrix and unlocking newer kernels: #21247, #24162, #24183, #23593 ([tracking issue #21498](https://github.com/sgl-project/sglang/issues/21498))
- **Day-0 / New Model Support**: Gemma 4, GLM-5.1, Qwen3.6, MiMo-V2.5 / V2.5-Pro, Ling-2.6-Flash, Mistral Medium 3.5, and Kimi-K2.6, with cookbook recipes for tuned deployment comm

Zhipu.AI’s Open-Source Power Play: Blazing-Fast GLM Models & Global Expansion Ahead of Potential IPO

Beijing, China – April 15, 2025 – In a strategic move that underscores its technological prowess and global ambitions, potentially paving the way for a future IPO, Chinese AI company Zhipu.AI has announced the comprehensive open-sourcing of its next-generation General Language Models (GLM). This release includes the advanced GLM-4 series and the groundbreaking GLM-Z1 inference

ScaleAcross Explorer: Exploring Communication Optimization for Scale-Across AI Model Training

Last cached leaderboard date: May 25, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.