HLE
34.7 score
AskClash combines public benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
Only benchmark columns with cached public values are shown here. Missing cells remain blank in the live table.
34.7 score
90.5 score
80.2 score
27.3 score
64.3 score
89.6 score
73.1 score
55.9 score
80.4 score
79.4 score
95.9 score
Use these links to compare nearby frontier and open-weight models from the same AI leaderboard data.
Cached AskClash article matches that can provide release, provider, or market context around this model.
Kimi K2.6 🚀, Codex Chronicle 🤖, Bezos’ $10B AI fundraise 💰 TLDR Newsletters Advertise TLDR TLDR AI 2026-04-21 Kimi K2.6 🚀, Codex Chronicle 🤖, Bezos’ $10B AI fundraise 💰 Your AI agents are already operating outside scope (Sponsor) New Cloud Security Alliance (CSA) research makes it clear: 47% of organizations have already experienced a security incident involving an AI agent. 53% report agents regularly exceeding intended permissions. And 87% of enterprises run two or more AI agent platforms. Eve

Nathan Lambert - Interconnects

Latent Space
- **CUDA 13 + Torch 2.11**: Default CUDA version moves to 13.0 across SGLang, sgl-kernel, and Docker images, and PyTorch is upgraded from 2.9 to 2.11 — modernizing the build matrix and unlocking newer kernels: #21247, #24162, #24183, #23593 ([tracking issue #21498](https://github.com/sgl-project/sglang/issues/21498)) - **Day-0 / New Model Support**: Gemma 4, GLM-5.1, Qwen3.6, MiMo-V2.5 / V2.5-Pro, Ling-2.6-Flash, Mistral Medium 3.5, and Kimi-K2.6 — with cookbook recipes for tuned deployment comm
Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash AI Leaderboard cache and linked from the live leaderboard.