HLE
34.7 score
AskClash combines public benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
Only benchmark columns with cached public values are shown here. Missing cells remain blank in the live table.
34.7 score
90.5 score
80.2 score
66.7 score
89.6 score
73.1 score
44.9 score
80.4 score
79.4 score
Use these links to compare nearby frontier and open-weight models from the same AI leaderboard data.
Cached AskClash article matches that can provide release, provider, or market context around this model.
Kimi K2.6 🚀, Codex Chronicle 🤖, Bezos’ $10B AI fundraise 💰 TLDR Newsletters Advertise TLDR TLDR AI 2026-04-21 Kimi K2.6 🚀, Codex Chronicle 🤖, Bezos’ $10B AI fundraise 💰 Your AI agents are already operating outside scope (Sponsor) New Cloud Security Alliance (CSA) research makes it clear: 47% of organizations have already experienced a security incident involving an AI agent. 53% report agents regularly exceeding intended permissions. And 87% of enterprises run two or more AI agent platforms. Eve

Nathan Lambert - Interconnects

Latent Space
* server: apply format when think=false with thinking-capable parser by @ParthSareen in https://github.com/ollama/ollama/pull/15678 * launch: add kimi cli integration with installer flow by @ParthSareen in https://github.com/ollama/ollama/pull/15723
Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash AI Leaderboard cache and linked from the live leaderboard.