HLE
53.0 score
AskClash combines public benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
Only benchmark columns with cached public values are shown here. Missing cells remain blank in the live table.
53.0 score
91.3 score
98.0 score
80.8 score
11.8 score
70.2 score
72.7 score
77.3 score
68.8 score
84.8 score
Use these links to compare nearby frontier and open-weight models from the same AI leaderboard data.
Cached AskClash article matches that can provide release, provider, or market context around this model.

Latent Space
Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP Source: arXiv Logic / Formal Methods URL: https://arxiv.org/abs/2603.20405

Nathan Lambert - Interconnects

Latent Space
Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash AI Leaderboard cache and linked from the live leaderboard.