HLE
64.7 score
Compare Claude Mythos Preview vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.
AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.
64.7 score
94.5 score
93.9 score
77.8 score
82.0 score
79.6 score
93.2 score
92.7 score
Use these comparison links to evaluate Claude Mythos Preview against nearby LLMs by benchmark score, price, context window, and provider.
Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Behind the Scenes Hardening Firefox with Claude Mythos Preview Simon Willison’s Weblog Subscribe Sponsored by: MongoDB — Join MongoDB.local London 2026 on 7 May to learn how teams move AI from prototype to production. 7th May 2026 - Link Blog Behind the Scenes Hardening Firefox with Claude Mythos Preview ( via ) Fascinating, in-depth details on how Mozilla used their access to the Claude Mythos preview to locate and then fix hundreds of vulnerabilities in Firefox: Suddenly, the bugs are very goo

Latent Space

Anthropic’s Project Glasswing—restricting Claude Mythos to security researchers—sounds necessary to me Simon Willison’s Weblog Subscribe Sponsored by: WorkOS — Production-ready APIs for auth and access control, so you can ship faster. Anthropic’s Project Glasswing—restricting Claude Mythos to security researchers—sounds necessary to me 7th April 2026 Anthropic didn’t release their latest model, Claude Mythos ( system card PDF ), today. They have instead made it available to a very restricted set
Mythos, announced on April 7, is being deployed as part of Anthropic's "Project Glasswing," a controlled initiative under which select organizations, including major tech firms such as Amazon , Microsoft , Nvidia and Apple , are permitted to use the unreleased Claude Mythos Preview model for defensive cybersecurity purposes. The Pentagon is deploying Mythos to find and patch software vulnerabilities across the U.S. government even as it races to complete a transition away from the AI company,
Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.