How does Kimi K2.5 compare with other LLMs?

AskClash compares Kimi K2.5 against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for Kimi K2.5?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.


Kimi K2.5 benchmarks, pricing, and LLM comparison.

Compare Kimi K2.5 vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.


Rank #25. AskClash overall score: 45.3

$0.60 / $3.00: input and output token price, when published. Context: 256K.

API: billing and access path cached for this model row.
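As a rough illustration of how the listed token prices translate into a per-request cost, here is a minimal sketch. It assumes the $0.60 and $3.00 figures are US dollars per one million input and output tokens respectively, which is the common convention but is not stated on this page; the function name `estimate_cost` is hypothetical.

```python
# Assumption: prices are USD per 1,000,000 tokens (not stated on the page).
INPUT_PRICE_PER_M = 0.60   # listed input token price
OUTPUT_PRICE_PER_M = 3.00  # listed output token price


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # A long prompt with a short answer, under the per-million assumption.
    print(f"${estimate_cost(200_000, 4_000):.4f}")
```

Under that assumption, output tokens cost five times as much as input tokens, so long generations dominate the bill even when prompts are large.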

Kimi K2.5 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall: 45.3

Benchmark cells: 13

Context: 256K

Creator: Moonshot AI
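The scoring idea described above (a weighted percentile score with a penalty for missing coverage) can be sketched as follows. This is not AskClash's actual formula, which the page does not publish: the mid-rank percentile, the per-benchmark weights, and the choice to multiply by the covered share of total weight are all assumptions made for illustration, and the names `percentile_rank` and `overall_score` are hypothetical.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(value: float, population: list[float]) -> float:
    """Mid-rank percentile (0-100) of value within population scores."""
    ordered = sorted(population)
    below = bisect_left(ordered, value)         # scores strictly below value
    at_or_below = bisect_right(ordered, value)  # scores at or below value
    return 100.0 * (below + at_or_below) / (2 * len(ordered))


def overall_score(
    model_scores: dict[str, float],
    population_scores: dict[str, list[float]],
    weights: dict[str, float],
) -> float:
    """Weighted mean percentile, scaled down by missing coverage (assumed rule)."""
    total_weight = sum(weights.values())
    covered_weight = 0.0
    weighted_sum = 0.0
    for bench, weight in weights.items():
        # Skip benchmarks the model has no cell for, or with no population data.
        if bench not in model_scores or not population_scores.get(bench):
            continue
        rank = percentile_rank(model_scores[bench], population_scores[bench])
        weighted_sum += weight * rank
        covered_weight += weight
    if covered_weight == 0:
        return 0.0
    mean_percentile = weighted_sum / covered_weight
    coverage = covered_weight / total_weight  # assumed penalty: share of weight covered
    return mean_percentile * coverage
```

With this assumed rule, a model that tops a single benchmark but lacks every other cell scores well below a model with the same percentile across all benchmarks, which matches the stated goal of keeping narrow rows from dominating better-measured models.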

Kimi K2.5 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE: 30.1
GPQA: 87.6
MATH-500: 82.0
IFEval: 93.9
SWE-bench: 76.8
SWE-Pro: 50.7
Terminal-Bench: 50.8
LiveCodeBench: 85.0
OSWorld: 63.3
MCP Atlas: 29.5
CharXiv: 77.5
MMMU-Pro: 78.5
Tau2: 95.9

Kimi K2.5 vs other AI models

Use these comparison links to evaluate Kimi K2.5 against nearby LLMs by benchmark score, price, context window, and provider.

Kimi K2.5 vs GPT-5.5 xHigh
Kimi K2.5 vs GPT-5.5
Kimi K2.5 vs Claude Opus 4.7 (Adaptive)
Kimi K2.5 vs Gemini 3.5 Flash
Kimi K2.5 vs GPT-5.4
Kimi K2.5 vs Claude Mythos Preview
Kimi K2.5 vs Claude Opus 4.7
Kimi K2.5 vs Qwen3.7 Max

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

HuggingFace Transformers v5.5.4 Release Notes

Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex Attribute… (#45305) by ArthurZucker
Fix Qwen2.5-VL temporal RoPE scaling applied to still images (#45330) by Kash6, zucchini-nlp

[AINews] Moonshot Kimi K2.6: the world's leading Open Model refreshes to catch up to Opus 4.6 (ahead of DeepSeek v4?)

Latent Space

Kimi K2.6 🚀, Codex Chronicle 🤖, Bezos’ $10B AI fundraise 💰

TLDR AI, 2026-04-21

AgentFlow specs.py — Core Data Models (Pydantic Specs for Pipelines, Nodes, Agents)

Pydantic data models defining AgentFlow's type system: AgentKind (codex/claude/kimi/python/shell/sync), NodeSpec, PipelineSpec, ProviderConfig, target types (local/SSH/EC2/ECS/container), fanout expansion, MCP server specs.

Last cached leaderboard date: May 22, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.