LLM Leaderboard · Proprietary

Kimi K2.5 (Reasoning) benchmarks, pricing, and LLM comparison.

Compare Kimi K2.5 (Reasoning) vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #33AskClash overall score: 18.0
$0.60 / $3.00Input and output token price, when published. Context: 256K.
APIBilling and access path cached for this model row.

Kimi K2.5 (Reasoning) benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall18.0
Benchmark cells4
Context256K
CreatorMoonshot AI

Kimi K2.5 (Reasoning) public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

GPQA

87.6 score

SWE-bench

76.8 score

MMMU-Pro

78.5 score

Tau2

95.9 score

Kimi K2.5 (Reasoning) vs other AI models

Use these comparison links to evaluate Kimi K2.5 (Reasoning) against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

HuggingFace Transformers v5.5.4 Release Notes

** Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex Attribute… (#45305) by ArthurZucker ** Fix Qwen2.5-VL temporal RoPE scaling applied to still images (#45330) by Kash6, zucchini-nlp

Last cached leaderboard date: June 9, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.