How does MiniMax-M3 compare with other LLMs?

AskClash compares MiniMax-M3 against nearby AI models using public benchmark scores, pricing, context window, and access details.

What benchmarks are tracked for MiniMax-M3?

The page shows cached public benchmark cells such as HLE, GPQA, SWE-bench, SWE-Pro, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and related model scores when available.

LLM Leaderboard · Open Weight

MiniMax-M3 benchmarks, pricing, and LLM comparison.

Compare MiniMax-M3 vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Open live LLM leaderboard Open in app

Rank #9AskClash overall score: 67.4

$0.30 / $1.20Input and output token price, when published. Context: 1M.

API/OAuthBilling and access path cached for this model row.

MiniMax-M3 benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall67.4

Benchmark cells13

Context1M

CreatorMiniMax

MiniMax-M3 public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

RWT

9.5 score

HLE

37.1 score

GPQA

92.9 score

IFEval

82.9 score

SWE-bench

80.5 score

SWE-Pro

59.0 score

Terminal-Bench

66.0 score

OSWorld

70.1 score

MCP Atlas

74.2 score

Finance Agent

48.3 score

MMMU-Pro

78.1 score

Tau2

88.9 score

MiniMax-M3 vs other AI models

Use these comparison links to evaluate MiniMax-M3 against nearby LLMs by benchmark score, price, context window, and provider.

Claude Mythos/Fable 5 vs MiniMax-M3 Claude Opus 4.8 (Adaptive) vs MiniMax-M3 GPT-5.5 xHigh vs MiniMax-M3 GLM-5.2 vs MiniMax-M3 Claude Opus 4.7 (Adaptive) vs MiniMax-M3 Qwen3.7 Max vs MiniMax-M3 GPT-5.4 xHigh vs MiniMax-M3 Kimi K2.7 Code vs MiniMax-M3

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

HuggingFace Transformers v5.12.0 Release Notes

MiniMax-M3-VL is the vision-language member of the MiniMax-M3 family that pairs a CLIP-style vision tower with 3D rotary position embeddings with the MiniMax-M3 text backbone. It uses a mixed dense/sparse Mixture-of-Experts decoder with SwiGLU-OAI gated experts and a lightning indexer for block-sparse attention. The model processes images through a Conv3d patch embedding system and includes specialized components for efficient multimodal understanding and generation. The official weights for PP-

vLLM Inference Engine v0.19.0 Release Notes

* Model fixes: NemotronH MTP + Chunked Prefill (#35447), Qwen3-VL video timestamps (#37439), Qwen3.5 GDN quantized models (#37448), Qwen3Next A_log FP32 (#37810), JAIS ALiBi (#37820), RoBERTa CUDA graph position IDs (#37873), AudioFlamingo3/MusicFlamingo (#37643), Music Flamingo loading (#35535), bge-m3 task selection (#37632), Nemotron Parse loading (#37407), GLM OCR patch merger (#37962), PaddleOCR checkpoint compat (#38232), DeepSeek v3.2 params (#33703), MiniMax NVFP4 weight loading (#37214)

vLLM Inference Engine v0.23.0 Release Notes

* **DeepSeek-V4 matures across backends**: Following its introduction in v0.22.0, DeepSeek-V4 received another large hardening and optimization pass. Its sparse MLA metadata is now decoupled from DeepSeek-V3.2 (#44699), it gained a TRTLLM-gen attention kernel (#43827), EPLB support for the Mega-MoE (#43339), selective prefix-cache retention for sliding-window KV cache (#43447), and an index-share feature for DSA MTP (#44420). The model was also detached from `torch.compile` (#43746, #43891), its

Last cached leaderboard date: June 1, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.