LLM Leaderboard · Open Weight

Qwen3.5 397B benchmarks, pricing, and LLM comparison.

Compare Qwen3.5 397B vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank #23AskClash overall score: 31.9
$0.60 / $3.60Input and output token price, when published. Context: 128K.
APIBilling and access path cached for this model row.

Qwen3.5 397B benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.

Overall31.9
Benchmark cells9
Context128K
CreatorAlibaba

Qwen3.5 397B public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE

28.7 score

GPQA

88.4 score

IFEval

92.6 score

SWE-bench

76.2 score

LiveCodeBench

83.6 score

MCP Atlas

46.1 score

CharXiv

80.8 score

MMMU-Pro

79.0 score

Tau2

83.9 score

Qwen3.5 397B vs other AI models

Use these comparison links to evaluate Qwen3.5 397B against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

Pentagon adds Alibaba and Baidu to list of firms linked to Chinese military

Pentagon Adds Alibaba And Baidu To List Of Firms Linked To Chinese Military Latest News AI Apps Computing Mobile Social Media EVs & Transportation Reviews Smartphones Laptops & PCs Gaming Headphones Wearables Photography Tablets Home Buying Guides Laptops Headphones Smart Home Gaming Gaming Nintendo PC PlayStation Xbox Big Tech Amazon Apple Google Meta Microsoft Samsung Entertainment TV & Movies Streaming Cybersecurity VPN Wearables Tomorrow Science Space Robotics Newsletter About Editorial Poli

Shifting to AI model customization is an architectural imperative

In the early days of large language models (LLMs), we grew accustomed to massive 10x jumps in reasoning and coding capability with every new model iteration. Today, those jumps have flattened into incremental gains. The exception is domain-specialized intelligence, where true step-function improvements are still the norm. When a model is fused with an organization’s…

Last cached leaderboard date: June 9, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.