LLM Leaderboard · Open Weight

Nemotron 3 Ultra benchmarks, pricing, and LLM comparison.

Compare Nemotron 3 Ultra vs GPT, Claude, Gemini, DeepSeek, open-weight, and frontier AI models using public benchmark scores, token pricing, context window, and access details.

Rank: #31 (AskClash overall score: 20.1)
Price: $0 / $0 (input and output token price, when published). Context: 1M.
Access: API (billing and access path cached for this model row).

Nemotron 3 Ultra benchmark snapshot

AskClash combines public LLM benchmark cells into a weighted percentile score and penalizes missing coverage so narrow rows do not dominate better-measured models.
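The exact AskClash formula is not published on this page, so the following is only a minimal sketch of how a weighted percentile score with a coverage penalty could work. The function names, the per-benchmark weights, the linear coverage penalty, and the sample numbers are all hypothetical and are not taken from AskClash.

```python
from bisect import bisect_right


def percentile_rank(value, population):
    """Return the percentage (0 to 100) of population scores that are <= value."""
    ordered = sorted(population)
    return 100.0 * bisect_right(ordered, value) / len(ordered)


def overall_score(model_cells, field_scores, weights, total_benchmarks):
    """Weighted mean percentile over measured cells, scaled by coverage.

    model_cells:      {benchmark: score} for the model being ranked
    field_scores:     {benchmark: [scores of all models on that benchmark]}
    weights:          {benchmark: weight}; missing entries default to 1.0 (assumption)
    total_benchmarks: number of benchmarks the leaderboard tracks
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for bench, value in model_cells.items():
        w = weights.get(bench, 1.0)
        weighted_sum += w * percentile_rank(value, field_scores[bench])
        weight_total += w
    if weight_total == 0:
        return 0.0
    # Assumed penalty: scale linearly by the fraction of benchmarks covered,
    # so a model measured on few benchmarks cannot outrank well-measured ones.
    coverage = len(model_cells) / total_benchmarks
    return (weighted_sum / weight_total) * coverage


# Hypothetical data: two measured cells out of four tracked benchmarks.
cells = {"bench_a": 80, "bench_b": 50}
field = {"bench_a": [60, 70, 80, 90], "bench_b": [50, 60]}
weights = {"bench_a": 2.0, "bench_b": 1.0}
score = overall_score(cells, field, weights, total_benchmarks=4)
```

Under these assumptions, a model with strong results on only a few benchmarks is pulled down in proportion to its missing cells, which matches the stated goal of keeping narrow rows from dominating.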

Overall: 20.1
Benchmark cells: 6
Context: 1M
Creator: NVIDIA

Nemotron 3 Ultra public benchmark scores

Cached benchmark values can include HLE, GPQA, SWE-bench, SWE-Pro, SWE-Atlas, Terminal-Bench, MCP Atlas, MMMU-Pro, ARC-AGI-2, Tau2, and model-specific coding or agent scores.

HLE: 26.7
GPQA: 87.0
IFEval: 81.7
SWE-bench: 71.9
Terminal-Bench: 56.4
LiveCodeBench: 89.0

Nemotron 3 Ultra vs other AI models

Use these comparison links to evaluate Nemotron 3 Ultra against nearby LLMs by benchmark score, price, context window, and provider.

Related AI and tech coverage

Cached AskClash article matches that can provide release, provider, benchmark, pricing, or market context around this model.

EleutherAI GPT-NeoX — Repository Overview

This repository records [EleutherAI](https://www.eleuther.ai)'s library for training large-scale language models on GPUs. Our current framework is based on NVIDIA's [Megatron Language Model](https://github.com/NVIDIA/Megatron-LM) and has been augmented with techniques from [DeepSpeed](https://www.deepspeed.ai) as well as some novel optimizations. We aim to make this repo a centralized and accessible place to gather techniques for training large-scale autoregressive language models, and accelerat

Anthropic Launches Claude Fable 5 Amid DeFi Security Concerns

Anthropic, an artificial intelligence (AI) safety and research company known for developing Claude, has launched Claude Fable 5, its most powerful AI model, now available to the public. Anthropic said the company removed some advanced cybersecurity features, which are capable of finding software vulnerabilities on their own. Alongside Fable 5, Anthropic also introduced Claude Mythos 5, a more powerful version of the model. The company said it can find unknown software vulnerabilities on its own,

vLLM Inference Engine — Repository Overview

Support for NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs and GPUs, PowerPC CPUs, Arm CPUs, and TPU. Additionally, support for diverse hardware plugin

Last cached leaderboard date: June 9, 2026. This model page is generated from the AskClash LLM Leaderboard cache and linked from the live leaderboard.