Name: nemotron-3-ultra-550b-a55b
Brand: NVIDIA
SKU: nvidia/nemotron-3-ultra-550b-a55b
Availability: InStock

Question 1

How much does nemotron-3-ultra-550b-a55b cost?

Accepted Answer

nemotron-3-ultra-550b-a55b is priced at Free per million input tokens and Free per million output tokens when accessed via Requesty.  Requesty charges exactly what the upstream provider charges, we don't add markup.

Question 2

What is the context window of nemotron-3-ultra-550b-a55b?

Accepted Answer

nemotron-3-ultra-550b-a55b has a context window of 1.0M tokens, with a maximum output of 66K tokens per response. That's roughly 1,398 words of input you can fit in a single prompt.

Question 3

How does nemotron-3-ultra-550b-a55b perform on benchmarks?

Accepted Answer

nemotron-3-ultra-550b-a55b scores 86.7% on GPQA Diamond, 83.3% on τ²-Bench, 49.3% on Coding Index. See the full benchmark chart above for results across MMLU Pro, GPQA Diamond, SWE-Bench Verified, HumanEval, MATH, AIME, MMMU, and LiveBench.

Question 4

What can nemotron-3-ultra-550b-a55b do?

Accepted Answer

nemotron-3-ultra-550b-a55b supports tool calling, extended reasoning. You can call it through any OpenAI-compatible client by pointing base_url to Requesty.

Question 5

How do I use nemotron-3-ultra-550b-a55b with the OpenAI SDK?

Accepted Answer

Install the OpenAI SDK, set base_url to "https://router.requesty.ai/v1", set your API key to your Requesty key, and set the model to "nvidia/nemotron-3-ultra-550b-a55b". The Quickstart above shows Python, JavaScript and cURL snippets.

Question 6

Can I run nemotron-3-ultra-550b-a55b through Requesty?

Accepted Answer

Yes. nemotron-3-ultra-550b-a55b runs through Requesty's OpenAI-compatible API, served from NVIDIA. You do not host the model yourself: point base_url at Requesty, set the model to "nvidia/nemotron-3-ultra-550b-a55b", and requests are routed to the upstream provider with automatic failover. The same key gives you 600+ other models too.

nemotron-3-ultra-550b-a55b

Specifications

Benchmarks

Pricing

Quickstart

Other NVIDIA models

Frequently asked questions

Access nemotron-3-ultra-550b-a55b through Requesty