Model overview
- Model ID: nvidia/nemotron-3-ultra-550b-a55b
- Provider: nvidia
- Type: language
- Context window: 1,000,000 tokens
- Max output tokens: 65,000
- Tags: reasoning, tool-use, implicit-caching
Model pricing
| Metric | Value |
|---|---|
| Input tokens (/1M) | $0.60 |
| Output tokens (/1M) | $2.40 |
| Image generation | n/a |
| Cached input read (/1M) | $0.12 |
| Cached input write (/1M) | n/a |
| Pricing source | gateway |
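
To make the per-million rates concrete, the following is a minimal sketch of a per-request cost estimate using the values from the table. The function name `estimate_cost` and its parameters are hypothetical, and the billing rule is an assumption: input tokens served from cache are charged at the cached-read rate in place of the regular input rate. The gateway's actual billing may differ.

```python
from decimal import Decimal

# Rates in USD per 1M tokens, copied from the pricing table above.
INPUT_PER_M = Decimal("0.60")
OUTPUT_PER_M = Decimal("2.40")
CACHED_READ_PER_M = Decimal("0.12")
MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is the portion of input_tokens served from
    cache, billed at the cached-read rate instead of the input rate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PER_M
        + cached_tokens * CACHED_READ_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / MILLION


if __name__ == "__main__":
    # 200,000 input tokens (150,000 of them cached) and 4,000 output tokens.
    print(estimate_cost(200_000, 4_000, cached_tokens=150_000))
```

Under these assumptions, the example request costs (50,000 × $0.60 + 150,000 × $0.12 + 4,000 × $2.40) / 1,000,000 = $0.0576. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.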