Nemotron 3 Ultra
A 550B parameter (55B active) open reasoning model from NVIDIA, built for long-running agent workflows. It uses a hybrid Mamba-Transformer MoE architecture and supports a 1M token context window.
import { streamText } from 'ai'
const result = streamText({ model: 'nvidia/nemotron-3-ultra-550b-a55b', prompt: 'Why is the sky blue?'})Playground
Try out Nemotron 3 Ultra by NVIDIA. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.