Skip to content

Nova Micro

Nova Micro delivers text-only inference at high throughput with per-token pricing below multimodal Nova models in the same generation, purpose-built for latency-sensitive applications at scale.

index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'amazon/nova-micro',
prompt: 'Why is the sky blue?'
})
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.