Nova Micro

Nova Micro delivers text-only inference at high throughput with per-token pricing below multimodal Nova models in the same generation, purpose-built for latency-sensitive applications at scale.

import { streamText } from 'ai'

const result = streamText({
  model: 'amazon/nova-micro',
  prompt: 'Why is the sky blue?'
})

Overview Playground About Providers Throughput Latency Uptime Status Similar FAQ

Throughput24 hours

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). See the docs for more information.

AI Cloud

Core Platform

Security

Company

Learn

Open Source

Use Cases

Tools

Users

Nova Micro