Nova Micro

Nova Micro delivers text-only inference at high throughput with per-token pricing below multimodal Nova models in the same generation, purpose-built for latency-sensitive applications at scale.

Tool Use

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'amazon/nova-micro',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Latency Uptime Status Similar FAQ

More models by Amazon

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Capabilities	Providers	ZDR	No Training	Release Date

amazon/nova-2-lite

0.4s

224tps

$0.30/M

$2.50/M

Read:$0.07/M

Write:—

—

12/02/2025

amazon/nova-lite

300K

0.5s

138tps

$0.06/M

$0.24/M

—

12/03/2024

amazon/nova-pro

300K

0.3s

133tps

$0.80/M

$3.20/M

—

12/03/2024

amazon/titan-embed-text-v2

$0.02/M

—

04/30/2024

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Nova Micro

More models by Amazon