Nova Lite
Nova Lite brings multimodal understanding of text, images, and up to 30 minutes of video to a context of 300K tokens at low per-token rates, making vision-enabled production pipelines practical at scale.
import { streamText } from 'ai'
const result = streamText({ model: 'amazon/nova-lite', prompt: 'Why is the sky blue?'})Frequently Asked Questions
How much video can Nova Lite handle in a single request?
Up to 30 minutes of video content per request, processed within the context window of 300K tokens. This covers most meeting recordings, lecture segments, and training videos without splitting.
What visual tasks is Nova Lite well suited for?
Extraction and classification tasks. Examples include pulling structured data from screenshots, categorizing product photos, reading forms and receipts, and flagging content in moderation pipelines. It's optimized for speed and volume rather than complex visual reasoning.
Can I mix images and text in the same prompt?
Yes. Nova Lite accepts multiple images alongside text within a single request, which is useful for comparing two documents side by side, processing multi-page forms, or enriching product listings with photo analysis.
Does Nova Lite produce any visual output?
No. Nova Lite generates text output only. It analyzes visual inputs but does not create them.
How does the pricing compare to other multimodal models in the Nova family?
Current pricing is shown on this page. AI Gateway routes across providers, and rates may vary by provider.
What is the tradeoff compared to Nova Pro for image analysis?
Nova Lite prioritizes throughput and cost; Nova Pro prioritizes accuracy. If your images contain dense tables, fine-print legal text, or complex diagrams requiring precise interpretation, Pro produces more reliable results. For routine classification and extraction, Lite is the more efficient choice.