About Nova 2 Lite

Nova 2 Lite is the cost-focused model in Amazon's second-generation Nova family. The context window moves from 300K tokens on first-gen Nova Lite and Nova Pro up to 1M tokens. Extended thinking lets the model run deliberate multi-step reasoning before it answers. You set it to low, medium, or high. It stays off by default when you don't need it.

Amazon pairs Nova 2 Lite and Nova 2 Pro with built-in web grounding and code execution: the model can pull current public information and run code so answers stay tied to fresh data, not only to training cutoffs.

Inputs include text, images, video, and documents (including PDFs) within the window of 1M tokens. That covers long documents with images, video with transcripts, and similar mixed workloads that first-generation Nova models handle with tighter limits.

What To Consider When Choosing a Provider

Configuration: When extended thinking is on, reasoning tokens are billed at the output token rate. If you use medium or high thinking budgets often, monitor output token spend in the AI Gateway cost dashboard.
Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Nova 2 Lite

Best for

Mixed-format document processing: Combine PDFs, images, and text in a single long-context request
Current information workflows: Web grounding provides up-to-date information with citations when the provider returns them
Agentic task orchestration: The model calls external tools across multiple steps
Selective extended thinking: Toggle deeper reasoning on analytical tasks and use fast extraction by default
Unified video and document analysis: Video and image analysis combined with document comprehension in a single context of 1M tokens

Consider alternatives when

Text-only cost-focused workloads: Nova Micro remains cheaper and faster for pure text processing
Simple multimodal tasks: First-generation Nova Lite handles simpler vision tasks at lower cost without reasoning overhead
Highest structured-document accuracy: Evaluate Nova Pro when second-gen reasoning features aren't required

Conclusion

Nova 2 Lite pairs a large multimodal context with optional extended thinking, web grounding, and code execution. You pay extra reasoning tokens only when you turn thinking on. Teams that hit limits on first-generation Nova Lite often move here before jumping to the highest-cost tiers.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Nova 2 Lite

Playground

Providers

More models by Amazon