Nova 2 Lite

amazon/nova-2-lite

Nova 2 Lite is a second-generation multimodal reasoning model with a context window of 1M tokens. It supports configurable extended thinking, web grounding, and code execution at a cost tier built for everyday production use.

Reasoning, Vision (Image)
index.ts
import { streamText } from 'ai'

const result = streamText({
  model: 'amazon/nova-2-lite',
  prompt: 'Why is the sky blue?'
})

What To Consider When Choosing a Provider

  • Zero Data Retention

    AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.

  • Authentication

    AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When extended thinking is on, reasoning tokens are billed at the output token rate. If you use medium or high thinking budgets often, monitor output token spend in the AI Gateway cost dashboard.
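
As a rough illustration of the billing rule above, the sketch below adds reasoning tokens to visible output tokens and prices both at the same output rate. The function name and the per-million price are hypothetical placeholders, not published Nova 2 Lite pricing; substitute the rate listed for the model in AI Gateway.

```typescript
// Sketch: reasoning tokens are billed at the output token rate,
// so they are added to the visible output tokens before pricing.
// The price passed in is a hypothetical placeholder, not a real rate.
function estimateOutputCost(
  outputTokens: number,
  reasoningTokens: number,
  outputPricePerMillionUsd: number,
): number {
  const billedTokens = outputTokens + reasoningTokens
  return (billedTokens / 1000000) * outputPricePerMillionUsd
}

// Made-up numbers: 500 visible tokens, 1,500 reasoning tokens,
// and an assumed price of $2.00 per million output tokens.
const withThinking = estimateOutputCost(500, 1500, 2.0)
const withoutThinking = estimateOutputCost(500, 0, 2.0)
console.log({ withThinking, withoutThinking })
```

With these assumed inputs, the thinking-enabled request is billed for four times as many output tokens as the same answer with thinking off, which is why frequent medium or high budgets are worth tracking.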

When to Use Nova 2 Lite

Best For

  • Mixed-format document processing:

    Combine PDFs, images, and text in a single long-context request

  • Current information workflows:

    Web grounding provides up-to-date information with citations when the provider returns them

  • Agentic task orchestration:

    The model calls external tools across multiple steps

  • Selective extended thinking:

    Turn on deeper reasoning for analytical tasks and leave thinking off for fast extraction by default

  • Unified video and document analysis:

    Video and image analysis combined with document comprehension in a single context of 1M tokens

Consider Alternatives When

  • Text-only cost-focused workloads:

    Nova Micro remains cheaper and faster for pure text processing

  • Simple multimodal tasks:

    First-generation Nova Lite handles simpler vision tasks at lower cost without reasoning overhead

  • Highest structured-document accuracy:

    Evaluate Nova Pro when second-gen reasoning features aren't required

Conclusion

Nova 2 Lite pairs a large multimodal context with optional extended thinking, web grounding, and code execution. You pay for reasoning tokens only when you turn thinking on. Teams that hit limits on first-generation Nova Lite often move here before jumping to the highest-cost tiers.

FAQ

How do I enable extended thinking?

Under providerOptions.bedrock.reasoningConfig, set maxReasoningEffort to low, medium, or high. Extended thinking is off by default, so you avoid reasoning-token charges until you enable it.
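
A minimal sketch of the setting described above, written as a helper that builds the provider options for a request. The helper name `thinkingOptions` is hypothetical, and the `type: 'enabled'` field is an assumption about the shape of the reasoning config; confirm the exact fields in the provider documentation before relying on it.

```typescript
type ReasoningEffort = 'low' | 'medium' | 'high'

type ReasoningConfig = { type: string; maxReasoningEffort: ReasoningEffort }

// Hypothetical helper: returns provider options for a request.
// With no effort given, it returns an empty object, so extended
// thinking stays off (the default) and no reasoning tokens are billed.
function thinkingOptions(
  effort?: ReasoningEffort,
): Record<string, Record<string, ReasoningConfig>> {
  if (!effort) return {}
  return {
    bedrock: {
      reasoningConfig: {
        type: 'enabled', // assumed field; check the provider docs
        maxReasoningEffort: effort,
      },
    },
  }
}

// Usage (assumes the `ai` package, as in index.ts):
//   streamText({
//     model: 'amazon/nova-2-lite',
//     prompt: 'Compare these two contracts.',
//     providerOptions: thinkingOptions('medium'),
//   })
console.log(JSON.stringify(thinkingOptions('medium')))
```

Keeping the effort level as an optional argument makes it easy to leave thinking off for routine extraction calls and pass `'medium'` or `'high'` only for analytical ones.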

What does web grounding do?

The model can search the public web during a request and ground answers in what it finds, with citations when the provider returns them. Use it when answers need current facts, not only the training snapshot.

What is the context window?

1M tokens. That's more than three times the 300K-token context in Nova Lite and Nova Pro, and nearly eight times Nova Micro's 128K.

Does Nova 2 Lite support code execution?

Yes. Amazon documents code execution for Nova 2 models so the model can run code as part of the response flow. Use it for calculations, data transforms, and similar tasks the provider supports.

How are reasoning tokens billed?

Reasoning tokens are billed at the output token rate. With thinking disabled (the default), you pay only standard input and output costs.

Is Nova 2 Lite a drop-in replacement for first-generation Nova Lite?

No, it's a successor but not a drop-in replacement. Nova 2 Lite has different pricing and a different capability profile. For simple multimodal tasks where first-gen Nova Lite performs well, upgrading may not cut costs. The value lies in the reasoning, grounding, and code execution features the original lacked.