Question 1

How much video can Nova Lite handle in a single request?

Accepted Answer

Up to 30 minutes of video content per request, processed within the context window of 300K tokens. This covers most meeting recordings, lecture segments, and training videos without splitting.

Question 2

What visual tasks is Nova Lite well suited for?

Accepted Answer

Extraction and classification tasks. Examples include pulling structured data from screenshots, categorizing product photos, reading forms and receipts, and flagging content in moderation pipelines. It's optimized for speed and volume rather than complex visual reasoning.

Question 3

Can I mix images and text in the same prompt?

Accepted Answer

Yes. Nova Lite accepts multiple images alongside text within a single request, which is useful for comparing two documents side by side, processing multi-page forms, or enriching product listings with photo analysis.

Question 4

Does Nova Lite produce any visual output?

Accepted Answer

No. Nova Lite generates text output only. It analyzes visual inputs but does not create them.

Question 5

How does the pricing compare to other multimodal models in the Nova family?

Accepted Answer

Current pricing is shown on this page. AI Gateway routes across providers, and rates may vary by provider.

Question 6

What is the tradeoff compared to Nova Pro for image analysis?

Accepted Answer

Nova Lite prioritizes throughput and cost; Nova Pro prioritizes accuracy. If your images contain dense tables, fine-print legal text, or complex diagrams requiring precise interpretation, Pro produces more reliable results. For routine classification and extraction, Lite is the more efficient choice.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Nova Lite

Frequently Asked Questions