Gemini 3 Pro Preview
Gemini 3 Pro Preview is the flagship reasoning model in the Gemini 3 generation for demanding agentic and analytical tasks, with improvements in multi-step function calling, complex image reasoning, long-document analysis, and instruction following over Gemini 2.5 Pro.
import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-3-pro-preview',
  prompt: 'Why is the sky blue?',
})

Frequently Asked Questions
What reasoning capabilities does Gemini 3 Pro Preview have over Gemini 2.5 Pro?
Four specific improvements: multi-step function calling, planning, reasoning over complex images and long documents, and instruction following. These directly address the reliability gaps that affect agentic workflows at scale.
How do I enable reasoning traces in my application?
Set includeThoughts to true under providerOptions.google.thinkingConfig in the AI SDK. Use streamText for streaming, and the model emits reasoning tokens alongside the generated response.
Is Gemini 3 Pro Preview suitable for real-time user-facing applications?
It can be, but it is a reasoning model with higher latency than the Flash tier. For interactive applications where sub-second responses are required, google/gemini-3-flash provides pro-grade reasoning at significantly lower latency.
Does the model support analyzing PDFs and complex documents?
Yes. The model handles long documents with embedded charts, diagrams, and images. Improved reasoning over complex images and long documents is one of its headline capabilities over Gemini 2.5 Pro.
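As a rough sketch of how a PDF might be attached alongside a question: the part shapes below are simplified local stand-ins modeled on AI SDK 5 message parts (a file part carrying data and mediaType), and buildPdfQuestion is a hypothetical helper, not an SDK function. Field names can differ between SDK versions, so check them against the version you use.

```typescript
// Simplified local stand-ins for AI SDK message parts (assumed shapes, not imported types).
type TextPart = { type: 'text'; text: string };
type FilePart = { type: 'file'; data: Uint8Array | string; mediaType: string };
type UserMessage = { role: 'user'; content: Array<TextPart | FilePart> };

// Hypothetical helper: pairs a question with a PDF in a single user message.
function buildPdfQuestion(pdf: Uint8Array | string, question: string): UserMessage {
  return {
    role: 'user',
    content: [
      { type: 'text', text: question },
      // `pdf` may be raw bytes or a base64 string; the media type marks it as a PDF.
      { type: 'file', data: pdf, mediaType: 'application/pdf' },
    ],
  };
}
```

With the AI SDK installed, the returned message would go in the messages array of a streamText call that uses model 'google/gemini-3-pro-preview'.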
What is the difference between Gemini 3 Pro Preview and Gemini 3.1 Pro?
Gemini 3.1 Pro introduces additional quality improvements for software engineering and agentic tasks, enhanced usability for finance and spreadsheet applications, and more efficient thinking that reduces token consumption. Gemini 3 Pro Preview was the initial release; 3.1 Pro builds on that foundation.
Does Gemini 3 Pro Preview require a Google Cloud account when accessed via AI Gateway?
No. AI Gateway manages all underlying provider credentials. You authenticate once using a Vercel API key or OIDC token.
What does multi-step function calling improvement mean in practice?
The model more reliably executes sequences of tool calls: choosing the right tool, interpreting its output, deciding whether to call another tool, and knowing when the task is complete. This reduces the need for human intervention to correct routing errors mid-workflow.
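The sequence described above (pick a tool, read its output, decide whether to continue, stop when done) can be sketched as a small loop. Everything here is illustrative: Decision, Observation, Tool, Decide, and runToolLoop are hypothetical names, and the decide function stands in for the model. The AI SDK runs a comparable loop internally when you pass it tools and a step limit, so this is only a model of the behavior, not the SDK's implementation.

```typescript
// Hypothetical types for illustration; these are not AI SDK types.
type Decision =
  | { kind: 'tool'; name: string; input: unknown }
  | { kind: 'final'; text: string };
type Observation = { tool: string; output: unknown };
type Tool = (input: unknown) => Promise<unknown>;
// Stand-in for the model: looks at prior tool results and picks the next action.
type Decide = (history: Observation[]) => Promise<Decision>;

async function runToolLoop(
  decide: Decide,
  tools: Map<string, Tool>,
  maxSteps = 5,
): Promise<{ text: string; history: Observation[] }> {
  const history: Observation[] = [];
  for (let step = 0; step < maxSteps; step++) {
    const decision = await decide(history);
    // The model signals completion by returning a final answer.
    if (decision.kind === 'final') {
      return { text: decision.text, history };
    }
    const tool = tools.get(decision.name);
    if (tool === undefined) {
      // Routing error: record it so the next decision can recover.
      history.push({ tool: decision.name, output: { error: 'unknown tool' } });
      continue;
    }
    // Run the chosen tool and feed its output back into the next decision.
    history.push({ tool: decision.name, output: await tool(decision.input) });
  }
  throw new Error(`No final answer after ${maxSteps} steps`);
}
```

The step cap is the design choice worth noting: a model that never decides the task is complete would otherwise loop indefinitely, so the sketch fails loudly instead.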
Can I use Gemini 3 Pro Preview for analyzing images alongside text?
Yes. You can pass image inputs alongside text prompts to enable cross-modal analysis within a single request.
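A minimal sketch of such a request, under some assumptions: the part shapes are simplified local stand-ins modeled on AI SDK message parts, buildImageRequest is a hypothetical helper, and the image URL is a placeholder. The options also set includeThoughts under providerOptions.google.thinkingConfig, the setting named in the reasoning-traces answer earlier in this FAQ.

```typescript
// Simplified local stand-ins for AI SDK message parts (assumed shapes, not imported types).
type PromptText = { type: 'text'; text: string };
type PromptImage = { type: 'image'; image: string | Uint8Array };
type ImageUserMessage = { role: 'user'; content: Array<PromptText | PromptImage> };

type ImageRequestOptions = {
  model: string;
  messages: ImageUserMessage[];
  providerOptions: { google: { thinkingConfig: { includeThoughts: boolean } } };
};

// Hypothetical helper: one request carrying both the text prompt and the image.
function buildImageRequest(
  image: string | Uint8Array,
  question: string,
  includeThoughts = true,
): ImageRequestOptions {
  return {
    model: 'google/gemini-3-pro-preview',
    messages: [
      {
        role: 'user',
        content: [
          { type: 'text', text: question },
          // `image` may be an http(s) URL string, a base64 string, or raw bytes.
          { type: 'image', image },
        ],
      },
    ],
    // Ask the Google provider to return reasoning tokens with the response.
    providerOptions: { google: { thinkingConfig: { includeThoughts } } },
  };
}

// Placeholder URL for illustration only.
const imageRequest = buildImageRequest(
  'https://example.com/chart.png',
  'What trend does this chart show?',
);
```

With the AI SDK installed, imageRequest would be passed to streamText in place of the prompt-only options shown at the top of this page.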