Veo 3.1

Veo 3.1 is Google's flagship video model in the Veo 3.1 generation on AI Gateway, the quality ceiling of that generation, with strong motion fidelity, native audio-visual synchronization, and image-to-video support for professional production workflows.

text-to-videoimage-to-video

index.ts

import { experimental_generateVideo as generateVideo } from 'ai';

const result = await generateVideo({
  model: 'google/veo-3.1-generate-001',
  prompt: 'A serene mountain lake at sunrise.'
});

Overview About Providers Similar FAQ

About Veo 3.1

Veo 3.1 represents the top of the Veo 3.1 generation on AI Gateway. This 3.1 standard-quality configuration applies full generation compute to each request. The Veo 3.1 generation improved motion physics accuracy, object coherence across frames, and prompt adherence relative to 3.0. In standard-quality mode, these improvements are most apparent: frame-to-frame consistency is tighter, object boundaries hold through complex motion, and audio-visual synchronization is more precise.

This is the endpoint for video workflows that have moved through the iteration phase and are generating final deliverables. A typical production workflow uses Veo 3.1 Fast for prompt exploration (generating many variations quickly), then routes the validated direction to Veo 3.1 for the final render. Both configurations share the same generation architecture; the distinction is how much compute each generation receives.

Image-to-video generation is fully supported for workflows that animate reference assets: product photography in motion, character illustrations brought to life, or architectural visualizations with simulated lighting. Native audio generation covers ambient sound, effects, and synchronized dialogue without post-production audio work.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Veo 3.1

About Veo 3.1