Imagen 4

Imagen 4 is Google's standard-tier image generation model, the default production choice in the Imagen 4 family, balancing photorealistic quality with practical generation speed for commercial photography, scene generation, and everyday visual asset pipelines.

index.ts
import { experimental_generateImage as generateImage } from 'ai';

// Request one image from the standard Imagen 4 tier through AI Gateway.
const result = await generateImage({
  model: 'google/imagen-4.0-generate-001',
  prompt: 'A red balloon on a wooden table.',
});

Playground

Try out Imagen 4 by Google. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider: Google Vertex AI
Context: 480
Release Date: 05/22/2025
Legal: Terms, Privacy
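
As a rough sketch of setting a provider preference, the helper below builds the options object that would be spread into a generateImage call. It assumes the gateway reads an ordered list of provider slugs from providerOptions.gateway.order, that this option applies to image models as well as text models, and that 'vertex' is the slug for Google Vertex AI. The helper name withProviderOrder is hypothetical. Check the provider docs before relying on any of this.

```typescript
// Hypothetical helper: wraps a provider preference list in the shape
// assumed for AI Gateway provider options.
function withProviderOrder(order: string[]) {
  if (order.length === 0) {
    throw new Error('order must list at least one provider slug');
  }
  return { providerOptions: { gateway: { order } } };
}

// 'vertex' is assumed to be the slug for Google Vertex AI.
const imageRequest = {
  model: 'google/imagen-4.0-generate-001',
  prompt: 'A red balloon on a wooden table.',
  ...withProviderOrder(['vertex']),
};

console.log(JSON.stringify(imageRequest, null, 2));
```

The resulting object carries the same model and prompt fields as the index.ts example, so it could be passed to generateImage unchanged if the assumptions above hold.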


About Imagen 4

Imagen 4 is the standard tier in Google's Imagen 4 image generation family. It is the typical production choice when a workload needs neither the lowest latency nor the highest fidelity. It launched alongside the fast and ultra variants as part of a new category of image-only models on AI Gateway, distinct from multimodal models that generate images as one output type among several.

The standard tier covers a wide range of commercial use cases: product photography with accurate color reproduction and material rendering, scene generation for e-commerce and marketing, editorial illustrations, and visual asset creation for content platforms. It delivers photorealistic quality across nature, wildlife, and cinematic scenes without requiring the ultra tier's additional cost.

Prompt adherence is a core strength of Imagen 4. The model renders complex descriptions (specific lighting conditions, material properties, background environment, and subject pose) with fidelity to the written description. For production pipelines where prompt engineering is a significant investment, this reliability means a refined prompt produces consistent, repeatable output.
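
One way to keep a refined prompt repeatable is to assemble it from named parts instead of editing free text. The sketch below is illustrative only: the ShotSpec fields and the buildPrompt helper are hypothetical, not part of the AI SDK, and the phrasing it produces is just one plausible template.

```typescript
// Hypothetical prompt template; field names are illustrative only.
interface ShotSpec {
  subject: string;
  pose?: string;
  material?: string;
  lighting?: string;
  background?: string;
}

function buildPrompt(spec: ShotSpec): string {
  const parts = [
    spec.pose ? `${spec.subject}, ${spec.pose}` : spec.subject,
    spec.material && `made of ${spec.material}`,
    spec.lighting && `lit by ${spec.lighting}`,
    spec.background && `against ${spec.background}`,
  ];
  // Drop unset fields so the prompt never contains "undefined".
  return parts.filter((p): p is string => Boolean(p)).join(', ') + '.';
}

const mugPrompt = buildPrompt({
  subject: 'A ceramic coffee mug',
  material: 'matte white stoneware',
  lighting: 'soft window light from the left',
  background: 'a pale oak tabletop',
});

console.log(mugPrompt);
```

Changing a single field, such as the lighting, then yields a new prompt that differs in only that clause, which makes output comparisons across a batch easier to reason about.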

As an image-only model, integration is straightforward. Pass prompts via experimental_generateImage (aliased as generateImage) from the AI SDK. There's no text response to parse, no modality configuration to establish, and no reasoning overhead.
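
Because the only output is image data, a call site can be reduced to "prompt in, bytes out". The sketch below shows that shape with the generator injected, so it runs offline against a stub. The ImageGenerator and GeneratedImage types are a narrowed, hypothetical view of the SDK call (the real types carry more fields), and renderPrompt and stubGenerate are illustrative names.

```typescript
// Narrowed, hypothetical view of the call shape used in this document.
interface GeneratedImage {
  base64: string;
  uint8Array: Uint8Array;
}

type ImageGenerator = (options: {
  model: string;
  prompt: string;
}) => Promise<{ image: GeneratedImage }>;

const MODEL_ID = 'google/imagen-4.0-generate-001';

async function renderPrompt(
  generate: ImageGenerator,
  prompt: string,
): Promise<Uint8Array> {
  if (prompt.trim() === '') {
    throw new Error('prompt must not be empty');
  }
  const { image } = await generate({ model: MODEL_ID, prompt });
  if (image.uint8Array.length === 0) {
    throw new Error('model returned an empty image');
  }
  // No text response to parse: the bytes are the whole result.
  return image.uint8Array;
}

// Stub standing in for the SDK call so the sketch runs without network.
const stubGenerate: ImageGenerator = async () => ({
  image: { base64: 'AQID', uint8Array: new Uint8Array([1, 2, 3]) },
});

renderPrompt(stubGenerate, 'A red balloon on a wooden table.').then((bytes) => {
  console.log(`received ${bytes.length} bytes`);
});
```

In an application, the stub would be replaced by a thin adapter around experimental_generateImage, and the returned bytes would typically be written to a file or object store.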

What To Consider When Choosing a Provider

  • Configuration: Imagen 4 (google/imagen-4.0-generate-001) is a pure image generation model: pass a text prompt to generateImage and receive image data. Unlike multimodal models that can generate both text and images, there is no mixed-modality response to manage.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model on requests sent directly through the gateway. Bring-your-own-key (BYOK) requests are not covered. To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
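
A small preflight check can make the authentication options above visible before the first request. This is a sketch under assumptions: it supposes the API key is supplied in an environment variable named AI_GATEWAY_API_KEY and the OIDC token in VERCEL_OIDC_TOKEN, and that the API key takes precedence when both are set. The detectGatewayCredential helper is hypothetical; the environment is passed in as a plain object so the check stays testable.

```typescript
type Env = Record<string, string | undefined>;
type Credential = 'api-key' | 'oidc' | 'none';

// Hypothetical preflight check; variable names and precedence are assumptions.
function detectGatewayCredential(env: Env): Credential {
  if (env['AI_GATEWAY_API_KEY']) return 'api-key';
  if (env['VERCEL_OIDC_TOKEN']) return 'oidc';
  return 'none';
}

console.log(detectGatewayCredential({ AI_GATEWAY_API_KEY: 'example-key' })); // prints "api-key"
```

In a Node.js process the argument would usually be process.env, and a result of 'none' is a reasonable point to fail fast with a clear message.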

When to Use Imagen 4

Best For

  • Commercial product photography: Photorealistic quality is required at a standard generation cost
  • Marketing and editorial images: Generation across a range of subjects, including scenes, people, objects, and environments
  • Content platform asset pipelines: Consistent, reliable quality across large batches
  • General-purpose image generation: Neither the fast tier's trade-offs nor the ultra tier's premium is justified
  • Quality-first default migration: Applications moving from other image generation APIs that want reliable defaults

Consider Alternatives When

  • Throughput or speed is the priority: Generation speed or volume is the main constraint (consider google/imagen-4.0-fast-generate-001)
  • Publication-grade detail required: Print quality or absolute maximum fidelity is necessary (consider google/imagen-4.0-ultra-generate-001)
  • Text output alongside images: Mixed-modality output is needed (consider google/gemini-3-pro-image)
  • Video generation required: Still images are not sufficient (consider the Veo model family)
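
The tier guidance in the two lists above can be captured as a small lookup, which keeps model slugs out of call sites. The slugs come from this page; the Priority labels and the pickImagenModel helper are hypothetical names chosen for illustration.

```typescript
// Hypothetical priority labels mapped to the Imagen 4 slugs named on this page.
type Priority = 'speed' | 'balanced' | 'max-fidelity';

const IMAGEN_4_TIERS: Record<Priority, string> = {
  speed: 'google/imagen-4.0-fast-generate-001',
  balanced: 'google/imagen-4.0-generate-001',
  'max-fidelity': 'google/imagen-4.0-ultra-generate-001',
};

// Defaults to the standard tier, matching the "sensible default" guidance.
function pickImagenModel(priority: Priority = 'balanced'): string {
  return IMAGEN_4_TIERS[priority];
}

console.log(pickImagenModel());
console.log(pickImagenModel('speed'));
```

Defaulting to the standard tier means a workload has to opt in explicitly to the fast or ultra variant.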

Conclusion

Imagen 4 is the sensible default for production image generation on AI Gateway. It delivers the family's photorealistic quality at a cost and speed profile that suits most workloads. It's the right choice until a specific workload gives you a clear reason to move to the fast or ultra tier.

Frequently Asked Questions

  • How does the standard tier compare to the ultra tier for realistic photography?

    The standard tier delivers strong photorealistic quality suitable for most commercial photography use cases, including product shots, scenes, and environmental images. The ultra tier (google/imagen-4.0-ultra-generate-001) targets the absolute fidelity ceiling: fine texture detail, color depth, and rendering precision that matter in print, large-format display, or publication contexts. For web and digital applications, the standard tier is often sufficient.

  • How accurately does this model follow complex prompts?

    Imagen 4 follows complex prompts closely. The model renders specific lighting descriptions, material properties, compositional directions, and subject characteristics faithfully. For production workflows where prompt investment is significant, this reliability matters.

  • Is this model suitable for e-commerce product imagery?

    Yes. Commercial product photography is a well-suited use case, with accurate material rendering, controlled backgrounds, and consistent quality across SKUs. For high-quality generation at catalog scale, this is the tier to evaluate first.

  • How do I call this model from the AI SDK?

    Use experimental_generateImage (aliased as generateImage) with model: 'google/imagen-4.0-generate-001'.

  • Does this model support image-to-image editing?

    This is a text-to-image generation model. For editing and inpainting, check the AI Gateway model catalog for models that explicitly support those workflows.

  • Do I need to manage Google API credentials separately?

    No. AI Gateway handles all provider authentication. Connect using your Vercel API key or OIDC token.

  • What is the difference between this model and a multimodal model that can generate images?

    Imagen 4 (google/imagen-4.0-generate-001) is an image-only model: the only output is image data. Multimodal models like google/gemini-3-pro-image can generate images alongside text explanations, analysis, or instructions. Use the image-only model when you want clean, structured image output without a text layer.

  • Can I test this model in the playground?

    Yes. Visit https://ai-sdk.dev/playground/vertex:imagen-4.0-generate-001 to generate images directly from the AI Gateway model playground without writing code.