Skip to content

Imagen 4

Imagen 4 is Google's standard-tier image generation model, the default production choice in the Imagen 4 family, balancing photorealistic quality with practical generation speed for commercial photography, scene generation, and everyday visual asset pipelines.

Image Gen
index.ts
import { experimental_generateImage as generateImage } from 'ai';
const result = await generateImage({
model: 'google/imagen-4.0-generate-001',
prompt: 'A red balloon on a wooden table.'
});

Frequently Asked Questions

  • How does the standard tier compare to the ultra tier for realistic photography?

    The standard tier delivers strong photorealistic quality suitable for most commercial photography use cases, including product shots, scenes, and environmental images. The ultra tier (google/imagen-4.0-ultra-generate-001) targets the absolute fidelity ceiling: fine texture detail, color depth, and rendering precision that matters in print, large-format display, or publication contexts. For web and digital applications, the standard tier is often sufficient.

  • How accurately does this model follow complex prompts?

    Imagen 4 follows complex prompts closely. The model renders specific lighting descriptions, material properties, compositional directions, and subject characteristics faithfully. For production workflows where prompt investment is significant, this reliability matters.

  • Is this model suitable for e-commerce product imagery?

    Yes. Commercial product photography is a well-suited use case, with accurate material rendering, controlled backgrounds, and consistent quality across SKUs. For catalog-scale generation with high quality, this is the standard tier to evaluate first.

  • How do I call this model from the AI SDK?

    Use experimental_generateImage (aliased as generateImage) with model: 'google/imagen-4.0-generate-001'.

  • Does this model support image-to-image editing?

    This is a text-to-image generation model. For editing and inpainting, check the AI Gateway model catalog for models that explicitly support those workflows.

  • Do I need to manage Google API credentials separately?

    No. AI Gateway handles all provider authentication. Connect using your Vercel API key or OIDC token.

  • What is the difference between this model and a multimodal model that can generate images?

    Imagen 4.0 Generate is an image-only model: the only output is image data. Multimodal models like google/gemini-3-pro-image can generate images alongside text explanations, analysis, or instructions. Use the image-only model when you want clean, structured image output without a text layer.

  • Can I test this model in the playground?

    Yes. Visit https://ai-sdk.dev/playground/vertex:imagen-4.0-generate-001 to generate images directly from the AI Gateway model playground without writing code.