Skip to content

Gemini Embedding 001

Gemini Embedding 001 is Google's generally available text embedding model, with a strong position on the Massive Text Embedding Benchmark (MTEB) Multilingual leaderboard across retrieval, classification, and other tasks, with Matryoshka-based dimension flexibility and support for over 100 languages.

index.ts
import { embed } from 'ai';
const result = await embed({
model: 'google/gemini-embedding-001',
value: 'Sunny day at the beach',
})

Frequently Asked Questions

  • How does gemini-embedding-001 rank on MTEB Multilingual?

    It ranks highly on the MTEB Multilingual leaderboard, a position it has maintained since its experimental launch.

  • What is Matryoshka Representation Learning and how does it affect output dimensions?

    MRL is a training technique that nests information across dimension scales, allowing the model to produce embeddings that remain meaningful when truncated to smaller sizes. Google recommends 3,072, 1,536, or 768 dimensions; the default is 3,072 for highest quality.

  • How many languages does gemini-embedding-001 support?

    The model supports over 100 languages, consistent with its strong multilingual benchmark results.

  • What is the maximum input token length per request?

    A maximum of 2,048 input tokens per embedding request.

  • What is the pricing?

    This page lists the current rates. Multiple providers can serve Gemini Embedding 001, so AI Gateway surfaces live pricing rather than a single fixed figure.

  • How is gemini-embedding-001 priced on AI Gateway?

    This page lists the current rates. Multiple providers can serve Gemini Embedding 001, so AI Gateway surfaces live pricing rather than a single fixed figure.