Gemini Embedding 001
Gemini Embedding 001 is Google's generally available text embedding model. It holds a strong position on the Massive Text Embedding Benchmark (MTEB) Multilingual leaderboard across retrieval, classification, and other tasks, offers flexible output dimensions through Matryoshka Representation Learning, and supports over 100 languages.
import { embed } from 'ai';

const result = await embed({
  model: 'google/gemini-embedding-001',
  value: 'Sunny day at the beach',
});

Frequently Asked Questions
How does gemini-embedding-001 rank on MTEB Multilingual?
It ranks highly on the MTEB Multilingual leaderboard, a position it has maintained since its experimental launch.
What is Matryoshka Representation Learning and how does it affect output dimensions?
MRL is a training technique that nests information across dimension scales, allowing the model to produce embeddings that remain meaningful when truncated to smaller sizes. Google recommends 3,072, 1,536, or 768 dimensions; the default is 3,072 for highest quality.
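To illustrate what truncation means in practice, here is a minimal sketch that keeps the first N values of an embedding and rescales the result to unit length, a common step before comparing vectors by cosine similarity or dot product. The helper name `truncateEmbedding` is hypothetical and not part of any SDK; the sketch assumes the embedding is available as a plain `number[]`.

```typescript
// Hypothetical helper (not part of the AI SDK): keep the first `dims`
// values of an embedding and rescale the result to unit length.
function truncateEmbedding(embedding: number[], dims: number): number[] {
  if (!Number.isInteger(dims) || dims <= 0 || dims > embedding.length) {
    throw new RangeError(
      `dims must be an integer between 1 and ${embedding.length}`,
    );
  }

  // MRL-trained models place the most important information first,
  // so truncation is a simple prefix slice.
  const head = embedding.slice(0, dims);

  // Rescale so the shortened vector has length 1 again.
  const norm = Math.sqrt(head.reduce((sum, x) => sum + x * x, 0));

  // A zero vector cannot be normalized; return it unchanged.
  return norm === 0 ? head : head.map((x) => x / norm);
}
```

For example, `truncateEmbedding(fullVector, 768)` would reduce a 3,072-dimension vector to one of the smaller recommended sizes, trading some quality for lower storage and faster similarity search.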
How many languages does gemini-embedding-001 support?
The model supports over 100 languages, consistent with its strong multilingual benchmark results.
What is the maximum input token length per request?
Each embedding request accepts a maximum of 2,048 input tokens.
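Longer documents need to be split before embedding. The sketch below shows one rough way to do that on whitespace boundaries. It assumes about 4 characters per token, which is only a heuristic and not the model's tokenizer; the ratio varies by language, so a smaller value is safer for multilingual text. The function name `splitForEmbedding` and both constants are hypothetical.

```typescript
// ASSUMPTION: ~4 characters per token. This is a rough heuristic,
// not the model's real tokenizer; lower it to be more conservative.
const APPROX_CHARS_PER_TOKEN = 4;
const MAX_INPUT_TOKENS = 2048;

// Hypothetical helper: split text into chunks that should fit within
// the per-request token limit under the heuristic above.
function splitForEmbedding(
  text: string,
  maxTokens: number = MAX_INPUT_TOKENS,
  charsPerToken: number = APPROX_CHARS_PER_TOKEN,
): string[] {
  const maxChars = Math.max(1, Math.floor(maxTokens * charsPerToken));
  const chunks: string[] = [];
  let current = '';

  for (const word of text.split(/\s+/)) {
    if (word.length === 0) continue;

    // Hard-split any single word that exceeds the budget on its own.
    for (let i = 0; i < word.length; i += maxChars) {
      const piece = word.slice(i, i + maxChars);
      const candidate = current.length === 0 ? piece : `${current} ${piece}`;

      if (candidate.length <= maxChars) {
        current = candidate;
      } else {
        // The current chunk is full; start a new one with this piece.
        chunks.push(current);
        current = piece;
      }
    }
  }

  if (current.length > 0) chunks.push(current);
  return chunks;
}
```

Each returned chunk can then be embedded as its own request. Because the character budget is only an estimate, a request may still be rejected for length; in that case, reduce `charsPerToken` and split again.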
What is the pricing?
This page lists the current rates. Multiple providers can serve Gemini Embedding 001, so AI Gateway surfaces live pricing rather than a single fixed figure.