Voyage Rerank 2.5 Lite
Voyage Rerank 2.5 Lite is Voyage AI's generalist reranker balanced for latency and quality. It supports a context window of 32K tokens, instruction-following, and multilingual reranking. It improves accuracy by 7.16% over Cohere Rerank v3.5 across 93 retrieval datasets.
import { rerank } from 'ai';

const result = await rerank({
  model: 'voyage/rerank-2.5-lite',
  query: 'What is the capital of France?',
  documents: [
    'Paris is the capital of France.',
    'Berlin is the capital of Germany.',
    'Madrid is the capital of Spain.',
  ],
});

Frequently Asked Questions
What is the difference between Voyage Rerank 2.5 Lite and rerank-2.5?
rerank-2.5 is the quality-optimized tier of the series and improves on Cohere Rerank v3.5 by 7.94%. Voyage Rerank 2.5 Lite is the latency-and-quality balanced tier and improves on Cohere Rerank v3.5 by 7.16%. Both support instruction-following, the same context window of 32K tokens, and multilingual retrieval.
How does Voyage Rerank 2.5 Lite compare to Qwen3-Reranker-8B?
Voyage AI reports that Voyage Rerank 2.5 Lite performs better than Qwen3-Reranker-8B on the published benchmark suite despite being over an order of magnitude smaller. Averaged across four first-stage retrieval methods, Voyage Rerank 2.5 Lite outperforms Qwen3-Reranker-8B by 1.01% on NDCG@10.
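For readers unfamiliar with the metric, NDCG@10 scores the top 10 positions of a ranking against graded relevance labels and normalizes by the best achievable ordering. The sketch below shows one common formulation (linear gain with a log2 discount). The function names and sample labels are illustrative, and the source does not say which exact variant Voyage AI's evaluation uses.

```typescript
// Discounted cumulative gain over the first k positions.
// Uses linear gain (rel / log2(rank + 1)); some evaluations use 2^rel - 1 instead.
function dcgAtK(relevances: number[], k: number): number {
  return relevances
    .slice(0, k)
    .reduce((sum, rel, i) => sum + rel / Math.log2(i + 2), 0);
}

// NDCG@k: DCG of the given ordering divided by the DCG of the ideal ordering.
// Simplification: the ideal ordering is built from the same list of labels,
// not from every relevant document in the corpus.
function ndcgAtK(relevances: number[], k: number): number {
  const ideal = [...relevances].sort((a, b) => b - a);
  const idealDcg = dcgAtK(ideal, k);
  return idealDcg === 0 ? 0 : dcgAtK(relevances, k) / idealDcg;
}

// Hypothetical relevance labels, in the order a reranker returned the documents.
const rankedRelevances = [3, 2, 0, 1];
console.log(ndcgAtK(rankedRelevances, 10).toFixed(4));
```

A perfectly ordered list scores 1, and any ordering that places a less relevant document above a more relevant one scores below 1.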
What is instruction-following in Voyage Rerank 2.5 Lite?
Instruction-following lets you steer relevance scores using natural language. You pass an instruction with the query and candidates, and Voyage Rerank 2.5 Lite adjusts scoring to reflect it. On the MAIR benchmark, this lifts Voyage Rerank 2.5 Lite 10.36% above Cohere Rerank v3.5.
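As a minimal sketch of what passing an instruction might look like, the helper below combines an instruction with the query text before the rerank call. This assumes the instruction travels as part of the query string. The source does not specify the mechanism, and both the `withInstruction` helper and the `Query:` separator are hypothetical.

```typescript
// Hypothetical helper: combines a natural-language instruction with the query.
// The "Query:" separator is an assumed format, not a documented requirement.
function withInstruction(instruction: string, query: string): string {
  const trimmed = instruction.trim();
  return trimmed.length === 0 ? query : `${trimmed}\nQuery: ${query}`;
}

const instructedQuery = withInstruction(
  'Prioritize documents that name a capital city explicitly.',
  'What is the capital of France?',
);
console.log(instructedQuery);
```

The resulting string would then be supplied as the `query` of a rerank request, with the candidate documents left unchanged.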
What is the context window for Voyage Rerank 2.5 Lite?
32K tokens. That is double the window of rerank-2-lite and eight times that of Cohere Rerank v3.5, so longer query-document pairs fit in a single rerank call without truncation.
Which first-stage retrievers does Voyage Rerank 2.5 Lite work with?
Voyage Rerank 2.5 Lite reorders candidates from any first-stage method. Voyage AI benchmarks it on BM25 lexical search, OpenAI text-embedding-3-large, voyage-3-large, and voyage-3.5. You can keep your existing embedding model and add Voyage Rerank 2.5 Lite as a second-stage reranker.
Does Voyage Rerank 2.5 Lite support multilingual retrieval?
Yes. Voyage Rerank 2.5 Lite reranks across many languages without separate per-language models. Voyage AI reports consistent improvement across all evaluated languages and first-stage retrieval methods.
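The two-stage setup mentioned in these answers, where an existing first-stage retriever produces candidates and a reranker reorders them, can be sketched as follows. The first stage here is a toy term-overlap scorer standing in for BM25 or an embedding search, and the `Reranker` type, `firstStage`, and `retrieveAndRerank` are illustrative names, not part of any SDK.

```typescript
type Scored = { document: string; score: number };

// Assumed shape of a second-stage reranker. In practice this would wrap
// a call to a hosted reranking model.
type Reranker = (query: string, documents: string[]) => Promise<Scored[]>;

function tokenize(text: string): string[] {
  return text.toLowerCase().split(/\W+/).filter((t) => t.length > 0);
}

// Toy first stage: score each document by how many query terms it contains,
// then keep the k highest-scoring documents.
function firstStage(query: string, corpus: string[], k: number): string[] {
  const terms = tokenize(query);
  return corpus
    .map((document) => {
      const words = tokenize(document);
      const score = terms.filter((t) => words.includes(t)).length;
      return { document, score };
    })
    .sort((a, b) => b.score - a.score)
    .slice(0, k)
    .map((s) => s.document);
}

// Second stage: rerank only the shortlist and return the topN results.
async function retrieveAndRerank(
  query: string,
  corpus: string[],
  reranker: Reranker,
  candidates: number,
  topN: number,
): Promise<Scored[]> {
  const shortlist = firstStage(query, corpus, candidates);
  const reranked = await reranker(query, shortlist);
  return [...reranked].sort((a, b) => b.score - a.score).slice(0, topN);
}
```

Because the reranker is passed in as a function, the first stage can be swapped without touching the second, which mirrors the point that the existing retriever stays in place.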
How do I access Voyage Rerank 2.5 Lite through Vercel AI Gateway?
Add your Voyage AI API key in AI Gateway settings, then send rerank requests through AI Gateway. AI Gateway authenticates requests and records usage. You can call Voyage Rerank 2.5 Lite through the AI SDK alongside Chat Completions, Responses, and Messages API formats.
Is Zero Data Retention available for Voyage Rerank 2.5 Lite?
Zero Data Retention is not currently available for this model. Zero Data Retention is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.