Voyage 4 Lite
Voyage 4 Lite is the lightweight Voyage 4 model. Voyage AI reports it approaches voyage-3.5 retrieval accuracy with fewer parameters, shares one embedding space with voyage-4-large and voyage-4, and supports a context window of 32K tokens with Matryoshka dimensions and quantization like the rest of the family.
import { embed } from 'ai';
const result = await embed({ model: 'voyage/voyage-4-lite', value: 'Sunny day at the beach',})About Voyage 4 Lite
Voyage 4 Lite strips down the Voyage 4 architecture to fewer parameters, released N/A. The result is a model that processes tokens faster and cheaper than its siblings while retaining enough retrieval quality for most production use cases.
Voyage AI benchmarks Voyage 4 Lite near voyage-3.5 retrieval accuracy. For teams running millions of daily requests or indexing large corpora on a budget, the per-token savings add up fast. Development and staging environments also benefit: cheaper iteration cycles let you experiment with chunking strategies and retrieval pipelines without burning through credits.
Because all Voyage 4 models produce compatible vectors, you aren't locked into Voyage 4 Lite for every step of your pipeline. Index your corpus with a stronger variant, then point live traffic at Voyage 4 Lite for lower query costs. No re-indexing required.