Voyage 4 Large

Voyage 4 Large is the flagship embedding model in Voyage AI's Voyage 4 family. It uses a mixture-of-experts (MoE) architecture. In its published benchmarks, Voyage AI reports state-of-the-art general retrieval, serving costs about 40% lower than comparable dense models, and average gains over OpenAI text-embedding-3-large, Cohere Embed v4, and Gemini Embedding 001. It shares one embedding space with voyage-4 and voyage-4-lite.

index.ts
import { embed } from 'ai';

const result = await embed({
  model: 'voyage/voyage-4-large',
  value: 'Sunny day at the beach',
});

console.log(result.embedding);

Frequently Asked Questions

  • What is the difference between Voyage 4 Large and voyage-4?

    Voyage 4 Large is the MoE flagship with the highest average scores in Voyage AI's published Voyage 4 comparison. voyage-4 is the mid-sized model. Both share the same embedding space as voyage-4-lite.

  • How does Voyage 4 Large compare to voyage-3-large?

    Voyage AI reports better retrieval accuracy than voyage-3-large at a lower price, using MoE and the Voyage 4 training stack. Moving from Voyage 3 to Voyage 4 requires re-embedding because the embedding space changes.

  • What is the context window for Voyage 4 Large?

32K tokens. When embedding long documents, size your chunks so each request stays under this limit.
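A minimal sketch of size-based chunking under the 32K-token limit. The characters-per-token ratio below is a rough heuristic of my choosing, not Voyage AI's tokenizer, so treat the constants as assumptions and leave headroom:

```typescript
// Rough chunking so each embedding request stays under the 32K-token
// context window. Assumes ~4 characters per token as a conservative
// heuristic; the real limit depends on Voyage AI's tokenizer.
const MAX_TOKENS = 32_000;
const CHARS_PER_TOKEN = 4;
const MAX_CHARS = MAX_TOKENS * CHARS_PER_TOKEN;

function chunkDocument(text: string, maxChars: number = MAX_CHARS): string[] {
  const chunks: string[] = [];
  for (let i = 0; i < text.length; i += maxChars) {
    chunks.push(text.slice(i, i + maxChars));
  }
  return chunks;
}
```

In practice you would chunk on sentence or paragraph boundaries rather than fixed character offsets, but the budget calculation is the same.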

  • When should I use Voyage 4 Large over voyage-4-lite?

Use Voyage 4 Large when you need the strongest published Voyage 4 vectors, especially for one-time or infrequent document embedding. Use voyage-4-lite when you want a lighter model for query embedding, or for symmetric indexing at lower compute.

  • How do I access Voyage 4 Large through Vercel AI Gateway?

    Add your Voyage AI API key in AI Gateway settings, then send embedding requests through AI Gateway. AI Gateway authenticates requests and records usage.

  • Do I need to re-embed my data to switch from voyage-3-large?

    Yes. Moving from Voyage 3 to Voyage 4 requires re-embedding because the embedding space is new. Within Voyage 4, you can often keep voyage-4-large document vectors and change query models if you use asymmetric retrieval.

  • Is Voyage 4 Large suitable for RAG applications?

    Yes. Voyage AI positions it for retrieval-augmented generation and high-accuracy document indexing, including asymmetric setups where queries use a smaller Voyage 4 model.
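Because all Voyage 4 models share one embedding space, an asymmetric setup can index documents with voyage-4-large and embed queries with a smaller Voyage 4 model, then rank by plain cosine similarity. A minimal sketch with a local similarity helper (`rankDocuments` is an illustrative name, not an SDK function; the ai SDK also ships its own `cosineSimilarity`):

```typescript
// Cosine similarity between a query vector and document vectors.
// In an asymmetric setup, documents are embedded once with
// voyage/voyage-4-large and queries with a smaller Voyage 4 model;
// both live in the same embedding space, so comparison is direct.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Returns document indices sorted by descending similarity to the query.
function rankDocuments(query: number[], docs: number[][]): number[] {
  return docs
    .map((vec, i) => ({ i, score: cosineSimilarity(query, vec) }))
    .sort((a, b) => b.score - a.score)
    .map(({ i }) => i);
}
```

The top-ranked chunks then go into the prompt of your generation model, which is the retrieval half of a standard RAG pipeline.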

  • What is mixture-of-experts in Voyage 4 Large?

Mixture-of-experts routes each token through a subset of expert subnetworks, so only part of the model is active per input. Voyage AI credits this architecture for Voyage 4 Large's higher accuracy at serving costs it reports as about 40% lower than comparable dense models.