Voyage Law 2

Voyage Law 2 is Voyage AI's legal-specialized embedding model trained on one trillion legal tokens. It outperforms OpenAI text-embedding-3-large by 6% across eight legal datasets and achieves 84.44 NDCG@10 on long-context legal retrieval versus 68.40 for OpenAI.

index.ts

import { embed } from 'ai';

const result = await embed({
  model: 'voyage/voyage-law-2',
  value: 'Sunny day at the beach',
})

Overview About Providers Similar FAQ

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Input	Capabilities	ZDR	No Training	Release Date

Voyage AI

Legal:Terms

•

Privacy

$0.12/M

04/15/2024

More models by Voyage AI

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	Providers	ZDR	No Training	Release Date

voyage/voyage-4-large

32K

$0.12/M

—

01/15/2026

voyage/voyage-4-lite

32K

$0.02/M

—

01/15/2026

voyage/voyage-4

32K

$0.06/M

—

01/15/2026

voyage/rerank-2.5

32K

$0.05/M

—

08/11/2025

voyage/voyage-3-large

$0.18/M

—

01/07/2025

voyage/voyage-code-3

$0.18/M

—

12/04/2024

About Voyage Law 2

Voyage Law 2 is Voyage AI's legal-specialized embedding model, released April 15, 2024. Voyage AI trained it on one trillion high-quality legal tokens using specifically designed positive pairs and a contrastive learning algorithm. The model handles diverse legal content including contracts, congressional bills, court cases, and statutes across multiple jurisdictions: U.S., Chinese, German, and Indian.

Across eight legal retrieval datasets, Voyage Law 2 outperforms OpenAI text-embedding-3-large by 6% on average, with improvements exceeding 10% on LeCaRDv2, LegalQuAD, and GerDaLIR. On long-context legal retrieval, Voyage Law 2 achieves 84.44 NDCG@10 compared to 68.40 for OpenAI. That's a 23% relative improvement reflecting the model's strength on lengthy legal documents.

Voyage AI intentionally mixed legal training data with finance, technology, and intellectual property domains. This ensures Voyage Law 2 performs well on non-legal retrieval tasks while maintaining its legal specialization. Teams with mixed legal and business content don't need a separate general-purpose model for non-legal documents in the same index.

What To Consider When Choosing a Provider

Configuration: Voyage Law 2 excels on long-context legal retrieval, achieving 84.44 NDCG@10 versus 68.40 for OpenAI. If your legal corpus contains lengthy contracts, court opinions, or statutory texts, this is where Voyage Law 2 provides the largest accuracy gains.
Configuration: Voyage AI trains and evaluates Voyage Law 2 on U.S., Chinese, German, and Indian legal content. If your practice spans these jurisdictions, Voyage Law 2 handles cross-jurisdictional retrieval within a single index.
Configuration: Voyage AI's voyage-3-large now outperforms domain-specific models on legal benchmarks. For new deployments with mixed legal and non-legal content, evaluate voyage-3-large or voyage-3.5 as alternatives.
Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Voyage Law 2

Best for

Legal research and discovery: Contracts, case law, and statutes where domain-specific terminology and document structure matter
Long-document legal retrieval: Contracts and court opinions span many pages and key passages appear deep in the text
Multi-jurisdictional legal search: U.S., Chinese, German, and Indian legal content within a single index
RAG for legal applications: Retrieving precise statutory language, case citations, and contractual clauses improves generated output
Compliance and regulatory retrieval: Searches across regulations, guidance, and enforcement actions from multiple legal systems

Consider alternatives when

Your content spans multiple domains beyond legal: Voyage-3.5 or voyage-3-large provides cross-domain retrieval that includes legal
You want Matryoshka dimensionality and quantization: Voyage-3.5 offers these while covering legal content
Your legal documents are primarily short-form: Clauses and headnotes benefit less from long-context strength
You need code or financial retrieval alongside legal content: A general-purpose model avoids managing multiple specialized indices

Conclusion

Voyage Law 2 provides measurable accuracy gains for legal document retrieval, particularly on long-context tasks where it outperforms OpenAI by 23% relative. Its training on one trillion legal tokens across multiple jurisdictions makes it well suited for legal research, discovery, and compliance workflows. Access it through AI Gateway for unified provider management and the flexibility to combine it with other embedding models as your needs evolve.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Voyage Law 2

Providers

More models by Voyage AI

About Voyage Law 2

What To Consider When Choosing a Provider

When to Use Voyage Law 2

Best for

Consider alternatives when

Conclusion