
Llama 4 Maverick 17B 128E Instruct FP8

meta/llama-4-maverick

Llama 4 Maverick 17B 128E Instruct FP8 is Meta's natively multimodal Mixture of Experts (MoE) model, with 17B active parameters routed across 128 experts. Published benchmarks span image and text tasks, and the MoE design activates only a fraction of the parameters per token that comparably capable dense models use.

Capabilities: Tool Use, Vision (Image)
index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'meta/llama-4-maverick',
  prompt: 'Why is the sky blue?',
})

// Stream the generated text to stdout as it arrives.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}
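
Because the model accepts image input, the same call can carry an image part alongside text. The snippet below is a minimal sketch assuming the AI SDK's multi-part message format; the file name and image URL are placeholders.

vision.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'meta/llama-4-maverick',
  messages: [
    {
      role: 'user',
      content: [
        { type: 'text', text: 'Describe this product photo in two sentences.' },
        // Placeholder URL; the SDK also accepts raw bytes or base64-encoded data.
        { type: 'image', image: new URL('https://example.com/product.jpg') },
      ],
    },
  ],
})

for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}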

What To Consider When Choosing a Provider

  • Zero Data Retention

    AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.

  • Authentication

    AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly; see the sketch after this list.
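
As a rough illustration of the key-based path, the sketch below assumes the @ai-sdk/gateway provider package and its createGateway helper, with the key read from an AI_GATEWAY_API_KEY environment variable; on Vercel deployments an OIDC token can stand in and no key needs to be passed. Consult the Gateway documentation for the exact options.

gateway.ts

import { createGateway } from '@ai-sdk/gateway'
import { streamText } from 'ai'

// Assumed setup: an API key issued from the Gateway dashboard.
// With a Vercel OIDC token available, createGateway() with no apiKey also works.
const gateway = createGateway({
  apiKey: process.env.AI_GATEWAY_API_KEY,
})

const result = streamText({
  model: gateway('meta/llama-4-maverick'),
  prompt: 'Summarize zero data retention in one sentence.',
})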

For workloads that mix images and long text, Llama 4 Maverick 17B 128E Instruct FP8's efficiency advantage over dense models shows most at scale. Validate throughput at your expected concurrency level before you pick a provider tier, and weigh it against the listed $0.24 and $0.97 price points.
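
One way to do that is a quick load probe before committing to a tier. The sketch below is illustrative only: it fires a batch of concurrent requests at an assumed concurrency level and reports aggregate output characters per second as a rough stand-in for token throughput.

throughput-check.ts

import { generateText } from 'ai'

// Assumed concurrency level; set this to your expected parallel request load.
const CONCURRENCY = 8

async function probe() {
  const start = Date.now()
  const results = await Promise.all(
    Array.from({ length: CONCURRENCY }, () =>
      generateText({
        model: 'meta/llama-4-maverick',
        prompt: 'Write a 200-word product description for a hiking backpack.',
      }),
    ),
  )
  const seconds = (Date.now() - start) / 1000
  const chars = results.reduce((sum, r) => sum + r.text.length, 0)
  console.log(`${CONCURRENCY} concurrent requests completed in ${seconds.toFixed(1)}s`)
  console.log(`~${Math.round(chars / seconds)} output chars/sec aggregate`)
}

probe()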

When to Use Llama 4 Maverick 17B 128E Instruct FP8

Best For

  • Production multimodal applications:

    Pairing image understanding with long-form text generation for product catalog processing and document analysis with mixed visual and textual content

  • Creative and coding workloads:

    Multilingual applications where the MoE architecture reaches dense-model scores on published benchmarks at lower active-parameter cost

  • Cost-per-quality sensitive workloads:

    Comparable-capability dense models are significantly more expensive to serve

  • Long-context multimodal tasks:

    When image and text reasoning must stay coherent across extended conversations

  • General assistant and chat:

    Meta designates Llama 4 Maverick 17B 128E Instruct FP8 as the intended product workhorse

Consider Alternatives When

  • Extreme long documents:

    Llama 4 Scout's 10M token context window is purpose-built for that use case

  • Text-only workload:

    The MoE overhead of loading all experts into memory is not offset by quality gains over a dense model at similar cost

  • Maximum reasoning depth:

    Llama 4 Behemoth (when available) or other frontier reasoning models may be appropriate

Conclusion

Llama 4 Maverick 17B 128E Instruct FP8 combines native multimodality, a 128-expert MoE architecture, and strong benchmark results on image and text tasks at a fraction of the active-parameter cost of dense alternatives. For teams building multimodal production applications on open models, Llama 4 Maverick 17B 128E Instruct FP8 is the more capable of the two initial Llama 4 releases.

FAQ

How does the Mixture of Experts architecture reduce inference cost?

Each input token activates only a subset of the total parameters. Llama 4 Maverick 17B 128E Instruct FP8 uses alternating dense and MoE layers. MoE layers route each token to a shared expert plus one of 128 routed experts. Only 17B of the 400B total parameters are active per token, reducing inference cost while the full parameter budget contributes to model quality.
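
The routing step is simple to sketch. The toy function below is illustrative only (production kernels operate on batched tensors, not per-token loops): the router scores all 128 routed experts for a token, applies the single best one, and sums its output with the always-on shared expert, so compute scales with active rather than total parameters.

moe-sketch.ts

// Toy illustration of top-1 MoE routing with a shared expert.
type Vector = number[]
type Expert = (hidden: Vector) => Vector

function moeLayer(
  hidden: Vector,
  routerLogits: number[], // one score per routed expert (128 for Maverick)
  routedExperts: Expert[],
  sharedExpert: Expert,
): Vector {
  // Top-1 routing: only the highest-scoring routed expert runs for this token.
  const best = routerLogits.indexOf(Math.max(...routerLogits))
  const routed = routedExperts[best](hidden)
  const shared = sharedExpert(hidden)
  // The token's output combines the shared expert with the one routed expert,
  // so only a fraction of the total parameters are exercised per token.
  return routed.map((value, i) => value + shared[i])
}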

How does Llama 4's multimodality differ from Llama 3.2's vision support?

Llama 3.2 added vision to an existing text backbone via cross-attention adapters, keeping the language model weights frozen. Llama 4 Maverick 17B 128E Instruct FP8 processes text and vision tokens together in a unified backbone. This enables deeper cross-modal reasoning because the model was never strictly text-only.

How does Llama 4 Maverick perform on LMArena?

An experimental chat version of Llama 4 Maverick 17B 128E Instruct FP8 scored an Elo of 1417 on LMArena.

How many languages does Llama 4 support?

Llama 4 supports 200 languages, including over 100 with more than 1 billion tokens each, representing 10x more multilingual tokens than Llama 3.