Llama 4 Maverick 17B 128E Instruct FP8

Llama 4 Maverick 17B 128E Instruct FP8 is Meta's natively multimodal Mixture of Experts (MoE) model with 17B active parameters across 128 experts. Published benchmarks span image and text tasks, and the MoE activates a fraction of the parameters that comparable dense models use.

Tool UseVision (Image)

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'meta/llama-4-maverick',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider

Context	Max Output	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	ZDR	No Training	Release Date

DeepInfra

Legal:Terms

•

Privacy

131K

0.3s

45tps

$0.20/M

$0.80/M

—

04/05/2025

Bedrock

Legal:Terms

•

Privacy

128K

0.2s

128tps

$0.24/M

$0.97/M

—

04/05/2025

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Llama 4 Maverick 17B 128E Instruct FP8

Providers