Claude 3.5 Haiku

Claude 3.5 Haiku was the model that first proved a Haiku-tier release could match Opus-class performance, scoring strong results on SWE-bench Verified and redefining expectations for what fast, affordable models could accomplish on real engineering tasks.

File InputTool UseVision (Image)Explicit Caching

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'anthropic/claude-3.5-haiku',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

About Claude 3.5 Haiku

Before Claude 3.5 Haiku arrived on November 4, 2024, the Haiku tier was the budget option: fast and cheap, but a step down in capability. Claude 3.5 Haiku broke that assumption. Anthropic announced it matched Claude 3 Opus on many intelligence evaluations while running at speeds comparable to the original Claude 3 Haiku. The result: Opus-class reasoning at Haiku-class latency.

The SWE-bench Verified result crystallized this. At 40.6%, Claude 3.5 Haiku outperformed agents built on the original Claude 3.5 Sonnet and other comparable models at the time on real-world software engineering tasks: fixing bugs, implementing features, and navigating codebases. For a model designed for speed and low cost, posting that score on a benchmark requiring sustained multi-file reasoning was a landmark.

Anthropic designed Claude 3.5 Haiku for three deployment patterns: latency-sensitive user-facing products, specialized sub-agent roles inside agentic pipelines, and personalization engines processing large volumes of per-user data such as purchase histories, pricing records, and inventory feeds. Improved instruction following and more accurate tool use make it practical for the structured, schema-bound work that sub-agents handle in production systems.

Live rates are shown in the pricing panel on this page.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Claude 3.5 Haiku

About Claude 3.5 Haiku