Claude 3 Haiku launched on March 1, 2023 as the fastest, lowest-cost model in the Claude 3 family. Anthropic positioned it alongside Sonnet (mid-tier) and Opus (highest capability). Haiku targets workloads where throughput and cost per token matter more than peak reasoning depth.
Speed defined the release. Anthropic described Claude 3 Haiku as three times faster than peer models in its performance tier for the majority of workloads, with throughput dropping on prompts that exceed 32K tokens.
Anthropic noted a 1:5 input-to-output token ratio and reported the model ran at half the cost of competing models in its tier at launch.
Haiku ships with the same vision architecture as Sonnet and Opus. It accepts photos, charts, graphs, and technical diagrams as input. Anthropic highlighted document analysis (quarterly filings, contracts, legal cases), responsive customer support chat, and large-dataset annotation as primary use cases. The model launched on the Claude API and Amazon Bedrock, with Google Cloud Vertex AI following shortly after.