Verify critical details (pricing, licensing, availability) with the model's source before making business decisions.

Claude Haiku 4.5

Model family: claude-haiku-4

Context: 200,000 tokens
Released: 2025-09-30
Openness: closed-api
License: Anthropic Commercial Terms of Service · commercial: yes
Cost tier: paid-api
Rating: 4.0 ★. Strong near-frontier capability at the lowest current-Claude price, ideal for high-volume routing, extraction, and moderation. Rated 4.0 because it isn't the cheapest option on the market and, like all Claude models, is closed.
Modalities: image-input, text
Capabilities: chat, classification, coding, instruction-following, summarization, tool-use, vision
Access: api-first-party, api-third-party, hosted-chat-ui

Tags: llm, closed-api, multimodal, low-cost, high-volume, us-based, proprietary

Quick Take

The fast, cheap Claude: near-frontier quality at the lowest current-generation price, built for high-volume routing, extraction, classification, and moderation.

Plain-English Description

Claude Haiku 4.5 is Anthropic's budget tier — the model you reach for when you need Claude-quality results at high volume and low latency, and don't need the depth of Sonnet or Opus. At $1 in / $5 out per million tokens it's five times cheaper than Sonnet and twenty-five times cheaper than Opus, while still delivering capable, near-frontier intelligence for the kinds of tasks that make up a surprising share of real workloads.

Those tasks are the sweet spot: classification, routing (deciding which bigger model or path a request should take), data extraction, summarization, and content moderation. These are high-throughput jobs where speed and cost per call matter more than maximum reasoning depth. Haiku has a 200K-token context window (smaller than the 1M of Opus and Sonnet) and the same image-input capability as those models.

A fair caveat: while Haiku is the cheapest current-generation Claude, it isn't the cheapest model on the market — budget tiers from OpenAI, Google, and especially open or DeepSeek models can undercut it. You're paying a modest premium for Claude's quality and ecosystem.

Best For

High-volume, latency-sensitive work: classification, routing, extraction, summarization, moderation.
A cheap first-pass model in a multi-model pipeline (Haiku triages, bigger models handle the hard cases).
Cost-sensitive production tasks that still want Claude-quality output.
Simple coding and tool-use at scale.
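
The triage pattern in the list above (Haiku handles the easy cases, a bigger model handles the rest) can be sketched as follows. This is a minimal illustration, not Anthropic's API: the `ModelCall` signature, the confidence score, the 0.8 threshold, and the two `fake_*` functions are all assumptions standing in for real model calls.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Hypothetical signature: a model call returns (answer, confidence in [0, 1]).
ModelCall = Callable[[str], Tuple[str, float]]


@dataclass
class TriageRouter:
    cheap: ModelCall        # assumed wrapper around the cheap model (e.g. Haiku)
    strong: ModelCall       # assumed wrapper around a larger model
    threshold: float = 0.8  # illustrative cutoff, not a recommended value

    def run(self, request: str) -> Tuple[str, str]:
        """Return (answer, tier), where tier is 'cheap' or 'strong'."""
        answer, confidence = self.cheap(request)
        if confidence >= self.threshold:
            return answer, "cheap"
        # Escalate only when the cheap model is not confident enough.
        answer, _ = self.strong(request)
        return answer, "strong"


def fake_cheap(request: str) -> Tuple[str, float]:
    # Stand-in for a real call: treats short requests as "easy".
    label = "spam" if "free money" in request.lower() else "ok"
    confidence = 0.95 if len(request) < 80 else 0.4
    return label, confidence


def fake_strong(request: str) -> Tuple[str, float]:
    # Stand-in for a real call to a larger model.
    return "needs-review", 0.99


router = TriageRouter(cheap=fake_cheap, strong=fake_strong)
```

In a real pipeline the confidence signal would have to come from somewhere concrete, such as a self-reported score or a validation check on the cheap model's output; that choice is left open here.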

Not For

Complex reasoning or hard coding — use Claude Sonnet 4.6 or Claude Opus 4.8.
Very long documents needing a 1M-token window — Haiku is 200K.
The absolute lowest price — open models and DeepSeek undercut it.
Self-hosting — Claude is closed.
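
To make the 200K limit concrete, here is a small sketch of a pre-flight check. It assumes that input tokens and the output tokens you reserve share the same window, and that you already have token counts from a tokenizer or the provider; the function names are hypothetical.

```python
import math

HAIKU_CONTEXT_TOKENS = 200_000  # context size from the listing above


def fits_context(input_tokens: int, max_output_tokens: int,
                 window: int = HAIKU_CONTEXT_TOKENS) -> bool:
    """True if the prompt plus the reserved output fits in the window."""
    return input_tokens + max_output_tokens <= window


def chunks_needed(total_input_tokens: int, max_output_tokens: int,
                  window: int = HAIKU_CONTEXT_TOKENS) -> int:
    """Number of separate calls needed if a long input is split evenly."""
    per_chunk = window - max_output_tokens
    if per_chunk <= 0:
        raise ValueError("reserved output leaves no room for input")
    return math.ceil(total_input_tokens / per_chunk)
```

A document that needs several chunks is a sign that a model with a larger window may be the simpler choice.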

License — Plain-English Summary

Proprietary and API-only, on Anthropic's standard commercial terms: you get commercial rights to your outputs but no rights to the model itself, which is accessed through Anthropic's API or through Amazon Bedrock and Google Vertex AI. Enterprise and zero-retention options exist. As with all Claude models, there are no open weights and no self-hosting.

How It Compares

Against Claude Sonnet 4.6, Haiku is far cheaper and faster but less capable — the right tool for volume, not depth. Against OpenAI's GPT-5.4 Mini and Google's Gemini 3.5 Flash, Haiku competes on quality-per-dollar in the budget tier, though those rivals (and cheaper open models) can win on raw price. The usual pattern is to use Haiku as the cheap workhorse and escalate only the requests that need more.

Cost

API input (per 1M tokens): $1.00
API output (per 1M tokens): $5.00
API providers: anthropic-api, amazon-bedrock, google-vertex-ai
Notes: $1.00 input / $5.00 output per million tokens — the cheapest current-generation Claude (5x cheaper than Sonnet, 25x cheaper than Opus). 200K context. Prompt caching and batch discounts apply.
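
The listed rates translate into per-call and monthly cost as sketched below. The two rate constants come from the listing; the function names and the 30-day month are assumptions, and the estimate deliberately ignores prompt caching and batch discounts.

```python
INPUT_USD_PER_MTOK = 1.00   # listed input price per 1M tokens
OUTPUT_USD_PER_MTOK = 5.00  # listed output price per 1M tokens


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """List-price cost of one call, with no caching or batch discount."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


def monthly_cost_usd(calls_per_day: int, input_tokens: int,
                     output_tokens: int, days: int = 30) -> float:
    """Cost of a steady workload where every call has the same token counts."""
    return calls_per_day * days * estimate_cost_usd(input_tokens, output_tokens)
```

For example, a classification call with 1,500 input tokens and 200 output tokens costs $0.0025 at list price, so 100,000 such calls per day comes to about $7,500 over 30 days.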

Commercial-use conditions

Commercial use permitted through the Anthropic API / Bedrock / Vertex under Anthropic's terms; no weights, no modification, no redistribution.