Claude 4.5 Haiku

Our fastest model, a lightweight version of our most powerful AI, at a more affordable price

Announcements

New
Claude Haiku 4.5
Oct 15, 2025
Claude Haiku 4.5 is our fastest, most cost-efficient model, matching Sonnet 4’s performance on coding, computer use, and agent tasks. Claude Haiku 4.5 scores 73.3% on SWE-bench Verified, making it one of the world's best coding models.
Read more
Claude 3.5 Haiku
Oct 22, 2024
For a similar speed to Haiku 3, Haiku 3.5 improved across every skill set and surpassed Opus 3, the largest model in our previous generation, on many intelligence benchmarks.
Read more

Availability and pricing

Anyone can chat with Claude using Haiku 4.5 on Claude.ai, available on web, iOS, and Android.

For developers, Haiku 4.5 is available on the Claude Platform natively, and in Amazon Bedrock, Google Cloud's Vertex AI, and Microsoft Foundry.

Claude Haiku 4.5 is also available in Claude Code.

Pricing for Haiku 4.5 on the Claude Platform starts at $1 per million input tokens and $5 per million output tokens, with up to 90% cost savings with prompt caching and 50% cost savings with batch processing. To get started, simply use claude-haiku-4-5 via the Claude API.

Use cases

Haiku 4.5 brings advanced capabilities at a price that makes Claude practical for scaled deployments, from powering free tier products to budget-conscious applications that still need strong performance.

Powering free user experiences

Haiku 4.5 is fast and affordable enough to power AI agents for all your users, even in your free tier.

Latency-sensitive experiences

Haiku 4.5 is now ideal for real-time applications like customer service agents and chatbots where response time is critical.

Coding sub-agents

Haiku 4.5 powers sub-agents, enabling multi-agent systems that tackle complex refactors, migrations, and large feature builds with quality and speed.

Financial analysis

Haiku 4.5 can monitor thousands of data streams at once—tracking regulatory changes, market signals, and portfolio risks in real time.

Research sub-agents

Haiku 4.5 can tackle dozens of research sources simultaneously, from literature reviews to data synthesis, in hours instead of weeks.

Benchmarks

Haiku 4.5 delivers strong performance and speed across coding, tool use, and reasoning tasks—matching Sonnet 4 on coding, computer use, and agentic workflows at substantially lower cost and faster speeds.

Trust & Safety

We've conducted extensive testing and evaluation of Haiku 4.5, working with external experts to ensure it meets our standards for safety, security, and reliability. In the model card for this release, we discuss new safety results in several categories.

Hear from our customers

Our early testing shows that Claude Haiku 4.5 brings efficient code generation to GitHub Copilot with comparable quality to Sonnet 4 but at faster speed. Already we're seeing it as an excellent choice for Copilot users who value speed and responsiveness in their AI-powered development workflows.
Matthew Isabel
Distinguished Product Manager

Historically models have sacrificed speed and cost for quality. Claude Haiku 4.5 is blurring the lines on this trade off: it's a fast frontier model that keeps costs efficient and signals where this class of models is headed.
Jeff Wang
CEO

Claude Haiku 4.5 delivers intelligence without sacrificing speed, enabling us to build AI applications that utilize both deep reasoning and real-time responsiveness.
Ben Lafferty
Staff Engineer

Claude Haiku 4.5 is remarkably capable—just six months ago, this level of performance would have been state-of-the-art on our internal benchmarks. Now it runs up to 4-5 times faster than Sonnet 4.5 at a fraction of the cost, unlocking an entirely new set of use cases.
Andrew Filev
CEO

Claude Haiku 4.5 hit a sweet spot we didn't think was possible: near-frontier coding quality with blazing speed and cost efficiency. In Augment's agentic coding evaluation, it achieves 90% of Sonnet 4.5's performance, matching much larger models. We're excited to offer it to our users.
Guy Gur-Ari
Co-Founder

Claude Haiku 4.5 outperformed our current models on instruction-following for slide text generation, achieving 65% accuracy versus 44% from our premium tier model—that's a game-changer for our unit economics.
Jon Noronha
Co-Founder, Gamma

Speed is the new frontier for AI agents operating in feedback loops. Haiku 4.5 proves you can have both intelligence and rapid output. It handles complex workflows reliably, self-corrects in real-time, and maintains momentum without latency overhead. For most development tasks, it's the ideal performance balance.
Brad Axen
Tech Lead, AI

Claude Haiku 4.5 is a leap forward for agentic coding, particularly for sub-agent orchestration and computer use tasks. The responsiveness makes AI-assisted development in Warp feel instantaneous.
Zach Lloyd
Founder & CEO

01 / 08

When should I use Haiku 4.5?

Haiku 4.5 scores 73.3% on SWE-bench Verified, making it one of the world’s best coding models. Haiku 4.5 excels at parallelized execution, sub-agents, and high-volume operations, while Sonnet 4.5 delivers frontier performance for complex coding agents and sophisticated reasoning.

How much does it cost to use Haiku 4.5?

Pricing depends on how you want to use Haiku 4.5. To learn more, check out our pricing page.