Claude 4.5 Haiku
Our fastest model, a lightweight version of our most powerful AI, at a more affordable price
Announcements
- New
Claude Haiku 4.5
Oct 15, 2025
Claude Haiku 4.5 is our fastest, most cost-efficient model, matching Sonnet 4’s performance on coding, computer use, and agent tasks. Claude Haiku 4.5 scores 73.3% on SWE-bench Verified, making it one of the world's best coding models.
Read more
Claude 3.5 Haiku
Oct 22, 2024
For a similar speed to Haiku 3, Haiku 3.5 improved across every skill set and surpassed Opus 3, the largest model in our previous generation, on many intelligence benchmarks.
Read more
Availability and pricing
Anyone can chat with Claude using Haiku 4.5 on Claude.ai, available on web, iOS, and Android.
Pricing for Haiku 4.5 on the Claude Developer Platform starts at $1 per million input tokens and $5 per million output tokens, with up to 90% cost savings with prompt caching and 50% cost savings with the Message Batches API.
Learn more at our pricing page.
Use cases
Haiku 4.5 brings advanced capabilities at a price that makes Claude practical for scaled deployments, from powering free tier products to budget-conscious applications that still need strong performance.
Powering free user experiences
Haiku 4.5 is fast and affordable enough to power AI agents for all your users, even in your free tier.
Latency-sensitive experiences
Haiku 4.5 is now ideal for real-time applications like customer service agents and chatbots where response time is critical.
Coding sub-agents
Haiku 4.5 powers sub-agents, enabling multi-agent systems that tackle complex refactors, migrations, and large feature builds with quality and speed.
Financial analysis
Haiku 4.5 can monitor thousands of data streams at once—tracking regulatory changes, market signals, and portfolio risks in real time.
Research sub-agents
Haiku 4.5 can tackle dozens of research sources simultaneously, from literature reviews to data synthesis, in hours instead of weeks.
Benchmarks
Haiku 4.5 delivers strong performance and speed across coding, tool use, and reasoning tasks—matching Sonnet 4 on coding, computer use, and agentic workflows at substantially lower cost and faster speeds.
Trust & Safety
We've conducted extensive testing and evaluation of Haiku 4.5, working with external experts to ensure it meets our standards for safety, security, and reliability. In the model card for this release, we discuss new safety results in several categories.
Hear from our customers
Our early testing shows that Claude Haiku 4.5 brings efficient code generation to GitHub Copilot with comparable quality to Sonnet 4 but at faster speed. Already we're seeing it as an excellent choice for Copilot users who value speed and responsiveness in their AI-powered development workflows.
Historically models have sacrificed speed and cost for quality. Claude Haiku 4.5 is blurring the lines on this trade off: it's a fast frontier model that keeps costs efficient and signals where this class of models is headed.
Claude Haiku 4.5 delivers intelligence without sacrificing speed, enabling us to build AI applications that utilize both deep reasoning and real-time responsiveness.
Claude Haiku 4.5 is remarkably capable—just six months ago, this level of performance would have been state-of-the-art on our internal benchmarks. Now it runs up to 4-5 times faster than Sonnet 4.5 at a fraction of the cost, unlocking an entirely new set of use cases.
Claude Haiku 4.5 hit a sweet spot we didn't think was possible: near-frontier coding quality with blazing speed and cost efficiency. In Augment's agentic coding evaluation, it achieves 90% of Sonnet 4.5's performance, matching much larger models. We're excited to offer it to our users.
Claude Haiku 4.5 outperformed our current models on instruction-following for slide text generation, achieving 65% accuracy versus 44% from our premium tier model—that's a game-changer for our unit economics.
Speed is the new frontier for AI agents operating in feedback loops. Haiku 4.5 proves you can have both intelligence and rapid output. It handles complex workflows reliably, self-corrects in real-time, and maintains momentum without latency overhead. For most development tasks, it's the ideal performance balance.
Claude Haiku 4.5 is a leap forward for agentic coding, particularly for sub-agent orchestration and computer use tasks. The responsiveness makes AI-assisted development in Warp feel instantaneous.
Frequently asked questions
Haiku 4.5 scores 73.3% on SWE-bench Verified, making it one of the world’s best coding models. Haiku 4.5 excels at parallelized execution, sub-agents, and high-volume operations, while Sonnet 4.5 delivers frontier performance for complex coding agents and sophisticated reasoning.
Pricing depends on how you want to use Haiku 4.5. To learn more, check out our pricing page.