←back to Blog

Anthropic Launches Claude Haiku 4.5: Small AI Model that Delivers Sonnet-4-Level Coding Performance at One-Third the Cost and more than Twice the Speed

Anthropic Launches Claude Haiku 4.5: Small AI Model that Delivers Sonnet-4-Level Coding Performance at One-Third the Cost and More than Twice the Speed

Target Audience Analysis

The primary audience for the Claude Haiku 4.5 launch includes software developers, data scientists, and business managers in technology-driven industries. These professionals are often looking for cost-effective solutions that enhance productivity and streamline workflows. Their pain points include:

  • High operational costs associated with AI model deployment.
  • Need for faster processing times to improve user experience.
  • Challenges in integrating new technologies into existing systems.

Their goals involve leveraging AI to improve coding efficiency, reduce costs, and enhance customer support automation. They are interested in technical specifications, performance benchmarks, and real-world applications of AI models. Communication preferences lean towards concise, data-driven content with clear technical insights.

Overview of Claude Haiku 4.5

Anthropic has introduced Claude Haiku 4.5, a latency-optimized “small” model that matches the coding performance of Claude Sonnet 4 while operating at more than twice the speed and one-third the cost. This model is available via Anthropic’s API and through partner catalogs on Amazon Bedrock and Google Cloud Vertex AI. The pricing structure is set at $1/MTok for input and $5/MTok for output.

Positioning and Use Cases

Haiku 4.5 is designed for real-time applications such as:

  • Interactive assistants
  • Customer support automation
  • Pair programming

It outperforms Sonnet 4 in computer-use tasks, enhancing responsiveness in tools like Claude for Chrome and Claude Code for multi-agent projects. Anthropic emphasizes that while Sonnet 4 remains the leading model, Haiku 4.5 provides near-frontier performance with significant cost efficiency. A recommended approach is to use Sonnet 4 for multi-step planning while deploying multiple Haiku 4.5 workers for execution.

Availability and Pricing

Developers can access the model (claude-haiku-4-5) through Anthropic’s API immediately. It is also available on Amazon Bedrock and Google Cloud Vertex AI. The pricing details are as follows:

  • Input: $1/MTok
  • Output: $5/MTok
  • Prompt-caching: $1.25/MTok write and $0.10/MTok read

Performance Benchmarks

Anthropic provides several benchmarks to validate Haiku 4.5’s performance:

  • SWE-bench Verified: Achieved an average of 73.3% over 50 trials with a 128K thinking budget.
  • Terminal-Bench: Average performance over 11 runs with varying thinking budgets.
  • OSWorld-Verified: Performance averaged across 4 runs with a 128K total thinking budget.
  • AIME / MMMLU: Averages over multiple runs using default sampling and 128K thinking budgets.

Users are encouraged to replicate these benchmarks with their own orchestration and tool stacks to validate performance in their specific contexts.

Key Takeaways

  • Haiku 4.5 delivers Sonnet-4-level coding performance at one-third the cost and more than twice the speed.
  • It surpasses Sonnet 4 on computer-use tasks, enhancing responsiveness in Claude for Chrome and Claude Code.
  • Recommended orchestration involves using Sonnet 4 for multi-step planning and parallel execution with multiple Haiku 4.5 workers.
  • Pricing is set at $1/$5 per million input/output tokens, with availability through the Claude API, Amazon Bedrock, and Google Cloud Vertex AI.
  • Released under ASL-2 with a lower measured misalignment rate compared to Sonnet 4.5 and Opus 4.1.

Conclusion

Anthropic’s strategic positioning of Claude Haiku 4.5 offers developers a cost-effective solution that maintains high performance without requiring significant architectural changes. This model is poised to facilitate enterprise adoption, particularly in environments where safety and cost are critical factors.

For further technical details, system card, model page, and documentation, please refer to the official Anthropic website.