Google AI Ships TimesFM-2.5: Smaller, Longer-Context Foundation Model That Now Leads GIFT-Eval (Zero-Shot Forecasting)
Understanding the Target Audience
The target audience for the TimesFM-2.5 model includes data scientists, machine learning engineers, business analysts, and decision-makers in industries that rely on time-series forecasting. Their pain points typically involve the complexity of model implementation, the need for accurate predictions, and the challenge of integrating AI tools into existing workflows. Their goals include leveraging AI to enhance forecasting accuracy, improving operational efficiency, and making data-driven decisions. They are interested in practical applications of AI, technical specifications, and best practices for implementation. Communication preferences lean towards concise, technical language with a focus on actionable insights.
What is Time-Series Forecasting?
Time-series forecasting is the practice of analyzing sequential data points collected over time to identify patterns and predict future values. It is vital in various industries, including:
- Forecasting product demand in retail
- Monitoring weather and precipitation trends
- Optimizing large-scale systems such as supply chains and energy grids
By capturing temporal dependencies and seasonal variations, time-series forecasting enables data-driven decision-making in dynamic environments.
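As a toy illustration of those temporal and seasonal dependencies, the sketch below (plain NumPy, with made-up demand numbers) forecasts the next week of a daily series by repeating the most recent weekly pattern, the seasonal-naive baseline that foundation models such as TimesFM aim to beat.

```python
import numpy as np

# Hypothetical daily demand with a weekly cycle plus noise (illustrative only).
rng = np.random.default_rng(0)
weekly_pattern = np.array([100, 102, 105, 110, 130, 160, 150], dtype=float)
series = np.tile(weekly_pattern, 9) + rng.normal(0, 3, size=63)

# Hold out the final week as the "future" we want to predict.
history, actual_next_week = series[:-7], series[-7:]

# Seasonal-naive baseline: predict the next 7 days by repeating the last observed week.
forecast = history[-7:]
mae = np.mean(np.abs(forecast - actual_next_week))
print(f"seasonal-naive MAE over the held-out week: {mae:.2f}")
```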
Changes in TimesFM-2.5 vs v2.0
- Parameters: 200M (down from 500M in 2.0)
- Max context: 16,384 points (up from 2,048)
- Quantiles: Optional 30M-param quantile head for continuous quantile forecasts up to 1K horizon
- Inputs: No “frequency” indicator required; new inference flags (flip-invariance, positivity inference, quantile-crossing fix); see the usage sketch after this list
- Roadmap: Upcoming Flax implementation for faster inference; covariates support slated to return; documentation being expanded
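A minimal usage sketch is shown below. It follows the pattern in the TimesFM repository README; the exact class, checkpoint, and flag names (e.g., TimesFM_2p5_200M_torch, ForecastConfig, force_flip_invariance, infer_is_positive, fix_quantile_crossing) are assumptions based on that README and may change between releases, so check the model card before relying on them.

```python
# pip install timesfm  (PyTorch variant; see the repo for the exact extras)
import numpy as np
import timesfm

# Load the 200M-parameter 2.5 checkpoint from Hugging Face.
# Class and checkpoint names follow the repo README and may differ across releases.
model = timesfm.TimesFM_2p5_200M_torch.from_pretrained("google/timesfm-2.5-200m-pytorch")

# Compile with an inference configuration. No per-series "frequency" indicator is
# needed in 2.5; the flags below correspond to the new inference options
# (flip-invariance, positivity inference, quantile-crossing fix).
model.compile(
    timesfm.ForecastConfig(
        max_context=1024,
        max_horizon=256,
        normalize_inputs=True,
        use_continuous_quantile_head=True,  # optional 30M-param quantile head
        force_flip_invariance=True,
        infer_is_positive=True,
        fix_quantile_crossing=True,
    )
)

# Forecast 24 steps ahead for two example series of different lengths.
inputs = [
    np.sin(np.linspace(0, 20, 512)),
    np.sin(np.linspace(0, 20, 256)) + 1.0,
]
point_forecast, quantile_forecast = model.forecast(horizon=24, inputs=inputs)
print(point_forecast.shape)     # typically (num_series, horizon)
print(quantile_forecast.shape)  # typically (num_series, horizon, num_quantiles)
```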
Why Does a Longer Context Matter?
A 16K-point history lets a single forward pass capture multi-seasonal structure, regime breaks, and low-frequency components without tiling or hierarchical stitching. This reduces pre-processing heuristics and improves stability in domains where the available history is far longer than the forecast horizon, such as energy load and retail demand. The longer context is one of the core design changes highlighted for TimesFM-2.5.
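To make the 16K figure concrete: roughly two years of hourly load data is about 17,500 points, so the most recent window fits in a single context instead of being tiled. The sketch below uses synthetic data for illustration; the commented lines reuse the assumed API from the previous sketch.

```python
import numpy as np

# Roughly two years of hourly observations (synthetic, illustrative only).
hours = 2 * 365 * 24  # 17,520 points
t = np.arange(hours)
load = (
    100
    + 20 * np.sin(2 * np.pi * t / 24)          # daily cycle
    + 10 * np.sin(2 * np.pi * t / (24 * 7))    # weekly cycle
    + np.random.default_rng(1).normal(0, 2, hours)
)

# TimesFM-2.5 accepts up to 16,384 context points, so keep the most recent window
# rather than tiling or hierarchically stitching shorter segments.
MAX_CONTEXT = 16_384
context = load[-MAX_CONTEXT:]
print(len(context))  # 16384

# With the (assumed) API from the previous sketch, the long window is passed directly:
# model.compile(timesfm.ForecastConfig(max_context=16_384, max_horizon=256, ...))
# point, quantiles = model.forecast(horizon=168, inputs=[context])  # one week ahead
```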
Research Context
The core thesis of TimesFM—a single, decoder-only foundation model for forecasting—was introduced in a 2024 ICML paper and Google’s research blog. GIFT-Eval (Salesforce) has emerged to standardize evaluation across domains, frequencies, horizon lengths, and univariate/multivariate regimes, with a public leaderboard hosted on Hugging Face.
Key Takeaways
- Smaller, Faster Model: TimesFM-2.5 runs with 200M parameters, less than half the size of the 500M-parameter v2.0, while improving accuracy.
- Longer Context: Supports 16K input length, enabling forecasts with deeper historical coverage.
- Benchmark Leader: Now ranks #1 among zero-shot foundation models on GIFT-Eval for both MASE (point accuracy) and CRPS (probabilistic accuracy); a short metric sketch follows this list.
- Production-Ready: Efficient design and quantile forecasting support make it suitable for real-world deployments across industries.
- Broad Availability: The model is live on Hugging Face.
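For readers less familiar with the two GIFT-Eval headline metrics, the sketch below computes MASE (forecast error scaled by a seasonal-naive baseline) and a standard quantile-based approximation of CRPS (average pinball loss across quantile levels). The function names and toy numbers are illustrative, not taken from the benchmark code.

```python
import numpy as np

def mase(y_true, y_pred, y_train, season=1):
    """Mean Absolute Scaled Error: MAE of the forecast divided by the
    in-sample MAE of a seasonal-naive forecast on the training series."""
    naive_mae = np.mean(np.abs(y_train[season:] - y_train[:-season]))
    return np.mean(np.abs(y_true - y_pred)) / naive_mae

def crps_from_quantiles(y_true, q_pred, q_levels):
    """Approximate CRPS as twice the pinball (quantile) loss averaged over levels.
    q_pred has shape (num_levels, horizon)."""
    y = y_true[None, :]
    q = np.asarray(q_levels)[:, None]
    pinball = np.maximum(q * (y - q_pred), (q - 1) * (y - q_pred))
    return 2.0 * pinball.mean()

# Toy example (illustrative numbers only).
y_train = np.array([10.0, 12, 11, 13, 12, 14, 13, 15])
y_true = np.array([14.0, 16, 15])
y_pred = np.array([13.5, 15.5, 15.2])
q_levels = [0.1, 0.5, 0.9]
q_pred = np.stack([y_pred - 1.0, y_pred, y_pred + 1.0])

print(f"MASE: {mase(y_true, y_pred, y_train):.3f}")
print(f"CRPS (quantile approx.): {crps_from_quantiles(y_true, q_pred, q_levels):.3f}")
```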
Summary
TimesFM-2.5 demonstrates that foundation models for forecasting are evolving from proof-of-concept to practical, production-ready tools. By cutting the parameter count by more than half, extending the context length, and leading GIFT-Eval in both point and probabilistic accuracy, it marks a clear step forward in efficiency and capability. With Hugging Face access already live and BigQuery/Model Garden integration forthcoming, the model is poised to accelerate the adoption of zero-shot time-series forecasting in real-world applications.
Further Resources
Check out the model card on Hugging Face, the repo, the GIFT-Eval benchmark, and the paper.