Forecasting
Predict future demand, sales, and trends accurately • 54 papers
Time Series Foundations
Build classic time series forecasting models
Time Series Analysis: Forecasting and Control
Foundational text establishing ARIMA and the Box-Jenkins approach to model identification.
A State Space Framework for Automatic Forecasting using Exponential Smoothing
Establishes ETS state space framework enabling model selection, prediction intervals, and likelihood calculation.
The M3-Competition: Results, Conclusions and Implications
Landmark 3,003-series competition establishing that simpler methods often outperform complex ones.
The M4 Competition: 100,000 Time Series and 61 Forecasting Methods
100,000 series showing hybrid statistical-ML methods dominate; pure ML underperformed.
STL: A Seasonal-Trend Decomposition Procedure Based on Loess
Foundational decomposition method separating seasonal, trend, and remainder components using local regression.
MSTL: A Seasonal-Trend Decomposition Algorithm for Time Series with Multiple Seasonal Patterns
Extension of STL handling multiple seasonal periods; essential for complex seasonality like hourly data.
Probabilistic Forecasting
Quantify uncertainty in your predictions
Strictly Proper Scoring Rules, Prediction, and Estimation
Definitive treatment of proper scoring rules; introduces CRPS, energy score, interval score.
Regression Quantiles
Introduced quantile regression for estimating conditional quantiles; foundation for distributional forecasting.
Probabilistic Energy Forecasting: Global Energy Forecasting Competition 2014 and Beyond
GEFCom2014 establishing practical standards for probabilistic forecast evaluation.
Conformal Time-series Forecasting
Distribution-free prediction intervals with guaranteed coverage for time series; handles non-exchangeability.
Multivariate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows
Normalizing flows for flexible multivariate distributions in forecasting; captures complex dependencies.
Hierarchical & Grouped Forecasting
Forecast at multiple levels consistently
The Combination of Forecasts
Seminal paper showing combined forecasts yield lower MSE than individuals; foundation for ensemble methods.
Optimal Forecast Reconciliation for Hierarchical and Grouped Time Series Through Trace Minimization
MinT optimal reconciliation method minimizing forecast variance; outperforms bottom-up and top-down.
Chapter 4: Forecast Combinations
Authoritative review explaining why simple averages often beat optimal weights (estimation error, instability).
Forecast Reconciliation: A Review
Comprehensive review covering cross-sectional, temporal, and cross-temporal reconciliation.
Cross-temporal Forecast Reconciliation: Optimal Combination Method and Heuristic Algorithms
Unified framework for simultaneous cross-sectional and temporal reconciliation; optimal point forecasts.
Deep Learning for Forecasting
Apply deep learning to forecasting problems
DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks
Amazon's global probabilistic forecasting model; pioneered cross-learning for scale-diverse series.
Forecasting at Scale (Prophet)
Meta's additive model with trend, seasonality, holidays; designed for analyst-in-the-loop forecasting.
N-BEATS: Neural Basis Expansion Analysis for Interpretable Time Series Forecasting
First pure DL model to beat M4 winner; interpretable version decomposes into trend and seasonality.
Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting
Google's attention architecture handling static covariates, known future inputs, observed past-only features.
M5 Accuracy Competition: Results, Findings, and Conclusions
42,840 Walmart series; LightGBM and DL dominated; 22% improvement over best benchmark.
Deep State Space Models for Time Series Forecasting
Combines state space models with deep learning; enables interpretable components with neural flexibility.
Transformers & MLPs for Forecasting
Apply attention and MLP architectures to time series
Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting
AAAI Best Paper; ProbSparse attention achieving O(L log L) complexity for long sequences.
Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting
Novel auto-correlation mechanism replacing attention; built-in series decomposition improves interpretability.
FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting
Fourier-enhanced attention capturing global patterns in frequency domain with linear complexity.
A Time Series is Worth 64 Words: Long-term Forecasting with Transformers (PatchTST)
Patching time series like images; channel-independence achieving SOTA on long-term benchmarks.
TSMixer: An All-MLP Architecture for Time Series Forecasting
Google's MLP-Mixer adaptation showing simple MLPs can match or beat transformers on forecasting.
TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting
IBM's TSMixer variant with cross-variate mixing; strong M5 performance with fewer parameters.
Foundation Models for Time Series
Apply pre-trained models to forecasting tasks
Chronos: Learning the Language of Time Series
Amazon's tokenized time series foundation model; zero-shot forecasting matching fine-tuned models.
TimeGPT-1
First commercial time series foundation model; API-based zero-shot forecasting for practitioners.
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
Open-source decoder-only foundation model for probabilistic forecasting; strong zero-shot transfer.
MOMENT: A Family of Open Time-series Foundation Models
CMU's open foundation model supporting forecasting, classification, anomaly detection, and imputation.
A Decoder-Only Foundation Model for Time-Series Forecasting (TimesFM)
Google's 200M parameter foundation model trained on 100B time points; strong zero-shot performance.
Unified Training of Universal Time Series Forecasting Transformers (Moirai)
Salesforce's multi-patch model handling variable frequencies and prediction lengths in single model.
Time-LLM: Time Series Forecasting by Reprogramming Large Language Models
Repurposing frozen LLMs for forecasting via prompt reprogramming; no time series pre-training needed.
Intermittent Demand Forecasting
Forecast sparse, irregular demand patterns
Forecasting and Stock Control for Intermittent Demands
Foundational method separating demand size from demand occurrence; basis for spare parts forecasting.
The Accuracy of Intermittent Demand Estimates
SBA method correcting Croston's bias; widely adopted in supply chain software.
Intermittent Demand Forecasting with Context-Aware Learning
TSB method explicitly modeling demand probability; improved coverage for slow-moving items.
GluonTS: Probabilistic and Neural Time Series Modeling in Python
Amazon's open-source toolkit; includes intermittent demand models and extensive benchmarking.
Time Series Anomaly Detection
Detect anomalies and change points in time series
Bayesian Online Changepoint Detection
Elegant Bayesian framework for online change point detection; foundational for streaming applications.
Time-Series Anomaly Detection Service at Microsoft
Microsoft's production system using spectral residual and CNN; handles millions of time series.
Robust Random Cut Forest Based Anomaly Detection on Streams
Amazon's streaming anomaly detection algorithm; efficient updates and interpretable anomaly scores.
OmniAnomaly: Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network
VAE-based approach for multivariate anomaly detection; captures temporal dependencies and normal patterns.
Causal Impact & Intervention
Measure the impact of events or launches
Inferring Causal Impact Using Bayesian Structural Time-Series Models
Google's CausalImpact for estimating causal effects from observational time series; widely used for attribution.
Synthetic Control Methods for Comparative Case Studies
Foundational synthetic control paper; 'most important innovation in evaluation literature in 15 years'.
Using Synthetic Controls: Feasibility, Data Requirements, and Methodological Aspects
Comprehensive methodological review covering feasibility, inference, and best practices.
Predicting the Present with Bayesian Structural Time Series
Google's BSTS framework for nowcasting with spike-and-slab regression for variable selection.
Difference-in-Differences with Multiple Time Periods
Modern DiD handling staggered treatment timing and heterogeneous effects; doubly-robust estimator.
What's Trending in Difference-in-Differences? A Synthesis of the Recent Econometrics Literature
Comprehensive synthesis of modern DiD developments; essential guide to recent methodological advances.
Production Forecasting Systems
Build and evaluate forecasting systems at scale
Orbit: Probabilistic Forecast with Exponential Smoothing
Uber's Bayesian forecasting framework combining ETS with global models; production-ready with uncertainty.
Forecasting at Scale (Prophet)
Meta's production system enabling analysts to create reliable forecasts with domain knowledge.
Demand Forecasting at Alibaba: Practice and Lessons Learned
Large-scale demand forecasting system handling billions of SKUs; hierarchical and cross-learning approaches.
150 Successful Machine Learning Models: 6 Lessons Learned at Booking.com
Practical lessons from production ML including forecasting; emphasizes offline-online gaps and iteration.
Comparing Predictive Accuracy
The Diebold-Mariano test for forecast comparison; essential for model selection in production.