Question 1

What are the 10 AI agents used in enterprise time series forecasting?

Accepted Answer

The 10 agents are: (1) Project Scoping — translates business briefs into structured specifications; (2) Data Discovery — inventories and profiles all source tables; (3) Data Quality Assessment — scores quality across six dimensions; (4) Data Cleaning — executes remediations on versioned data copies; (5) Feature Engineering — builds a centralised versioned feature store; (6) Series Classification — assigns optimal model configuration per series archetype; (7) Training and Validation — runs walk-forward cross-validation with Optuna hyperparameter search; (8) Production Pipeline — orchestrates weekly Airflow DAG on Dask Kubernetes; (9) Monitoring and Drift Detection — tracks PSI, rolling MAPE, and schema changes; (10) Reporting — generates audience-tailored narratives from warehouse to board level.

Question 2

What machine learning models are used in the forecasting ensemble?

Accepted Answer

The stacking ensemble combines five base learners: Prophet (additive decomposition, strong for seasonal series with calendar effects), LightGBM (gradient boosted trees, strong for cross-series patterns), a bidirectional LSTM (captures long-range sequential dependencies), N-BEATS (neural basis expansion, strong for intermittent series), and a Temporal Fusion Transformer (state-of-the-art for multi-horizon probabilistic forecasting). A Ridge regression meta-learner trained on out-of-fold predictions combines the five models. Only configurations passing MAPE, bias, RMSSE, and probabilistic coverage thresholds at all horizons across five folds are promoted to staging.

Question 3

How does walk-forward cross-validation work for time series?

Accepted Answer

Walk-forward cross-validation uses an expanding training window that mimics real production conditions. The first fold trains on months 1–12 and validates on months 13–15. Each subsequent fold expands the training window by three months. This ensures the model is always trained on data that would have been available at forecast time and validated on genuinely unseen future data — preventing the data leakage that occurs with standard k-fold cross-validation on time series, where future information can contaminate the training set.

Question 4

What is the Population Stability Index and why is it used for forecast monitoring?

Accepted Answer

Population Stability Index (PSI) measures the shift in a statistical distribution between a baseline period (training data) and a monitoring period (recent production data). A PSI below 0.1 indicates no meaningful shift, 0.1–0.2 indicates moderate shift requiring investigation, and above 0.2 triggers automatic re-training. PSI is applied to every feature in the production feature store, detecting when the statistical patterns the model learned during training no longer represent current data — an early warning signal for forecast accuracy degradation before it becomes visible in MAPE metrics.

Question 5

How does the guide handle intermittent demand series?

Accepted Answer

Intermittent demand series — SKUs with many zero-demand periods and sporadic spikes — require different modelling approaches than high-volume stable series. The Series Classification Agent assigns intermittent series to the Intermittent archetype, triggering a specialised configuration: Croston’s method or TSB model for the base forecast, combined with a global cross-series LightGBM trained on all intermittent series simultaneously to leverage pattern sharing. The zero-inflated probabilistic output accounts for both the probability of non-zero demand and its expected magnitude.

Question 6

What does the production Airflow DAG do each week?

Accepted Answer

The weekly Airflow DAG runs on a dynamically scaling Dask Kubernetes cluster and: (1) ingests the latest week of data from all source systems; (2) updates the feature store; (3) scores all 204,000 series using the current production model stack; (4) runs seven automated sanity checks on the forecast output (magnitude bounds, direction consistency, aggregate coherence, coverage validation, bias check, seasonality alignment, holdout comparison); (5) publishes results to a REST API and BI dashboard — all completing before 06:00 every Monday morning.

Question 7

How are forecasts delivered to different business audiences?

Accepted Answer

The Reporting Agent generates four audience-tailored outputs: (1) SKU-level re-order quantity recommendations with confidence intervals for warehouse and logistics teams; (2) category and channel demand forecasts with risk flags for supply chain planning; (3) P&L revenue ranges with P10/P50/P90 uncertainty bands for finance and FP&A; and (4) a one-page strategic outlook with scenario probabilities for the board. LLM-generated plain-English narratives explain the key drivers behind significant forecast changes.

AI Agent Orchestration for Enterprise Time Series Forecasting

What This Guide Covers

The Ten Agent Architecture

Data Quality Assessment — Six Dimensions and the Quality Firewall

Feature Engineering and the Versioned Feature Store

Production Pipeline — Weekly Airflow DAG on Dask Kubernetes

Topics Covered in This Guide

Read the Full Guide + Download Free Sample

Frequently Asked Questions

Brief Summary

Extended Summary

Related Guides in the SimuPro Knowledge Store

SimuPro Data Solutions — Cloud Data Engineering & AI Consultancy