Delphi Forecast Reports

GitHub Repo

Overview

Weekly Fanplots 2024-2025 Season

2024-2025 Season Reports

2023-2024 Season Backtesting

Description of Forecaster Families

The main forecaster families were:

Notes:

Autoregressive models (AR)

Internal name: scaled_pop.

A simple autoregressive model, which predicts using

xt+k=xt7+xt14

where x is the target variable (which can be scaled according to each state’s population or whitened according to another scheme (or both)), t is the data in days, and k{0,7,14,21,28} is the forecast horizon. In practice, we found that adding more lags provided no discernible advantage (such as (0, 7, 14) and (0, 7, 14, 21))

Autoregressive models with exogenous features

Internal name: scaled_pop_seasonal.

xt+k=xt7+xt14+yt7+yt14

where x is the target variable, y is the exogenous variable, and t is the data in days. See the list of exogenous features below. These models could opt into the same seasonal features as the scaled_pop_seasonal forecaster (see below).

Flu exogenous features

Covid exogenous features

Autoregressive models with seasonal features

Internal name: scaled_pop_seasonal.

We tried a few different attempts at incorporating seasonal features:

These models could be combined with the exogenous features (see above).

Autoregressive models with augmented data

Internal name: scaled_pop (with filter_source = "").

This forecaster is still the standard autoregressive model, but with additional training data. Inspired by UMass-flusion, the additional training data consisted of historical data from ILI+ and Flusurv+, which was brought to a comprable level with NHSN and treated as additional observations of the target variable (hence the name “augmented data”). Flusurv was taken from epidata, but ILI+ was constructed by Evan Ray and given to Richard (Berkeley Summer 2024 intern). Naturally, this forecaster was only used for flu, as the same data was not available for covid.

These models could be combined with the exogenous features and the seasonal features (see above).

Scaling Parameters (Data Whitening)

We tried a few different approaches to data whitening.

Climatological

This was our term for a forecaster that directly forecast a distribution built from similar weeks from previous seasons (in analogy with baseline weather forecasting). We found that in some cases it made a reasonable baseline, though when the current season’s peak time was significantly different from the seasons in the training data, it was not particularly effective.

Linear Trend

A simple linear trend model that predicts the median using linear extrapolation from the past 4 weeks of data and then uses residuals to create a distributional forecast.

Climate Linear

An ensemble model that combines a climatological forecast with a linear trend forecast. It is a bilinear interpolation between the two forecasts across the ahead and quantile extremity; as the quantile moves away from the median, and the ahead moves further in the future, the ensemble interpolates between the linear and climate forecasts. As the ahead goes from -1 to 4, it linearly interpolates between a 5% weight on the climate model and a 90% weight on the climate model (so the furthest ahead is mostly a climate model). At the same time, as the quantile level goes further away from the median, it interpolates between a 10% weight on the climate model at the median and a 100% weight on the climate model at either the 1% or 99% quantile levels. In net, at the median -1 ahead, the climate models have a weight of 0.5%, and the linear model of 99.5%.

No Recent Outcome

This was a fall-back forecaster built for the scenario where NHSN data was not going to be reported in time for the start of the forecasting challenge.

A flusion-adjacent model pared down to handle the case of not having the target as a predictor.

x¯t+k=ytkk=0:1+ytkt=0:3

where y here is any set of exogenous variables.

Flatline

A simple “LOCF” forecaster that simply forecasts the last observed value and uses residuals to create a distributional forecast. This is what the FluSight-baseline is based on, so they should be identical.

Exploration Summary 2024-2025

Here we summarize our findings from backtesting a large variety of forecasters on the 2023-2024 season.

Best Performing Families

Flu

The best performing families were:

Covid

The best performing families were:

Important Parameters

Important Notes

One of the most concerning behaviors in our forecasters was the bias towards predicting a down-swing in the target. After a deeper analysis, we concluded that this is due to a downward bias in the data set, which our linear AR models were picking up and translating into coefficients that were less than 1, making declines almost certain. The complete analysis can be found here.