Rui Wang, Yihe Dong, Sercan O. Arik, Rose Yu; Koopman Neural Forecaster for Time Series with Temporal Distribution Shifts
Temporal distributional shifts, with underlying dynamics changing over time, frequently occur in real-world time series, and pose a fundamental challenge for deep neural networks (DNNs). In this paper, we propose a novel deep sequence model based on the Koopman theory for time series forecasting: Koopman Neural Forecaster (KNF) that leverages DNNs to learn the linear Koopman space and the coefficients of chosen measurement functions. KNF imposes appropriate inductive biases for improved robustness against distributional shifts, employing both a global operator to learn shared characteristics, and a local operator to capture changing dynamics, as well as a specially-designed feedback loop to continuously update the learnt operators over time for rapidly varying behaviors. To the best of our knowledge, this is the first time that Koopman theory is applied to real-world chaotic time series without known governing laws. We demonstrate that KNF achieves the superior performance compared to the alternatives, on multiple time series datasets that are shown to suffer from distribution shifts.
To install requirements
pip install -r requirements.txt
-
data/: download raw datasets and preprocess raw data.
- M4/m4_data_gen.py: download M4 data from the M4 git repo and generate M4 train/test sets.
- Cryptos/cryptos_data_gen.py: generate cryptos train/test sets.
- PlayerTraj/traj_data_gen.py: generate basketball trajectory train/test sets.
- sample_data: a small subset of M4-weekly data for testing purposes.
- data_analysis: functions for forecastability, trend and seasonality analysis.
-
modules/: pytorch modules for the KNF.
- data_classes.py: pytorch datasets for M4, Cryptos, PlayerTraj and sample data.
- train_utils.py: training and evaluation functions.
- normalizer.py: pytorch implementation of reversible instance normalization.
- models.py: main KNF modules.
- eval_metrics.py: three different evaluation metrics for three different datasets.
-
run_koopman.py: training script for KNF.
-
args.py: hyperparameters.
-
run.sh: train KNF on the small sample dataset.
-
run_exp.sh: train KNF on all M4, Cryptos, PlayerTraj dataset.
- M4: Download and preprocess M4 data
python data/M4/m4_data_gen.py
- Cryptos: Download
train.csv
andasset_details.csv
from kaggle to currentdata/Cryptos
folder. Runcryptos_data_gen.py
to preprocess Cryptos data
python data/Cryptos/cryptos_data_gen.py
- Traj: Download NBA basketball player trajectory data and unzip all .7z files in
data/PlayerTraj/json_data
. Runtraj_data_gen.py
to preprocess Trajectory data
python data/PlayerTraj/traj_data_gen.py
- run
run.sh
to train a small KNF on a small subset of M4-weekly data indata/sample_data
sh run.sh
- run
run_exp.sh
to train three KNF models on all three datasets.
sh run_exp.sh
- Step1: Save the training and test sets of the new dataset separately as numpy arrays into two npy files. Both should have the shape of (number of time series, length, number of features).
- Step2: Use the
CustomDataset
class inmodules/data_classes.py
to load the data and specify the paths to train and test data files in argumentsdirec
anddirec_test
. - Step3: Change the default
num_feats
inargs.py
accordingly.
If you find this repo useful, please cite our paper.
@inproceedings{wang2022koopman,
title={Koopman Neural Forecaster for Time Series with Temporal Distribution Shifts},
author={Rui Wang and Yihe Dong and Sercan O. Arik and Rose Yu},
journal={arXiv preprint arXiv:2210.03675},
year={2022}
}