Mobility baselines

Baselines and benchmarks for spatio-temporal forecasting problems in transportation

This repository contains source code for reproducing the results in:

Rodrigues, F. On the importance of stationarity, strong baselines and benchmarks in transport prediction problems. In ArXiv, 2022

It consists of code for computing the "HA" and "HA+LR" baselines described in the paper, as well as code for preparing the experimental setups (i.e., train/val/test splits, forecasting horizons, evaluation metrics, etc.) for the following 9 publicly-available datasets:

Name	Type	Timespan	Time granularity	Train/val/test split	Source
PeMSD7(M) - California	traffic speeds	01/04/2016 - 30/06/2016	5 minutes	34/5/5 days	Yu et al., 2018 [1]
Urban1 - South Korea	traffic speeds	01/04/2018 - 30/04/2018	5 minutes	70/10/20 %	Lee and Rhee, 2022 [2]
NYC Citi Bike - New York	pickups and dropoffs	01/04/2016 - 01/04/2016	30 minutes	63/14/14 days	Ye et al., 2021 [3]
PeMSD4 - California	traffic volumes	01/01/2018 - 28/02/2018	5 minutes	60/20/20 %	Choi et al., 2022 [4]
SZ-taxi - Shenzhen	traffic speeds	01/01/2015 - 31/01/2015	15 minutes	80/-/20 %	Zhao et al., 2021 [5]
METR-LA - Los Angeles	traffic speeds	01/03/2012 - 30/06/2012	5 minutes	70/10/20 %	Li et al., 2018 [6]
PEMS-BAY - California	traffic speeds	01/01/2017 - 31/05/2017	5 minutes	70/10/20 %	Li et al., 2018 [6]
NYC Citi Bike - New York	in- and out-flows	01/07/2017 - 30/09/2017	1 hour	80/10/10 %	Xia et al., 2021 [7]
Seattle loop data - Seattle	traffic speeds	01/11/2015 - 31/12/2015	5 minutes	56/-/5 days	Yang et al., 2021 [8]

The goal is to facilitate the comparison between different spatio-temporal forecasting approaches by providing multiple well-defined reference benchmarks.

The repo does not include the actual data, but the table below provides links to where the data can be downloaded:

Name	Download link	Jupyter notebook with baseline
PeMSD7(M) - California	Link (Mirror)	STGCN - Historical average baselines-FINAL.ipynb
Urban1 - South Korea	Link (Mirror)	DDP-GCN - Historical average baselines-FINAL.ipynb
NYC Citi Bike - New York	Link (Mirror)	CGCDemand - Historical average baselines-seq2seq.ipynb
PeMSD4 - California	Link (Mirror)	STG-NCDE - Historical average baselines-FINAL-seq2seq.ipynb
SZ-taxi - Shenzhen	Link (Mirror)	T-GCN - Mobility baselines.ipynb
METR-LA - Los Angeles	Link (Mirror)	DCRNN - Mobility baselines - METR-LA.ipynb
PEMS-BAY - California	Link (Mirror)	DCRNN - Mobility baselines - PEMS-BAY.ipynb
NYC Citi Bike - New York	Link (Mirror)	3D-DGCN - Mobility baselines-FINAL.ipynb
Seattle loop data - Seattle	Link (Mirror)	TransPAI - Mobility baselines.ipynb

The file mobility_detrender.py contains the code for computing the historical averages (HA).

References:

[1] B. Yu, H. Yin, and Z. Zhu, “Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting,” in 27th International Joint Conference on Artificial Intelligence (IJCAI-18), 2018.
[2] K. Lee and W. Rhee, “DDP-GCN: Multi-graph convolutional network for spatiotemporal traffic forecasting,” Transportation Research Part C: Emerging Technologies, vol. 134, p. 103466, 2022.
[3] J. Ye, L. Sun, B. Du, Y. Fu, and H. Xiong, “Coupled layer-wise graph convolution for transportation demand prediction,” in Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21), 2021.
[4] J. Choi, H. Choi, J. Hwang, and N. Park, “Graph neural controlled differential equations for traffic forecasting,” in Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22), 2022.
[5] L. Zhao, Y. Song, C. Zhang, Y. Liu, P. Wang, T. Lin, M. Deng, and H. Li, “T-GCN: A temporal graph convolutional network for traffic prediction,” IEEE Transactions on Intelligent Transportation Systems, vol. 21, no. 9, pp. 3848–3858, 2019.
[6] Y. Li, R. Yu, C. Shahabi, and Y. Liu, “Diffusion convolutional recurrent neural network: Data-driven traffic forecasting,” in International Conference on Learning Representations (ICLR-18), 2018.
[7] T. Xia, J. Lin, Y. Li, J. Feng, P. Hui, F. Sun, D. Guo, and D. Jin, “3DGCN: 3-dimensional dynamic graph convolutional network for citywide crowd flow prediction,” ACM Transactions on Knowledge Discovery from Data (TKDD), vol. 15, no. 6, pp. 1–21, 2021.
[8] J.-M. Yang, Z.-R. Peng, and L. Lin, “Real-time spatiotemporal prediction and imputation of traffic status based on lstm and graph laplacian regularized matrix factorization,” Transportation Research Part C: Emerging Technologies, vol. 129, p. 103228, 2021.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
3D-DGCN-master		3D-DGCN-master
CGCDemandPrediction-main		CGCDemandPrediction-main
DCRNN-master		DCRNN-master
DDP-GCN-main		DDP-GCN-main
STG-NCDE-main		STG-NCDE-main
STGCN_IJCAI-18-master		STGCN_IJCAI-18-master
T-GCN-master		T-GCN-master
TransPAI-master		TransPAI-master
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
evaluation.py		evaluation.py
mobility_detrender.py		mobility_detrender.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mobility baselines

About

Releases

Packages

Languages

cuizy1017/mobility-baselines

Folders and files

Latest commit

History

Repository files navigation

Mobility baselines

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages