This repository contains the code to reproduce the results of my study on traffic demand modeling. The study combines traffic loop sensor data with administrative data to validate and calibrate traffic estimates at a nationwide level. It was carried out in collaboration with CBS (Statistics Netherlands) and is part of DaCiMob, a larger CBS project on traffic demand estimation in the Netherlands. The study is also my thesis project, supervised by Dr. Peter Lugtig from Utrecht University and Yvonne Gootzen from CBS, and it was approved by the Ethical Review Board of the Faculty of Social and Behavioural Sciences of Utrecht University under file number 21-2133.
In this repository, you will find three Jupyter notebooks, a csv file, and a license. The LICENSE file contains all the information you need if you want to use this repository. The content and structure are easier to understand if you have read the thesis.
Below is a table of contents. After that, I explain the structure of the scripts and the csv file, as well as the other data used in this project that cannot be published openly, and I describe alternatives for generating synthetic data.
| File | Purpose | Type |
|---|---|---|
| load_preprocess_sensors.ipynb | Preprocessing traffic loop sensor data | Jupyter notebook |
| link_obs_exp.ipynb | Linking observed to expected traffic counts and creating figures for section 4 of the thesis | Jupyter notebook |
| inspect_model_c.ipynb | Inspecting and modeling the calibration factor and creating figures and tables for section 5 and the appendix of the thesis | Jupyter notebook |
| edges_intensities.csv | Aggregated and preprocessed observed traffic counts from traffic loop sensors | Data |
| LICENSE | License of this repository | License |
At the beginning of each script, I import all packages that are needed. If you are missing a package, you can install it with `!pip install PACKAGE`. I worked in Jupyter notebooks with Python v3.8.5. The main packages used are pandas v1.3.4, NumPy v1.21.4, GeoPandas v0.10.1, and OSMnx v1.1.2. Output is stored within the Jupyter notebook, but I included optional "checkpoints" that you can uncomment to save data frames or figures to csv or pdf for later use. (Note: working in Jupyter notebooks was necessary for me due to the internal structure at CBS.)
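For example, the versions listed above can be installed from within a notebook cell:

```python
# Install the package versions used in this project from a Jupyter cell
# (version pins taken from the list above; adjust to your environment if needed)
!pip install pandas==1.3.4 numpy==1.21.4 geopandas==0.10.1 osmnx==1.1.2
```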
The scripts should be run in the following order and are structured as follows:
- load_preprocess_sensors.ipynb
- link_obs_exp.ipynb
- inspect_model_c.ipynb
**load_preprocess_sensors.ipynb**
This notebook contains the method I used to load and preprocess the sensor data from CBS. This is less trivial than you might expect, due to the size of the data set. If you do not have access to the internal CBS server, you cannot run this script.
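For orientation only, here is a minimal sketch of the chunked-reading pattern that a data set of this size typically requires; the file and column names are placeholders, not the actual CBS data layout:

```python
import pandas as pd

# Placeholder sketch: read a large sensor file in chunks instead of all at once.
# "sensor_data.csv" and the column names are hypothetical; the real data is
# stored on the internal CBS server in a different layout.
chunks = []
for chunk in pd.read_csv("sensor_data.csv", chunksize=1_000_000):
    # keep only the columns needed downstream before concatenating
    chunks.append(chunk[["edge_id", "timestamp", "intensity"]])

sensors = pd.concat(chunks, ignore_index=True)
```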
**link_obs_exp.ipynb**
- First, the sensor data produced by the previous script is aggregated.
- If you do not have access to CBS, the script indicates the point at which you can start from edges_intensities.csv instead.
- I also load the road network data from OpenStreetMap. This takes a while (an hour or more) because it loads the entire Dutch road network; see the sketch after this list.
- I inspect and visualize the observed traffic counts.
- Then, I load the expected counts that were provided by CBS.
- If you do not have access to CBS data, you will find a chunk that you can uncomment to generate synthetic data.
- I inspect and visualize the expected counts and link them to the observed counts.
- So that you do not have to reload the entire road network from OSM for the next script, I included chunks that store the desired objects, which can then be called in the next script; the sketch after this list also shows one way to do this.
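As an illustration of the OSM step and the checkpointing between scripts, here is a sketch with OSMnx. `network_type="drive"` is my assumption, and the notebook's own checkpoint chunks may use a different mechanism:

```python
import osmnx as ox

# Download the Dutch road network as a graph; this is the slow step
# (an hour or more). network_type="drive" is an assumption for this sketch.
G = ox.graph_from_place("Netherlands", network_type="drive")

# One way to persist the graph so the next notebook can reuse it
# without re-downloading:
ox.save_graphml(G, "nl_network.graphml")

# At the start of the next script:
G = ox.load_graphml("nl_network.graphml")
```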
**inspect_model_c.ipynb**
- To avoid loading the entire road network from OSM again, I call the stored objects at the beginning of this script.
- After that, I inspect the calibration factor and the quality of the expected counts as an estimate of the observed counts, both visually and numerically.
- I then run and compare multiple "calibration models" to predict the calibration factor on road segments that have no observed counts; a toy sketch follows this list.
- Finally, I inspect whether calibrating with the calibration model improved the expected counts, both visually and numerically.
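As a toy illustration of this workflow (not the models from the thesis): assuming the calibration factor is the ratio of observed to expected counts (see the thesis for the exact definition), and using population density as a predictor (see the Regional data section below), the bare-bones version looks like this:

```python
import numpy as np
import pandas as pd

# Toy sketch, not the models from the thesis. Assumption: the calibration
# factor c is the ratio of observed to expected counts (see the thesis for
# the exact definition). All numbers below are placeholders.
df = pd.DataFrame({
    "observed":    [1200.0,  800.0, 1500.0],   # placeholder observed counts
    "expected":    [1000.0, 1000.0, 1000.0],   # placeholder expected counts
    "pop_density": [ 500.0, 1500.0, 3000.0],   # placeholder inhabitants per km^2
})
df["c"] = df["observed"] / df["expected"]

# Simplest possible "calibration model": c linear in population density.
slope, intercept = np.polyfit(df["pop_density"], df["c"], deg=1)
df["c_hat"] = intercept + slope * df["pop_density"]

# Calibrate the expected counts with the predicted factor; on segments
# without sensors, the predicted c_hat is the only factor available.
df["expected_calibrated"] = df["expected"] * df["c_hat"]
```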
Besides the scripts, the following data sources are used in this project:

- edges_intensities.csv
- Regional data
- Infrastructure data
- Expected traffic counts

**edges_intensities.csv**
This csv contains preprocessed and aggregated traffic sensor data. Observed traffic counts are provided by the Nationaal Dataportaal. I used a reformatted version of this data that is stored on a CBS server. If you do not have access to CBS, you can load the preprocessed and aggregated data that is provided in this csv.
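If you start from this csv, loading it is a one-liner:

```python
import pandas as pd

# Load the preprocessed, aggregated observed counts shipped with this repository
edges_intensities = pd.read_csv("edges_intensities.csv")
```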
**Regional data**
In script 3, I use a csv file that contains the population density of each municipality in 2019. The source of this data is the open data portal of CBS, which you can access directly here. I do not hold the license to this data, so you have to fetch it and create your own csv. The population density is used as a predictor when modeling the calibration factor in script 3. You also need the municipality that each sensor is located in. I determined this using a shapefile from CBS, which I cannot upload publicly because it is not mine. If you do not have access to the internal CBS server, you can either discard population density as a predictor (which will not have a large effect on the results), or look for other public shapefiles of Dutch municipalities, which should be available on the internet.
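One possible way to attach a municipality to each sensor with a public shapefile is a spatial join in GeoPandas; all file names in this sketch are placeholders:

```python
import geopandas as gpd

# Hypothetical sketch: join each sensor location to the municipality polygon
# that contains it. File names are placeholders; use whichever public
# municipality shapefile you obtain.
municipalities = gpd.read_file("municipalities_2019.shp")
sensors = gpd.read_file("sensors.geojson").to_crs(municipalities.crs)

# "within" keeps each sensor point and adds the attributes of its municipality
sensors = gpd.sjoin(sensors, municipalities, how="left", predicate="within")
```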
**Infrastructure data**
Infrastructure data on the road network is loaded directly from OpenStreetMap, using the OSMnx package, as a graph with nodes and edges.
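For working with the network as tabular data, the graph can be converted into GeoDataFrames of nodes and edges. A sketch, reusing the graph persisted in the earlier sketch:

```python
import osmnx as ox

# Reload the persisted graph (see the sketch in the link_obs_exp.ipynb section)
# and split it into GeoDataFrames; the edges carry the OSM road attributes.
G = ox.load_graphml("nl_network.graphml")
nodes, edges = ox.graph_to_gdfs(G)
```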
**Expected traffic counts**
Expected counts were obtained by CBS using administrative data (see this paper by Gootzen et al. 2020 for more details) and survey data from the ODiN survey (see this paper by Boonstra et al. 2021 for more information on the ODiN survey). CBS provided me with the resulting expected count for each road segment. These data sources are sensitive, so I can publish neither the sources nor the resulting expected counts in this repository. If you are a researcher interested in working with this data, get in touch with the CBS infoservice. To reproduce my results without access to the data, I included a code chunk in script 2 that generates synthetic data: it creates a data frame with edges and an expected count for each edge, randomly sampled from a gamma distribution.
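A stand-in for that chunk might look as follows; the gamma parameters and edge identifiers here are placeholders, not the values used in the notebook:

```python
import numpy as np
import pandas as pd

# Illustrative stand-in for the synthetic-data chunk in script 2: one expected
# count per edge, drawn from a gamma distribution. Shape/scale values and the
# edge identifiers are placeholders, not the notebook's actual parameters.
rng = np.random.default_rng(seed=42)
edge_ids = np.arange(5)  # placeholder edge identifiers

synthetic_expected = pd.DataFrame({
    "edge_id": edge_ids,
    "expected_count": rng.gamma(shape=2.0, scale=500.0, size=len(edge_ids)),
})
```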