Skip to content

ScrippsDC/neglectedBridges

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Neglected Bridges

This repository contains data and code to reproduce the data findings in Thousands of bridges left behind in race to rebuild infrastructure, published on August 10, 2023.

Namely:

More than 14,000 bridges in all 50 states and Washington, D.C., have been ranked in poor condition for at least a decade. Combined, they carry over 46 million passengers every day. While a bridge in "poor" condition doesn't mean it will collapse, it may require weight limits for trucks and more frequent inspections. Bridges in poor condition are at greater risk of closure for safety concerns. Repairing all of the poor bridges identified in the Scripps News analysis would cost at least $97 billion.

A follow-up piece, Buttigieg defends effort to fix ailing bridges across the country, was published on August 22, 2023.

Data

The National Bridge Inventory was the main data source of this piece, updated annually. Data on bridge conditions from 2014-2022 is from the InfoBridge portal.

Under the "Bridge Condition Transition History" tab, we filtered for bridges that had a poor rating in 2014 and a poor rating in 2022. (During ETL, we filtered for bridges that were (1) poorly rated or (2) not rated from 2015-2021.) This file is saved at data/infobridge/Poor_2014_Poor_2022.txt.

The 2023 NBI data was released on June 27th, 2023 but not yet available in InfoBridge. As a result, we downloaded the 2023 data directly from the NBI site, specifically the version under "Download all records. Includes non-highway and routes under bridges zip file (57 mb)." This file is saved at data/NBI/2023AllRecordsDelimitedAllStates.txt but is not included in this repository because it is larger than GitHub's 100MB file limit.

Documentation for the NBI can be found here.

ETL and analysis

All the etl and analysis for this piece is in the file etl_analysis/1_poor.py, which was run in the terminal.

ETL summary:

Start with:

  • Poor_2014_Poor_2022.txt: 20,008 rows
  • 2023AllRecordsDelimitedAllStates.txt: 737,137 rows

Filter 2023AllRecordsDelimitedAllStates.txt for poor bridges. Assign to variable nbi: 42,404 rows

Filter Poor_2014_Poor_2022.txt for bridges that were also poor for all years (or had gaps in their inspection record) from 2015-2021. Assign to variable all_poor: 18,354 rows

Create a unique identifier in both filtered datasets (nbi and all_poor):

  • Add a state fips code to nbi, (all_poor already has one)
  • Remove leading and trailing whitespace, and leading zeroes from bridge ID.
  • Combine the cleaned bridge ID with state fips to create a column 'ID'

Use 'ID' to filter nbi only for bridges that also appear in all_poor -- i.e. bridges that were poor in 2014, 2022, 2023, and were either poor or had a missing inspection record for all years 2015 to 2021. Assign to variable final: 16,220 rows

Filter final only for bridges that were open in 2023 (aka where 'OPEN_CLOSED_POSTED_041' was not “K”), overwrite final: 14,570 rows

Output saved at: data/processed/etl_1_poor.csv

Analysis:

  • Count the number of bridges: 14,570
  • Count the number of states: 50 states + DC + Puerto Rico (52 total)
  • Sum the average daily traffic ('ADT_029'): 46,587,345 vehicles
  • Sum the costs of improvement ('TOTAL_IMP_COST_096', treating NaNs as $0): $97,366,070,000

Interactive Graphics

Map

The interactive map in the web story maps the bridges in final by their coordinates in the 2022 data in infobridge (data/infobridge/NBI_2022_Poor.txt), joined by the same unique identifier we constructed in the ETL step (column 'ID').

For the following cases, the 2022 coordinates were clearly outside of the United States, and we manually replaced the coordinates with data from previous years:

The dataset to create the Flourish graphic is at: data/manual/flourish_map.csv.

Bar Chart

The follow-up piece has an additional graphic showing the increase in bridges needing repair over the past decade. The script etl_analysis/2_bridge_condition_counts.py tabulates the condition of bridges in the NBI from 2016-2023, downloading each year directly from the NBI site. This is the same source we used for 2023 in the main analysis. We save the tabulated data at data/processed/etl_2_bridge_condition_counts.csv.

We checked the counts from 2016-2022 with the numbers published by the Bureau of Transportation Statistics, and found that they were the same. As a result, the final graphic uses 2014-2022 numbers from the Bureau of Transportation Statistics and 2023 numbers from data/processed/etl_2_bridge_condition_counts.csv.

Other data elements

In the video piece, there is a map of counties that received funding from the Bipartisan Infrastructure Law. The list of counties was taken from the Bipartisan Infrastructure Law (BIL) Maps Dashboard, filtering for projects containing the word "bridge". To get the coordinates, the counties were mapped in Tableau and their coordinates were exported as a csv and deduplicated. The list of coordinates can be found at data/manual/BIL_bridge_project_coordinates_dedupe_reformat.csv.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages