Corn Soy Data Layer

This repo contains code that walks through key steps to create and validate the Corn-Soy Data Layer (CSDL), a map that classifies corn and soybean in 13 states in the US Midwest from 1999-2018 at 30m resolution. Although the USDA's Cropland Data Layer (CDL) offers crop type maps across the conterminous US from 2008 onward, such maps are missing in many Midwestern states or are uneven in quality before 2008. To fill these data gaps, we used the now-public Landsat archive and cloud computing services to map corn and soybean, the primary crops in the Midwest, back to 1999.

Dataset

Our dataset can be accessed through one of two ways:

Google Earth Engine asset here
Zenodo repo housing GeoTIFFs here

Map legend:

0 = outside study area
1 = corn
5 = soy
9 = other crop
255 = non-crop (masked by NLCD)

Values were chosen to be consistent with CDL values when possible.

When using the dataset, please cite:

Usage Notes

We recommend that users consider metrics such as (1) user's and producer's accuracy with CDL and (2) R2 with NASS statistics across space and time to determine in which states/counties and years CSDL is of high quality. This can be done with the CSV file of user's and producer's accuracies and annual county-level statistics we have included in this repo.

Code dependencies

To sample training points: R version 3.5.1, dplyr 0.8.0.1, sf 0.6-3, raster 2.6-7, rgdal 1.3-4, salustools 0.1.0, sp 1.3-1
To train our classifier and create the final maps: Google Earth Engine
To perform analyses: Python 3.7.3, numpy 1.16.4, pandas 0.24.2, matplotlib 3.1.0, sklearn 0.21.2, plotly 4.5.0

Map creation

Sample a set of training coordinates. [R Markdown file]
Export Landsat harmonic regression features. [Earth Engine script]
After feature selection, assemble data into a dataframe for ingestion into GEE. [Jupyter notebook]
Train random forest classifier in GEE. [Earth Engine script]

Map validation and error analysis

Aggregated CSDL versus county-level NASS statistics. [Jupyter notebook]
County-level CSDL time trends versus NASS time trends. [Jupyter notebook]
Validate CSDL against ARMS crop rotation statistics. [Jupyter notebook]
Landsat availability over the years. [Jupyter notebook]

Name	Name	Last commit message	Last commit date
Latest commit Sherrie Wang Update README.md Oct 20, 2020 dc3e40f · Oct 20, 2020 History 30 Commits
create_map	create_map	notebooks cleaned	Apr 6, 2020
data	data	updated for revision	Jun 22, 2020
results	results	added user's and producer's accuracy, README image	Jul 13, 2020
validate_map	validate_map	updated for revision	Jun 22, 2020
.gitignore	.gitignore	added user's and producer's accuracy, README image	Jul 13, 2020
LICENSE	LICENSE	Initial commit	Apr 3, 2020
README.md	README.md	Update README.md	Oct 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Corn Soy Data Layer

Dataset

Usage Notes

Code dependencies

Map creation

Map validation and error analysis

About

Releases

Packages

Languages

License

LobellLab/csdl

Folders and files

Latest commit

History

Repository files navigation

Corn Soy Data Layer

Dataset

Usage Notes

Code dependencies

Map creation

Map validation and error analysis

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages