Source: https://archive.ics.uci.edu/ml/machine-learning-databases/adult/
Source: https://archive.ics.uci.edu/ml/machine-learning-databases/statlog/german/
Source: https://github.com/propublica/compas-analysis
Source: https://ww2.amstat.org/publications/jse/v18n3/RicciData.csv
Citation: Miao, Weiwen. "Did the results of promotion exams have a disparate impact on minorities? Using statistical evidence in Ricci v. DeStefano." Journal of Statistics Education 18.3 (2010): 1-26. http://ww2.amstat.org/publications/jse/v18n3/miao.pdf
To add a data set, you need to:
- Choose a single-word, lowercase name to identify your data set.
- Put the raw data set in the raw/ directory as name.csv. Add any information about the data as name.txt.
- Create a class in Name.py that extends the class in Data.py, implements all the required methods, and fills in the fields. Add it to objects/.
- Add your dataset object to the list in objects/list.py.
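
As a rough illustration of the steps above, here is a minimal sketch of what a new dataset class and its registration might look like. The base class shown is a hypothetical stand-in so that the example runs on its own; the real Data.py may require different fields and methods, so check it before copying anything. The names `Example`, `class_attr`, `sensitive_attrs`, `get_raw_filename`, and `DATASETS` are all assumptions made for illustration.

```python
# Hypothetical stand-in for the base class in Data.py. The real class
# may define different fields and methods.
class Data:
    def __init__(self):
        self.dataset_name = None   # single-word, lowercase identifier
        self.class_attr = None     # assumed: column holding the label
        self.sensitive_attrs = []  # assumed: protected attribute columns

    def get_dataset_name(self):
        return self.dataset_name

    def get_raw_filename(self):
        # Mirrors the raw/name.csv convention described above.
        return "raw/" + self.dataset_name + ".csv"


# Would live in objects/Example.py (illustrative name).
class Example(Data):
    def __init__(self):
        Data.__init__(self)
        self.dataset_name = "example"
        self.class_attr = "label"
        self.sensitive_attrs = ["group"]


# Would be appended to the list in objects/list.py. The variable name
# DATASETS is an assumption, not the name used in that file.
DATASETS = [Example()]

for dataset in DATASETS:
    print(dataset.get_dataset_name(), dataset.get_raw_filename())
```

Keeping the data set name in one field and deriving the raw file path from it means the name chosen in the first step only has to be written once.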
All preprocessed versions of the data should be committed to the preprocessed directory. To regenerate them, run:
python3 preprocess.py