A repo for teaching data science to high school students
verson 1: Apr 18, 2020.
https://github.com/aayancheng/HighSchoolDataScience | video introduction
There are 3 notebooks (about 30 mins each):
There are 2 ways to run the notebooks:
- Easy and no installation. Simply Click the button to launch a binder session.
- More flexibile and require installation. Install Jupyter Lab on your PC via Anaconda package and copy the codes to run in your local session.
- Gap Minder: https://demo.bokeh.org/gapminder
- Kaggle Data: https://www.kaggle.com/datasets?sort=votes
- Paid courses:
- Linear Regression: https://www.ritchieng.com/machine-learning-evaluate-linear-regression-model
- Logistic Regression: https://www.datacamp.com/community/tutorials/understanding-logistic-regression-python
- Paid courses:
- Predicting Credit Card Approvals: https://www.datacamp.com/projects/558
- Linear Modeling in Python: https://learn.datacamp.com/courses/introduction-to-linear-modeling-in-python
- MNIST Images (https://www.tensorflow.org/overview/)
- Fashion MNIST: https://towardsdatascience.com/mnist-cnn-python-c61a5bce7a19
- Paid Courses:
- ASL Recognition with Deep Learning: https://www.datacamp.com/projects/509
- Extract Stock Sentiment from News Headlines: https://www.datacamp.com/projects/611
A few places to start:
- Foundation of Data Science at Berkeley http://data8.org/
- MIT introduction to deep learning: http://introtodeeplearning.com/2019/index.html
- Deep Learning: https://lilianweng.github.io/lil-log/2017/06/21/an-overview-of-deep-learning.html
- Deep Learning and the Game of Go: https://www.amazon.ca/gp/product/1617295329/ref=ox_sc_act_title_1?smid=A3DWYIK6Y9EEQB&psc=1