Stars
📚 Freely available programming books
Library for fast text representation and classification.
A curated list of awesome warez and piracy links
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
Official repo for the #tidytuesday project
Sample files to accompany the FT's Chart Doctor column
Repo for Yale Applied Empirical Methods PhD Course
Parallel Computing and Scientific Machine Learning (SciML): Methods and Applications (MIT 18.337J/6.338J)
re_data - fix data issues before your users & CEO would discover them 😊
Code for a dynamic multilevel Bayesian model to predict US presidential elections. Written in R and Stan.
Code and Resources for "Feature Engineering and Selection: A Practical Approach for Predictive Models" by Kuhn and Johnson
A list of software and papers related to automatic and fast Exploratory Data Analysis
Slides and hands-on codes for my talk "ggplot Wizardry: My Favorite Tricks and Secrets for Beautiful Plots in R" at the 1st OutlierConf, February 4–7 2021.
Scalable graph analytics database powered by a multithreaded, vectorized temporal engine, written in Rust
A lightweight, modern and flexible, log4j and futile.logger inspired logging utility for R
An interactive free online short course on the drake R package
Class materials for "Economics, Causality, and Analytics"
The most recent version of the Applied Machine Learning notes
Master's-level applied econometrics course, focusing on prediction, at the University of Oregon (EC424/524 during Winter quarter, 2020). Taught by Ed Rubin.
Translating ML into Bayes, one line at a time
sftrack: Modern classes for tracking and movement data