Stars
MLOps End-to-End Example using Amazon SageMaker Pipeline, AWS CodePipeline and AWS CDK
An R package to load, explore and work with the most recent V-Dem (Varieties of Democracy) dataset.
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
Up Your Bus Number: A Primer for Reproducible Data Science
The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process is documented in this repo.
A template for data analysis projects structured as R packages (or not)
Reproducible Research in R: An advanced workshop on creating collaborative and automated analysis pipelines
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's …
ML-Ensemble – high performance ensemble learning
A multi-backend implementation of the Keras API, with support for TensorFlow, JAX, and PyTorch.
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
A curated list of awesome ggplot2 tutorials, packages etc.
A collection of learning resources for curious software engineers
Lightning ⚡️ fast forecasting with statistical and econometric models.
Code and plots for submissions to the #tidytuesday challenge
A VS Code extension pack to help users visualize, understand, and interact with data.
Python code for "Probabilistic Machine learning" book by Kevin Murphy
Practical Python Programming (course by @dabeaz)
Tutorials and training material for the H2O Machine Learning Platform
A collection of data science and machine learning resources that I've found helpful (I only post what I've read!)
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…
Presentation-Ready Data Summary and Analytic Result Tables
Comprehensive list of color palettes available in R ❤️🧡💛💚💙💜
Official repo for the #tidytuesday project