MLOps End-to-End Example using Amazon SageMaker Pipeline, AWS CodePipeline and AWS CDK

TypeScript 139 88 Updated Feb 11, 2025

The uncompromising Python code formatter

Python 39,898 2,554 Updated Mar 6, 2025

An R package to load, explore and work with the most recent V-Dem (Varieties of Democracy) dataset.

R 118 24 Updated Mar 17, 2024

A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin

6,303 632 Updated Mar 10, 2025

Up Your Bus Number: A Primer for Reproducible Data Science

Jupyter Notebook 68 21 Updated Feb 23, 2019

The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process is documented in this repo.

503 72 Updated May 4, 2021

Sample projects using Ploomber.

Jupyter Notebook 86 25 Updated Jan 25, 2024

A template for data analysis projects structured as R packages (or not)

R 175 23 Updated May 24, 2021

Reproducible Research in R: An advanced workshop on creating collaborative and automated analysis pipelines

TeX 4 1 Updated Feb 26, 2025

A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

Python 8,627 2,501 Updated Mar 10, 2025

TimeGPT-1: production ready pre-trained Time Series Foundation Model for forecasting and anomaly detection. Generative pretrained transformer for time series trained on over 100B data points. It's …

Jupyter Notebook 2,626 215 Updated Mar 5, 2025

ML-Ensemble – high performance ensemble learning

Python 851 108 Updated Nov 13, 2023

A multi-backend implementation of the Keras API, with support for TensorFlow, JAX, and PyTorch.

Python 1,267 117 Updated Jul 25, 2024

Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.

Jupyter Notebook 2,869 513 Updated Dec 9, 2024

A curated list of awesome ggplot2 tutorials, packages etc.

1,630 173 Updated Feb 27, 2025

A collection of learning resources for curious software engineers

Python 47,401 3,759 Updated Mar 8, 2025

Lightning ⚡️ fast forecasting with statistical and econometric models.

Python 4,179 302 Updated Mar 3, 2025

rstudio::conf(2022, "program")

R 60 54 Updated Aug 23, 2022

Code and plots for submissions to the #tidytuesday challenge

R 716 115 Updated Mar 9, 2025

A VS Code extension pack to help users visualize, understand, and interact with data.

567 26 Updated May 10, 2021

Python code for the "Probabilistic Machine Learning" book by Kevin Murphy

Jupyter Notebook 6,679 1,549 Updated Nov 26, 2024

Practical Python Programming (course by @dabeaz)

Python 10,113 6,706 Updated Aug 10, 2024

Tutorials and training material for the H2O Machine Learning Platform

Jupyter Notebook 1,488 1,000 Updated Oct 24, 2024

A collection of data science and machine learning resources that I've found helpful (I only post what I've read!)

533 121 Updated Jul 8, 2024

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…

Python 7,337 945 Updated Mar 10, 2025

Presentation-Ready Data Summary and Analytic Result Tables

R 1,088 129 Updated Mar 11, 2025

Comprehensive list of color palettes available in R ❤️🧡💛💚💙💜

R 1,560 140 Updated Aug 15, 2024

Official repo for the #tidytuesday project

HTML 7,223 2,438 Updated Mar 10, 2025

Themes for ggplot2.

R 900 108 Updated May 7, 2022