Skip to content
View smh2019's full-sized avatar

Block or report smh2019

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,521 2,090 Updated Nov 3, 2023

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 296,692 49,326 Updated Dec 2, 2024

Reference implementation for Structured Prediction with Deep Value Networks

Jupyter Notebook 55 13 Updated Jul 10, 2017

MIT Probabilistic Computing Project software stack

Shell 1 Updated Sep 18, 2017

Downloads and archives content from reddit

Python 2,386 221 Updated Nov 17, 2024

this repository accompanies the book "Grokking Deep Learning"

Jupyter Notebook 7,556 1,600 Updated Jun 1, 2024

A curated list of awesome Bioinformatics libraries and software.

3,410 628 Updated Mar 21, 2025

Resources for learning about Text Mining and Natural Language Processing

576 199 Updated Feb 9, 2023

🌎 machine learning tutorials (mainly in Python3)

HTML 3,232 648 Updated Oct 24, 2024

A repo for data science related questions and answers

Jupyter Notebook 2,423 656 Updated Oct 6, 2022

120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.

Python 30,151 4,530 Updated May 8, 2024

Examples of bad data, especially from government.

HTML 23 10 Updated Aug 1, 2024

The scraper, parser, and database creation scripts for Financial Management Service daily U.S. Treasury statements.

Python 105 27 Updated Dec 29, 2018

A complete computer science study plan to become a software engineer.

314,535 78,410 Updated Dec 5, 2024

Pushshift API

Python 1,332 114 Updated Apr 6, 2023

There is a continuous stream of user activity events generated from multiple users as they use our mobile Cube app. Objective is to implement a server to ingest these events. The server will expose…

Python 1 Updated May 26, 2018

Polls the Reddit API for the specified subreddit

Python 1 Updated Oct 23, 2018

Bootstrap Kubernetes the hard way. No scripts.

43,564 14,690 Updated Apr 10, 2025

Simple code for extracting data from excel sheet and Ingest into AWS S3 bucket

Python 5 2 Updated Sep 2, 2018

Predicted Bay Area bike share demand with Spark MLlib and built a pipeline to bridge Amazon S3, MongoDB server, and Spark EC2 cluster for NoSQL data processing.

Jupyter Notebook 1 Updated Jun 6, 2018

🐬 A comprehensive tutorial on getting started with Docker!

SCSS 5,801 2,191 Updated Jul 23, 2024

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…

Python 28,111 7,965 Updated Mar 20, 2024
Python 2 Updated Mar 14, 2018

A data pipeline to daily pull public transport data from the opentransportdata.swiss portal. This pipeline has three tasks, pull the right data from opentransportdata.swiss, push the data to s3 for…

Python 3 Updated Mar 28, 2018

A serverless data processing pipeline to store Census data in AWS S3.

JavaScript 2 1 Updated Jul 9, 2018

Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying

Python 13 3 Updated May 22, 2023

Data ingestion on AWS

HCL 1 Updated Oct 1, 2018

Data ingestion for Amazon Elasticsearch Service from S3 and Amazon Kinesis, using AWS Lambda: Sample code

JavaScript 390 173 Updated Apr 9, 2019

Website to tell visitors whether a Company is an MLM

JavaScript 7 1 Updated Jan 4, 2023
Next