Stars
⚡ A collection of resources and tutorials to design a better database schema.
The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such as Apache Airflow, AWS Redshift and Power BI.
📖 R 语言数据分析实战(写作中) Data Analysis in Action Using R
The best place to learn data engineering. Built and maintained by the data engineering community.
A repository of Applied Data Science materials.
This repository contains supplemental materials for the NYU Schack Institute of Real Estate course in Real Estate Finance.
A curated list of awesome Anki add-ons, decks and resources
Decks of Anki flashcards shared publicly from my side.
A Full-Stack Authentication App With React, Express, and MongoDB
Complete solution for login and registration in React apps using express and oauth2 as an authentication backend.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
A project to design a fact and dimension star schema for optimizing queries on a flight booking database using PostgreSQL, a relational database management system. This schema is well-suited for a …
Create Data Lake on AWS S3 to store dimensional tables after processing data using Spark on AWS EMR cluster
Personal Data Engineering Projects
Learn how to design large-scale systems. Prep for the system design interview.
Repo for: Predicting Airbnb prices - IBM's Data Science Professional Certificate Capstone
[BA project] Dynamic Pricing Optimization for Airbnb listing to optimize yearly profit for host. Use Clustering for competitive analysis, kNN regression for demand forecasting, and find dynamic opt…
💡 LeetCode in C++20/Java/Python/MySQL/TypeScript (respect coding conventions)
My solution to the book <A collection of Data Science Take-home Challenges>
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Displacement Risk Index Indicator Calculations
An assessment of displacement risk
The Urban Displacement Project's Displacement Typology Map code
The goal of this project is to illustrate Extract Transform Load (ETL) using Python and SQL. ETL is a process commonly done in computing, which takes raw data, cleans it and stores it for later use…
This project repository provides a headless module to enrich location data in a database table using the Google Maps Geocode API.
Compilation of resources for aspiring data scientists