Lists (8)
APIs/Wrappers
Books & Code
Repositories containing code from books (e.g., O'Reilly)
Cassandra + Flask
Data Engineering Projects
Data Structures and Algorithms
Guides/Manuals/Handbooks
Templates
UofT MSc Statistics
Stars
A curated list of awesome places to learn and/or practice algorithms.
Resources to learn data structures and algorithms, ace competitive programming, and get a job in tech/CS
Final Project for STA2101 Methods of Applied Statistics at the University of Toronto
📖 A collection of pure bash alternatives to external processes.
This is a repo with links to everything you'd ever want to learn about data engineering
Curated Data Science resources (Free & Paid) to help aspiring and experienced data scientists learn, grow, and advance their careers.
A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)
This repository shows how to stand up a three-node Cassandra cluster and a Flask web server to display the data in the Cassandra database. The scripts in the init-scripts folder create a keyspace called…
Beautiful Interactive tables in your Flask templates.
Starting with Cassandra on Python Flask
Pull current and historical baseball statistics using Python (Statcast, Baseball Reference, FanGraphs)
A Python API to retrieve and read MLB GameDay data
This project repository provides a headless module to enrich location data in a database table using the Google Maps Geocode API.
Unofficial packages for Ring Doorbells, Cameras, Alarm System, and Smart Lighting
Implementing best practices for PySpark ETL jobs and applications.
Generic ETL Pipeline Framework for Apache Spark
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
Extract, Transform, Load: Any SQL Database in 4 lines of Code.
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
One framework to develop, deploy and operate data workflows with Python and SQL.