-
-
dbt-docker Public
This is a dbt example you can use to populate dbt seeds, models, snapshots and tests for experimentation.
Dockerfile MIT License UpdatedSep 18, 2023 -
-
mage-ai Public
Forked from mage-ai/mage-ai🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Python Apache License 2.0 UpdatedFeb 18, 2023 -
-
orchest Public
Forked from orchest/orchestBuild data pipelines, the easy way 🛠️
Python GNU Affero General Public License v3.0 UpdatedFeb 16, 2023 -
mara-pipelines Public
Forked from mara/mara-pipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Python MIT License UpdatedFeb 16, 2023 -
piicatcher Public
Forked from tokern/piicatcherScan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
Python Apache License 2.0 UpdatedFeb 7, 2023 -
DataEngineeringProject Public
Forked from damklis/DataEngineeringProjectExample end to end data engineering project.
Python MIT License UpdatedDec 8, 2022 -
steam-data-engineering Public
Forked from VicenteYago/steam-data-engineeringA data engineering project with Airflow, dbt, Terrafrom, GCP and much more!
Python MIT License UpdatedNov 8, 2022 -
data-engineering-for-ml Public
Forked from GokuMohandas/data-engineeringConstruct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
Jupyter Notebook UpdatedSep 12, 2022 -
streamify Public
Forked from ankurchavda/streamifyA data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
Python UpdatedApr 16, 2022 -
portfolio_website Public
Forked from DavidCastilloAlvarado/portfolio_websiteTutorial created by Enyel Sequeira, taught by JavaScript Mastery
JavaScript UpdatedFeb 3, 2022 -
motorway Public
Forked from plecto/motorwayCloud ready pure-python streaming data pipeline library
Python Apache License 2.0 UpdatedJan 4, 2022 -
Kafka_Examples Public
Scripts and samples to support Kafka Confluent Platform with other integrations
-
MS_Data_Science Public
This repository covers the MS-Academy certification track for Data Science
Jupyter Notebook UpdatedAug 22, 2018 -
DatacampProjects Public
This repository covers the Datacamp Projects with Python Notebooks
Jupyter Notebook UpdatedAug 21, 2018