Data pipelines from re-usable components
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
The open-source Useful SDK. One Python decorator in the Useful library allows for full observability of Python functions within an ETL (a generic sketch of this decorator pattern appears below).
A project structure for doing and sharing data engineering work.
e-Portfolio showcasing my personal projects.
Build ETL pipelines on Airflow to load data from BigQuery and store it in MySQL (a minimal DAG sketch of this pattern appears below).
AutoDS-Prep automates the data pre-processing step of Data Science Projects.
An extension that registers all pharmacies in Argentina.
A deployed machine learning model that automatically classifies incoming disaster messages into 36 related categories. Developed as part of Udacity's Data Science Nanodegree program.
JSON-driven ETL pipeline framework prototype
This repo contains the DAGs that run on my local Airflow environment. I use the local environment to test my DAGs before deploying them to virtual machines via Kubernetes.
Weaving together different threads (services like image/audio conversion, ETL services, etc.) to enable the World Wide Flow.
A Python and Spark based ETL framework. While it operates within the limits of a framework and its standards, it offers boundless possibilities.
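
The Useful SDK's actual API is not shown here; as a generic illustration of the decorator-based observability pattern mentioned in the Useful SDK entry above, the sketch below wraps an ETL step with timing and logging. The observe decorator, the transform step, and the sample rows are hypothetical placeholders.

import functools
import logging
import time

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("etl.observability")

def observe(func):
    # Hypothetical decorator: logs start, duration, output size, and failures of an ETL step.
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        logger.info("step %s started", func.__name__)
        started = time.perf_counter()
        try:
            result = func(*args, **kwargs)
        except Exception:
            logger.exception("step %s failed", func.__name__)
            raise
        elapsed = time.perf_counter() - started
        size = len(result) if hasattr(result, "__len__") else "n/a"
        logger.info("step %s finished in %.2fs (output size: %s)", func.__name__, elapsed, size)
        return result
    return wrapper

@observe
def transform(rows):
    # Placeholder transform: uppercase a text field in each row.
    return [{**row, "name": row["name"].upper()} for row in rows]

if __name__ == "__main__":
    transform([{"name": "alice"}, {"name": "bob"}])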
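
For the Airflow, BigQuery, and MySQL item above, the following is one possible minimal sketch of such a DAG, assuming Airflow 2.x with the Google Cloud and MySQL provider packages installed; the query, table names, and connection ID are placeholders, not taken from that repository.

from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator
from airflow.providers.mysql.hooks.mysql import MySqlHook
from google.cloud import bigquery


def bigquery_to_mysql():
    # Extract: run a query against BigQuery using application-default credentials.
    bq_client = bigquery.Client()
    rows = bq_client.query(
        "SELECT id, name, created_at FROM `my_project.my_dataset.events`"
    ).result()

    # Load: insert the rows into MySQL through an Airflow connection.
    mysql = MySqlHook(mysql_conn_id="mysql_default")
    mysql.insert_rows(
        table="etl_events",
        rows=[(r.id, r.name, r.created_at) for r in rows],
        target_fields=["id", "name", "created_at"],
    )


with DAG(
    dag_id="bigquery_to_mysql_example",
    start_date=datetime(2023, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    load_task = PythonOperator(task_id="bigquery_to_mysql", python_callable=bigquery_to_mysql)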