Stars
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
Pluralsight course repository for Fundamentals of Integration with Apache Camel
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Data Engineering, SQL, Exploratory Data Analysis (EDA), Machine Learning (Python), Business Intelligence (BI)
Resources for the Udemy Course - Azure Data Factory For Data Engineers - Project on Covid19 by Ramesh Retnasamy
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
A workshop for forming cross-organizational teams and planning how to achieve a growth cycle driven by machine learning
This repository provides a comprehensive ML infrastructure for CTR prediction, focusing on AWS services and offering practical learning experience for MLOps.
An orchestration platform for the development, production, and observation of data assets.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
An end-to-end implementation of intent prediction with Metaflow and other cool tools
Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis
Free MLOps course from DataTalks.Club
Accelerator of Scientific Development and Research. A project template developed by XCCV group of cvpaper.challenge.
Code for the Data Engineering Zoomcamp
Free Data Engineering course!
Streaming Anomaly Detection Solution by using Pub/Sub, Dataflow, BQML & Cloud DLP
Deskreen turns any device with a web browser into a secondary screen for your computer. ⭐️ Star to support our work!