Skip to content
View hirobo's full-sized avatar

Block or report hirobo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.

Java 5,662 4,973 Updated Jan 3, 2025
Jupyter Notebook 3,069 922 Updated Jul 9, 2024

Pluralsight course repository for Fundamentals of Integration with Apache Camel

30 32 Updated Jan 28, 2022

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,720 1,742 Updated Jan 3, 2025

Data Engineering, SQL, Exploratory Data Analysis (EDA), Machine Learning (Python), Business Intelligence (BI)

Jupyter Notebook 11 1 Updated Jul 4, 2024

Resources for the Udemy Course - Azure Data Factory For Data Engineers - Project on Covid19 by Ramesh Retnasamy

PowerShell 235 536 Updated Feb 10, 2024

[実践]データ活用システム開発ガイド スターターキット

Python 6 4 Updated Dec 2, 2022

Kafka Web UI

Java 5,651 858 Updated Jan 1, 2025

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Scala 1,223 743 Updated May 8, 2024

組織横断的にチームを組成し、機械学習による成長サイクルを実現する計画を立てるワークショップ

Jupyter Notebook 509 52 Updated Dec 17, 2024

コードで学ぶAWS入門

Jupyter Notebook 402 41 Updated Apr 1, 2023

This repository provides a comprehensive ML infrastructure for CTR prediction, focusing on AWS services and offering practical learning experience for MLOps.

Python 61 8 Updated Jul 27, 2023

An orchestration platform for the development, production, and observation of data assets.

Python 12,188 1,528 Updated Jan 3, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 16,660 4,217 Updated Jan 3, 2025

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

Jupyter Notebook 5,576 613 Updated Jan 3, 2025

An end-to-end implementation of intent prediction with Metaflow and other cool tools

Python 854 65 Updated Jun 16, 2023
Jupyter Notebook 3 Updated Apr 19, 2023

Assets related to the operation of Fishtown Analytics.

418 191 Updated Oct 18, 2024

Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis

Jupyter Notebook 26 4 Updated Nov 25, 2024

Free MLOps course from DataTalks.Club

Jupyter Notebook 11,279 2,174 Updated Sep 9, 2024

Accelerator of Scientific Development and Research. A project template developed by XCCV group of cvpaper.challenge.

Dockerfile 410 24 Updated Aug 23, 2024
Jupyter Notebook 316 170 Updated May 8, 2023

The uncompromising Python code formatter

Python 39,392 2,487 Updated Dec 30, 2024

Code for the Data Engineering Zoomcamp

Jupyter Notebook 47 67 Updated May 7, 2023

Free Data Engineering course!

Jupyter Notebook 26,153 5,579 Updated Jan 2, 2025

Streaming Anomaly Detection Solution by using Pub/Sub, Dataflow, BQML & Cloud DLP

Java 178 48 Updated May 4, 2024

DeDRM tools for ebooks

Python 14,605 1,519 Updated Aug 20, 2024

Deskreen turns any device with a web browser into a secondary screen for your computer. ⭐️ Star to support our work!

TypeScript 17,979 995 Updated Mar 21, 2023
Next