Skip to content
View SourabhKr's full-sized avatar
  • Berlin

Block or report SourabhKr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🪄 Create rich visualizations with AI

TypeScript 10,262 797 Updated Mar 21, 2025

📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics

Rust 18,364 1,819 Updated Mar 21, 2025

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

18,225 2,319 Updated Mar 18, 2025

A curated list of awesome Machine Learning frameworks, libraries and software.

Python 67,268 14,818 Updated Feb 13, 2025

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

19,826 2,743 Updated Mar 20, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 27,104 5,543 Updated Feb 22, 2025

Implementing the 4 agentic patterns from scratch

Jupyter Notebook 1,121 139 Updated Mar 18, 2025

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 9,189 1,187 Updated Mar 20, 2025

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,612 226 Updated Mar 20, 2025

This package contains macros and models to find DAG issues automatically

Shell 468 74 Updated Jan 28, 2025

Using a pre-commit hook, Talisman validates the outgoing changeset for things that look suspicious — such as tokens, passwords, and private keys.

Go 1,969 246 Updated Mar 20, 2025
Jupyter Notebook 7,057 1,190 Updated Sep 22, 2024

⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.

PLpgSQL 41 11 Updated Jan 8, 2025

Free, simple, and intuitive online database diagram editor and SQL generator.

JavaScript 26,140 1,852 Updated Mar 20, 2025

Code for "Efficient Data Processing in Spark" Course

Python 287 59 Updated Oct 1, 2024

A tool for exploring each layer in a docker image

Go 49,949 1,860 Updated Mar 18, 2025

A curated list of awesome blogs, videos, tools and resources about Data Contracts

172 20 Updated Aug 12, 2024

Open, Multi-modal Catalog for Data & AI

Python 2,737 451 Updated Mar 19, 2025

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 7,277 1,549 Updated Mar 6, 2025

Shows how the CFT modules can be composed to build a secure cloud foundation

HCL 1,294 727 Updated Mar 20, 2025

Deploys a secured BigQuery data warehouse

HCL 82 37 Updated Mar 12, 2025

A collection of learning resources for curious software engineers

Python 47,428 3,767 Updated Mar 16, 2025

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

Java 6,826 1,991 Updated Aug 20, 2024

Dataform is a framework for managing SQL based data operations in BigQuery

TypeScript 882 173 Updated Mar 13, 2025

This Dataform project processes various marketing data sources and creates a Marketing Data Store (MDS) to be used in several use cases: a)retain historical marketing data; b)create high performanc…

JavaScript 65 40 Updated Dec 13, 2024

📚 Tech blogs & talks by companies that run Apache Flink in production

167 12 Updated Jan 24, 2025

A comprehensive list of books on Software Architecture.

10,089 805 Updated Mar 15, 2023

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,890 1,800 Updated Mar 20, 2025

Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.

Jupyter Notebook 29,517 6,290 Updated Mar 17, 2025

A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!

Python 688 143 Updated Apr 16, 2022
Next