Skip to content
View SourabhKr's full-sized avatar
  • Berlin

Block or report SourabhKr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An extremely fast Python package and project manager, written in Rust.

Rust 46,854 1,319 Updated Mar 29, 2025

🪄 Create rich visualizations with AI

TypeScript 11,008 846 Updated Mar 28, 2025

📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics

Rust 18,399 1,822 Updated Mar 28, 2025

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

18,262 2,324 Updated Mar 26, 2025

A curated list of awesome Machine Learning frameworks, libraries and software.

Python 67,393 14,833 Updated Feb 13, 2025

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

19,864 2,747 Updated Mar 27, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 27,216 5,561 Updated Feb 22, 2025

Implementing the 4 agentic patterns from scratch

Jupyter Notebook 1,147 150 Updated Mar 18, 2025

This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…

Jupyter Notebook 10,733 1,362 Updated Mar 27, 2025

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,629 227 Updated Mar 28, 2025

This package contains macros and models to find DAG issues automatically

Shell 469 75 Updated Mar 22, 2025

Using a pre-commit hook, Talisman validates the outgoing changeset for things that look suspicious — such as tokens, passwords, and private keys.

Go 1,973 249 Updated Mar 28, 2025
Jupyter Notebook 7,132 1,207 Updated Sep 22, 2024

⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.

PLpgSQL 41 11 Updated Jan 8, 2025

Free, simple, and intuitive online database diagram editor and SQL generator.

JavaScript 26,272 1,857 Updated Mar 24, 2025

Code for "Efficient Data Processing in Spark" Course

Python 289 59 Updated Oct 1, 2024

A tool for exploring each layer in a docker image

Go 50,051 1,858 Updated Mar 28, 2025

A curated list of awesome blogs, videos, tools and resources about Data Contracts

172 20 Updated Aug 12, 2024

Open, Multi-modal Catalog for Data & AI

Python 2,759 455 Updated Mar 28, 2025

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 7,308 1,550 Updated Mar 28, 2025

Shows how the CFT modules can be composed to build a secure cloud foundation

HCL 1,297 729 Updated Mar 27, 2025

Deploys a secured BigQuery data warehouse

HCL 82 38 Updated Mar 12, 2025

A collection of learning resources for curious software engineers

Python 47,463 3,768 Updated Mar 28, 2025

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

Java 6,852 1,995 Updated Aug 20, 2024

Dataform is a framework for managing SQL based data operations in BigQuery

TypeScript 883 173 Updated Mar 25, 2025

This Dataform project processes various marketing data sources and creates a Marketing Data Store (MDS) to be used in several use cases: a)retain historical marketing data; b)create high performanc…

JavaScript 66 40 Updated Dec 13, 2024

📚 Tech blogs & talks by companies that run Apache Flink in production

167 12 Updated Jan 24, 2025

A comprehensive list of books on Software Architecture.

10,099 809 Updated Mar 15, 2023

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,912 1,800 Updated Mar 29, 2025

Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.

Jupyter Notebook 29,719 6,331 Updated Mar 28, 2025
Next