Stars
Benchmarks for queries over continuous data streams.
A playbook for effectively prompting post-trained LLMs
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
A curated list of Israeli-made projects, events, and individuals
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Apache Beam is a unified programming model for Batch and Streaming data processing.
Effortlessly create and manage multiple Python services and packages with this Python Mono Repo Template. Includes pre-commit hooks, GitHub Actions, Dockerfiles, and more for streamlined developmen…
Adapter for dbt that executes dbt pipelines on Apache Flink
A framework to enable multimodal models to operate a computer.
OpenTofu lets you declaratively manage your cloud infrastructure.
😎 Awesome list of tools and projects with the awesome LangChain framework
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Distributed stream processing engine in Rust
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
Python programs, usually short, of considerable difficulty, to perfect particular skills.
Example applications in Java, Python and SQL for Kinesis Data Analytics, demonstrating sources, sinks, and operators.
Unofficial FAQ and everything you've been wondering about Google Cloud Run.
The AmazonDynamoDBLockClient is a general purpose distributed locking library built on top of DynamoDB. It supports both coarse-grained and fine-grained locking.
Easily check your clusters for use of deprecated APIs
A data generator source connector for Flink SQL based on data-faker.
An implementation of Git in Scala 3 with ZIO 2, with all episodes available on YouTube 📺
The Scala API for Quantities, Units of Measure and Dimensional Analysis
More than 2000 data engineer interview questions.
✅ Highlight, list and search todo comments in your projects