Stars
A native Delta implementation for integration with any query engine
Goose is a developer agent that operates from your command line to help you do the boring stuff.
Example multi-region AWS Terraform application
The Security Reference Architecture (SRA) implements typical security features as Terraform Templates that are deployed by most high-security organizations, and enforces controls for the largest ri…
Open, Multi-modal Catalog for Data & AI
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
📚 Personal collection of ChatGPT prompts for developers!
🎨 Diagram as Code for prototyping cloud system architectures
All the bufo emojis you could possibly ask for
Examples of using Terraform to deploy Databricks resources
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Work with your web service, database, and streaming schemas in a single format.
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Standard and Advanced Demos for learn.cantrill.io courses
A tool (and pre-commit hook) to automatically upgrade syntax for newer versions of the language.
A tool for refurbishing and modernizing Python codebases
A list of useful resources to learn Data Engineering from scratch
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
CLI tool to generate terraform files from existing infrastructure (reverse Terraform). Infrastructure to Code
Self-contained examples using Apache Spark with the functional features of Java 8
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.