Stars
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
The data-validation toolkit for enhanced dbt (data build tool) PR review
✍️ dbt doc generator for advanced data teams
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool f…
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
tbls is a CI-friendly tool for documenting a database, written in Go.
A curated list of awesome posts, videos, and articles on leading a data team (small and large)
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
Malloy is an experimental language for describing data relationships and transformations.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
An Open Standard for lineage metadata collection
Advent of Code 2021 using SQL (PostgreSQL-flavored)
A Go-based static site generator that compiles brandur.org.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Python library providing function decorators for configurable backoff and retry
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
TensorRT MODNet, YOLOv4, YOLOv3, SSD, MTCNN, and GoogLeNet
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that lets you read, publish, proxy, record, and play back video and audio streams.
Easily generate flowcharts and diagrams from text ⿻
This repository contains the code base for the Open Stream Processing Benchmark.
Analyze treatment differences of certain groups of FLOSS contributors based on their issues and PRs.
Quantified Self Personal Data Aggregator and Data Analysis