Stars
Feasible cost konfigurable NAT: An AWS NAT Instance AMI
A library for building fast, reliable and evolvable network services.
Apache DataFusion Comet Spark Accelerator
Implementation of Apache ORC file format use Apache Arrow in-memory format
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
JunoDB is PayPal's home-grown secure, consistent and highly available key-value store providing low, single digit millisecond, latency at any scale.
Convert sequences of Rust objects to Arrow tables
The OpenTF Manifesto expresses concern over HashiCorp's switch of the Terraform license from open-source to the Business Source License (BSL) and calls for the tool's return to a truly open-source …
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
Python bindings and arrow integration for the rust object_store crate.
Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.
ParseableDB is a disk less, cloud native database for observability and security. Parseable is the Observability platform built with ParseableDB
Cookbook with recipes for datafusion
Demonstration of how to use the rust object_store crate https://crates.io/crates/object_store
Batteries included CLI, TUI, and server implementations for DataFusion.
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
A highly efficient daemon for streaming data from Kafka into Delta Lake
A native Rust library for Delta Lake, with bindings into Python
Tools for concurrent programming in Rust
DuckDB is an analytical in-process SQL database management system
Auto-generate serde implementations for prost types