Stars
Open, Multi-modal Catalog for Data & AI
OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
A cross-platform file change monitor with multiple backends: Apple macOS File System Events, *BSD kqueue, Solaris/Illumos File Events Notification, Linux inotify, Microsoft Windows and a stat()-bas…
A command line tool to execute commands when files are modified.
Open Source Continuous File Synchronization
stdgpu: Efficient STL-like Data Structures on the GPU
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
picolibc - a C library designed for embedded 32- and 64- bit systems.
Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk
Columnar storage extension for Postgres built as a foreign data wrapper. Check out https://github.com/citusdata/citus for a modernized columnar storage implementation built as a table access method.
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
cloc counts blank lines, comment lines, and physical lines of source code in many programming languages.
Sloc, Cloc and Code: scc is a very fast accurate code counter with complexity calculations and COCOMO estimates written in pure Go
Source code counter and metrics tool for C++, C, and Java
Metrix++ is an extendable tool for code metrics collection and analysis.
Implements the [TPCH benchmark](http://www.tpc.org/tpch/) for Postgres
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The fastest feature-rich C++11/14/17/20/23 single-header testing framework
Remove unnecessary includes from C/C++ source files