A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).

C 817 38 Updated Jan 3, 2025

GraphBLAS / GraphBLAS-Pointers

Resources on the GraphBLAS standard for graph algorithms in the language of linear algebra

189 11 Updated Oct 22, 2024

microsoft / TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Python 5,431 694 Updated Dec 24, 2024

prrao87 / duckdb-study

Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies

Python 31 2 Updated Aug 31, 2023

HigherOrderCO / Bend

A massively parallel, high-level programming language

Rust 17,849 439 Updated Dec 26, 2024

kyegomez / zeta

Build high-performance AI models with modular building blocks

Python 451 45 Updated Dec 23, 2024

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 1,689 169 Updated Jan 4, 2025

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,894 888 Updated Oct 3, 2024

ashvardanian / stringzilla-benchmarks-rs

Comparing performance-oriented string-processing libraries for substring search, multi-pattern matching, hashing, and Levenshtein edit-distance calculations

Rust 45 3 Updated Dec 8, 2024

ashvardanian / StringZilla

Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖

C++ 2,319 83 Updated Dec 26, 2024

blazzbyte / OpenInterpreterUI

Simplify code execution with Open Interpreter UI Project with Streamlit. A user-friendly GUI for Python, JavaScript, and more. Pay-as-you-go, no subscriptions. Ideal for beginners.

Python 194 84 Updated Mar 3, 2024

YuchuanTian / DiJiang

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.

Python 100 6 Updated Jun 14, 2024

City-Form-Lab / madina

A Python library modeling pedestrian and bicycle trips over networks.

Jupyter Notebook 169 12 Updated Apr 10, 2024

blackary / streamlit-keyup

Streamlit text input that returns value on keyup

Python 179 23 Updated Jan 2, 2025

koaning / embetter

just a bunch of useful embeddings for scikit-learn pipelines

Python 473 15 Updated Dec 18, 2024

mlco2 / codecarbon

Track emissions from Compute and recommend ways to reduce their impact on the environment.

Python 1,218 182 Updated Jan 3, 2025

fusedio / udfs

Public Fused UDFs. Build any scale workflows with the Fused Python SDK and Workbench webapp, and integrate them into your stack with the Fused Hosted API.

Python 213 43 Updated Jan 4, 2025

se-jaeger / conformal-data-cleaning

Code for the AISTATS 2024 Paper "From Data Imputation to Data Cleaning - Automated Cleaning of Tabular Data Improves Downstream Predictive Performance"

Python 19 2 Updated Feb 14, 2024