NLPShenanigans

Starred repositories

The only reliable agent framework built on top of the latest OpenAI Assistants API.

Python 3,042 796 Updated Dec 26, 2024

A system for agentic LLM-powered data processing and ETL

Python 1,404 126 Updated Jan 3, 2025

A repository to capture my experiments using LLMs and Langgraph for AgenticAI

Jupyter Notebook 1 Updated Oct 11, 2024

Sloc, Cloc and Code: scc is a very fast accurate code counter with complexity calculations and COCOMO estimates written in pure Go

Go 6,917 265 Updated Jan 3, 2025

LOTUS: A semantic query engine for fast and easy LLM-powered data processing

Python 896 70 Updated Dec 27, 2024

Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".

Python 207 18 Updated Aug 25, 2024

FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs)

C++ 89 4 Updated Dec 25, 2024

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,815 459 Updated Jan 4, 2025

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

Zig 1,773 66 Updated Jan 3, 2025

A repository of simple Python examples for use with the PLEXOS API

Python 73 32 Updated Jun 26, 2024

A super fast graph database that uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).

C 817 38 Updated Jan 3, 2025

Resources on the GraphBLAS standard for graph algorithms in the language of linear algebra

189 11 Updated Oct 22, 2024

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Python 5,431 694 Updated Dec 24, 2024

Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies

Python 31 2 Updated Aug 31, 2023

A massively parallel, high-level programming language

Rust 17,849 439 Updated Dec 26, 2024

Build high-performance AI models with modular building blocks

Python 451 45 Updated Dec 23, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,689 169 Updated Jan 4, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,894 888 Updated Oct 3, 2024

Comparing performance-oriented string-processing libraries for substring search, multi-pattern matching, hashing, and Levenshtein edit-distance calculations

Rust 45 3 Updated Dec 8, 2024

Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc 🦖

C++ 2,319 83 Updated Dec 26, 2024

Simplify code execution with Open Interpreter UI Project with Streamlit. A user-friendly GUI for Python, JavaScript, and more. Pay-as-you-go, no subscriptions. Ideal for beginners.

Python 194 84 Updated Mar 3, 2024

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.

Python 100 6 Updated Jun 14, 2024

A Python library modeling pedestrian and bicycle trips over networks.

Jupyter Notebook 169 12 Updated Apr 10, 2024

Streamlit text input that returns value on keyup

Python 179 23 Updated Jan 2, 2025

just a bunch of useful embeddings for scikit-learn pipelines

Python 473 15 Updated Dec 18, 2024

Track emissions from Compute and recommend ways to reduce their impact on the environment.

Python 1,218 182 Updated Jan 3, 2025

Public Fused UDFs. Build any scale workflows with the Fused Python SDK and Workbench webapp, and integrate them into your stack with the Fused Hosted API.

Python 213 43 Updated Jan 4, 2025

Code for the AISTATS 2024 Paper "From Data Imputation to Data Cleaning - Automated Cleaning of Tabular Data Improves Downstream Predictive Performance"

Python 19 2 Updated Feb 14, 2024

Easily embed, cluster and semantically label text datasets

Python 481 39 Updated Mar 28, 2024

parquet file parser for javascript

JavaScript 248 6 Updated Dec 21, 2024