libxev is a cross-platform, high-performance event loop that provides abstractions for non-blocking IO, timers, events, and more and works on Linux (io_uring or epoll), macOS (kqueue), and Wasm + W…

Zig 2,622 111 Updated Mar 14, 2025

block / goose

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 11,140 809 Updated Apr 1, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,456 244 Updated Apr 1, 2025

foundation-model-stack / fastsafetensors

High-performance safetensors model loader

Python 17 4 Updated Mar 28, 2025

guidance-ai / llguidance

Super-fast Structured Outputs

Rust 176 23 Updated Mar 31, 2025

QwenLM / QwQ

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

Python 411 10 Updated Mar 27, 2025

clockworklabs / SpacetimeDB

Multiplayer at the speed of light

Rust 11,032 378 Updated Apr 1, 2025

likenneth / honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Python 516 41 Updated Jan 28, 2025

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 824 55 Updated Mar 19, 2025

mastra-ai / mastra

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

TypeScript 11,520 559 Updated Apr 1, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,766 722 Updated Apr 1, 2025

tenstorrent / tt-metal

🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.

C++ 687 125 Updated Apr 1, 2025

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,450 825 Updated Mar 30, 2025

deepseek-ai / EPLB

Expert Parallelism Load Balancer

Python 1,116 180 Updated Mar 24, 2025

EleutherAI / sparsify

Sparsify transformers with SAEs and transcoders

Python 499 66 Updated Mar 28, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

C++ 11,397 813 Updated Mar 1, 2025

bloomberg / ts-blank-space

A small, fast, pure JavaScript type-stripper that uses the official TypeScript parser.

TypeScript 705 17 Updated Mar 3, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,957 232 Updated Mar 4, 2025

ArcInstitute / evo2

Genome modeling and design across all domains of life

Jupyter Notebook 2,630 265 Updated Mar 20, 2025

n0-computer / iroh

peer-2-peer that just works

Rust 4,427 222 Updated Mar 31, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,970 517 Updated Apr 1, 2025

wg / wrk

Modern HTTP benchmarking tool

C 38,665 2,976 Updated Dec 30, 2023

kedacore / keda

KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes

Go 8,909 1,122 Updated Apr 1, 2025

aidenybai / react-scan

Scan for React performance issues and eliminate slow renders in your app

TypeScript 17,418 259 Updated Mar 28, 2025

bentoml / BentoVLLM

Self-host LLMs with vLLM and BentoML

Python 97 15 Updated Mar 27, 2025

huggingface / smolagents

🤗 smolagents: a barebones library for agents that think in python code.

Python 16,179 1,434 Updated Apr 1, 2025

scikit-learn

Shell

Security

Serverless

Sass

Rust

Redux

React Native

ReactiveUI

React

See all starred topics

Aaron Pham aarnphm

Highlights

Lists (3)

ai-apps

🔮 Future ideas

research

Starred repositories