Skip to content
View aarnphm's full-sized avatar
:shipit:
\xff\x88
:shipit:
\xff\x88

Block or report aarnphm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

exploration WYSIWYG editor

Markdown 6 Updated Apr 1, 2025

🇨🇭 A React renderer for Three.js

TypeScript 28,551 1,664 Updated Mar 31, 2025

The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)

Rust 2,803 110 Updated Apr 1, 2025

seqax = sequence modeling + JAX

Python 151 14 Updated Mar 17, 2025

libxev is a cross-platform, high-performance event loop that provides abstractions for non-blocking IO, timers, events, and more and works on Linux (io_uring or epoll), macOS (kqueue), and Wasm + W…

Zig 2,622 111 Updated Mar 14, 2025

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 11,140 809 Updated Apr 1, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,456 244 Updated Apr 1, 2025

High-performance safetensors model loader

Python 17 4 Updated Mar 28, 2025

Super-fast Structured Outputs

Rust 176 23 Updated Mar 31, 2025

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

Python 411 10 Updated Mar 27, 2025

Multiplayer at the speed of light

Rust 11,032 378 Updated Apr 1, 2025

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Python 516 41 Updated Jan 28, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 824 55 Updated Mar 19, 2025

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

TypeScript 11,520 559 Updated Apr 1, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 10,766 722 Updated Apr 1, 2025

🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.

C++ 687 125 Updated Apr 1, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,450 825 Updated Mar 30, 2025

Expert Parallelism Load Balancer

Python 1,116 180 Updated Mar 24, 2025

Sparsify transformers with SAEs and transcoders

Python 499 66 Updated Mar 28, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,397 813 Updated Mar 1, 2025

A small, fast, pure JavaScript type-stripper that uses the official TypeScript parser.

TypeScript 705 17 Updated Mar 3, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,957 232 Updated Mar 4, 2025

Genome modeling and design across all domains of life

Jupyter Notebook 2,630 265 Updated Mar 20, 2025

peer-2-peer that just works

Rust 4,427 222 Updated Mar 31, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,970 517 Updated Apr 1, 2025

Modern HTTP benchmarking tool

C 38,665 2,976 Updated Dec 30, 2023

KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes

Go 8,909 1,122 Updated Apr 1, 2025

Scan for React performance issues and eliminate slow renders in your app

TypeScript 17,418 259 Updated Mar 28, 2025

Self-host LLMs with vLLM and BentoML

Python 97 15 Updated Mar 27, 2025

🤗 smolagents: a barebones library for agents that think in python code.

Python 16,179 1,434 Updated Apr 1, 2025
Next