Multi-platform high-performance compute language extension for Rust.

Rust 872 47 Updated Feb 26, 2025

A no_std + serde compatible message library for Rust

Rust 1,032 98 Updated Feb 11, 2025

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 231 8 Updated Feb 23, 2025
Rust 21 3 Updated Feb 25, 2025

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 54,838 3,615 Updated Feb 27, 2025

An easy to use leptos component library

Rust 396 52 Updated Feb 27, 2025

Material for gpu-mode lectures

Jupyter Notebook 3,827 389 Updated Feb 9, 2025

A PyTorch native library for large model training

Python 3,357 293 Updated Feb 27, 2025

Analyze ELF binaries like a boss 😼🕵️‍♂️

Rust 3,016 71 Updated Feb 24, 2025
Rust 7 3 Updated Feb 27, 2025

Servo aims to empower developers with a lightweight, high-performance alternative for embedding web technologies in applications.

Rust 29,397 3,088 Updated Feb 27, 2025

A web browser that plays old world blues to build new world hope

Rust 4,855 156 Updated Feb 27, 2025

A convenient LD_PRELOAD hooker

Rust 12 2 Updated Feb 26, 2022

pytrace is a fast Python tracer. It records function calls, arguments, and return values, and can be used for debugging and profiling.

C 447 21 Updated Apr 18, 2016

Collection of crates to deal with crashes

Rust 145 14 Updated Aug 2, 2024

Implementation for MatMul-free LM.

Python 2,964 187 Updated Nov 5, 2024

A debugging toolset and library for debugging embedded ARM and RISC-V targets on a separate host

Rust 1,975 408 Updated Feb 27, 2025

Schedule-Free Optimization in PyTorch

Python 2,101 71 Updated Dec 2, 2024

How to optimize some algorithms in CUDA.

Cuda 1,923 168 Updated Feb 26, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,202 1,164 Updated May 23, 2024

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 551 33 Updated Feb 21, 2025

A scalable, distributed, collaborative, document-graph database, for the realtime web

Rust 28,777 961 Updated Feb 26, 2025

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 983 151 Updated Feb 18, 2025

A PyTorch Native LLM Training Framework

Python 736 41 Updated Dec 27, 2024

A library for building fast, reliable and evolvable network services.

Rust 23,324 1,320 Updated Feb 22, 2025

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 7,007 1,939 Updated Feb 26, 2025

LLM training in simple, raw C/CUDA

Cuda 25,819 2,958 Updated Oct 2, 2024

A GPU-driven system framework for scalable AI applications

C++ 112 17 Updated Feb 5, 2025

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 202 30 Updated Sep 23, 2024