reiase

Follow

Reiase reiase

Follow

20 followers · 13 following

Shanghai

Achievements

Achievements

Lists (1)

Sort

🚀 My stack

Stars

tracel-ai / cubecl

Multi-platform high-performance compute language extension for Rust.

Rust 872 47 Updated Feb 26, 2025

jamesmunns / postcard

A no_std + serde compatible message library for Rust

Rust 1,032 98 Updated Feb 11, 2025

HanGuo97 / flute

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 231 8 Updated Feb 23, 2025

n0-computer / iroh-gossip

Rust 21 3 Updated Feb 25, 2025

deepseek-ai / DeepSeek-V3

Python 89,591 14,441 Updated Feb 24, 2025

zed-industries / zed

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 54,838 3,615 Updated Feb 27, 2025

thaw-ui / thaw

An easy to use leptos component library

Rust 396 52 Updated Feb 27, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 3,827 389 Updated Feb 9, 2025

pytorch / torchtitan

A PyTorch native library for large model training

Python 3,357 293 Updated Feb 27, 2025

orhun / binsider

Analyze ELF binaries like a boss 😼🕵️‍♂️

Rust 3,016 71 Updated Feb 24, 2025

reiase / probing

Rust 7 3 Updated Feb 27, 2025

servo / servo

Servo aims to empower developers with a lightweight, high-performance alternative for embedding web technologies in applications.

Rust 29,397 3,088 Updated Feb 27, 2025

versotile-org / verso

A web browser that plays old world blues to build new world hope

Rust 4,855 156 Updated Feb 27, 2025

koute / hooky

A convenient LD_PRELOAD hooker

Rust 12 2 Updated Feb 26, 2022

alonho / pytrace

pytrace is a fast python tracer. it records function calls, arguments and return values. can be used for debugging and profiling.

C 447 21 Updated Apr 18, 2016

EmbarkStudios / crash-handling

Collection of crates to deal with crashes

Rust 145 14 Updated Aug 2, 2024

ridgerchu / matmulfreellm

Implementation for MatMul-free LM.

Python 2,964 187 Updated Nov 5, 2024

probe-rs / probe-rs

A debugging toolset and library for debugging embedded ARM and RISC-V targets on a separate host

Rust 1,975 408 Updated Feb 27, 2025

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,101 71 Updated Dec 2, 2024

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 1,923 168 Updated Feb 26, 2025

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,202 1,164 Updated May 23, 2024

mit-han-lab / omniserve

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 551 33 Updated Feb 21, 2025

surrealdb / surrealdb

A scalable, distributed, collaborative, document-graph database, for the realtime web

Rust 28,777 961 Updated Feb 26, 2025

NVIDIA / gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 983 151 Updated Feb 18, 2025

volcengine / veScale

A PyTorch Native LLM Training Framework

Python 736 41 Updated Dec 27, 2024

cloudflare / pingora

A library for building fast, reliable and evolvable network services.

Rust 23,324 1,320 Updated Feb 22, 2025

NVIDIA / cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 7,007 1,939 Updated Feb 26, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 25,819 2,958 Updated Oct 2, 2024

microsoft / ark

A GPU-driven system framework for scalable AI applications

C++ 112 17 Updated Feb 5, 2025

microsoft / microxcaling

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 202 30 Updated Sep 23, 2024