Stars
Tutel MoE: An Optimized Mixture-of-Experts Implementation (a generic gating sketch follows this list)
MSCCL++: A GPU-driven communication stack for scalable AI applications
A throughput-oriented high-performance serving framework for LLMs
Synchronization and asynchronous computation package for Go
[ICLR 2025] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
Borgo is a statically typed language that compiles to Go.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
GLake: optimizing GPU memory management and IO transmission.
A fast inference library for running LLMs locally on modern consumer-class GPUs
Ring attention implementation with flash attention (a single-process sketch of the idea follows this list)
Implementation of MagViT2 Tokenizer in Pytorch
Chat凉宫春日 (Chat Haruhi Suzumiya): an open-source role-playing chatbot, by Cheng Li, Ziang Leng, and others.
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
Development repository for the Triton language and compiler (a minimal kernel example follows this list)
Hackable and optimized Transformers building blocks, supporting a composable construction.
Implementation of a Transformer, but completely in Triton
Unsupervised text tokenizer focused on computational efficiency (a usage sketch follows this list)
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Efficient cache for gigabytes of data, written in Go.
A library to analyze PyTorch traces.
asyncio is a C++20 library for writing concurrent code using the async/await syntax.
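
The Tutel entry above is about mixture-of-experts routing. As a point of reference, here is a generic top-2 gating sketch in plain PyTorch; it illustrates the technique only and is not Tutel's API (`top2_gate` and all tensor shapes are made up for illustration).

```python
# Generic top-2 MoE gating in plain PyTorch -- an illustrative sketch,
# NOT Tutel's API. Each token picks its two highest-scoring experts.
import torch
import torch.nn.functional as F

def top2_gate(x, w_gate):
    """x: [tokens, d_model], w_gate: [d_model, n_experts] (hypothetical shapes)."""
    logits = x @ w_gate                                 # per-token expert scores
    probs = F.softmax(logits, dim=-1)
    weights, experts = probs.topk(2, dim=-1)            # top-2 experts per token
    weights = weights / weights.sum(-1, keepdim=True)   # renormalize the pair
    return weights, experts                             # used to dispatch/combine

weights, experts = top2_gate(torch.randn(4, 16), torch.randn(16, 8))
```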
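For the ring-attention entry, the sketch below simulates the core idea in one process: K/V are split into shards, each step attends to one shard, and partial results are merged with a numerically stable running softmax. In the real algorithm the shards sit on different devices and rotate around a ring so communication overlaps compute; `ring_attention` is a hypothetical name, not the repo's API.

```python
# Single-process sketch of ring attention's streaming-softmax merge.
# Hypothetical illustration; the actual repo distributes shards across GPUs.
import torch

def ring_attention(q, kv_shards, scale):
    acc = torch.zeros_like(q)                        # running weighted sum of V
    denom = torch.zeros(q.shape[0], 1)               # running softmax denominator
    m = torch.full((q.shape[0], 1), float("-inf"))   # running max logit
    for k, v in kv_shards:                           # one "ring step" per shard
        s = (q @ k.T) * scale
        m_new = torch.maximum(m, s.max(-1, keepdim=True).values)
        correction = torch.exp(m - m_new)            # rescale earlier partials
        p = torch.exp(s - m_new)
        acc = acc * correction + p @ v
        denom = denom * correction + p.sum(-1, keepdim=True)
        m = m_new
    return acc / denom

q = torch.randn(4, 8)
shards = [(torch.randn(6, 8), torch.randn(6, 8)) for _ in range(3)]
out = ring_attention(q, shards, scale=8 ** -0.5)     # matches full attention
```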
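For the Triton entry, this is the canonical vector-add kernel from Triton's public tutorials, trimmed to the essentials; it requires the `triton` package and a CUDA-capable GPU.

```python
# Minimal Triton kernel: elementwise vector add, one program per 1024-element block.
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements                      # guard out-of-bounds lanes
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = out.numel()
    grid = (triton.cdiv(n, 1024),)                   # number of program instances
    add_kernel[grid](x, y, out, n, BLOCK_SIZE=1024)
    return out
```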
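The tokenizer entry reads like Google's SentencePiece; assuming that is the repo, training and encoding look roughly like this (`corpus.txt`, the vocab size, and the model prefix are placeholders).

```python
# Hedged usage sketch assuming the repo is SentencePiece (pip install sentencepiece).
import sentencepiece as spm

# Train a small BPE model on a plain-text corpus; one sentence per line.
spm.SentencePieceTrainer.train(
    input="corpus.txt", model_prefix="m", vocab_size=8000, model_type="bpe"
)

sp = spm.SentencePieceProcessor(model_file="m.model")
print(sp.encode("Hello world", out_type=str))        # subword pieces
print(sp.encode("Hello world"))                      # integer ids
```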