ywang96

Roger Wang ywang96

109 followers · 28 following

11:42 (UTC -07:00)
in/rogerywang
@rogerw0108

Achievements

x3 x4

Achievements

x3 x4

Organizations

Stars

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,714 288 Updated Apr 16, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,027 141 Updated Apr 15, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

C++ 11,439 821 Updated Mar 1, 2025

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 2,105 187 Updated Apr 14, 2025

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

Python 3,353 319 Updated Nov 13, 2024

hu-po / docs

documentation for content creation

HTML 192 19 Updated Feb 13, 2025

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,053 46 Updated Mar 18, 2025

NVIDIA / cutlass

CUDA Templates for Linear Algebra Subroutines

C++ 7,294 1,196 Updated Apr 10, 2025

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 3,840 287 Updated Feb 14, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 45,031 6,905 Updated Apr 16, 2025

vllm-project / vllm-nccl

Manages vllm-nccl dependency

Python 17 3 Updated Jun 3, 2024

zeux / calm

CUDA/Metal accelerated language model inference

C 540 25 Updated Apr 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Roger Wang ywang96

Achievements

Achievements

Organizations

Block or report ywang96

Stars

ai-dynamo / dynamo

hiyouga / EasyR1

deepseek-ai / FlashMLA

BBuf / how-to-optim-algorithm-in-cuda

xjdr-alt / entropix

hu-po / docs

lucidrains / transfusion-pytorch

NVIDIA / cutlass

fixie-ai / ultravox

vllm-project / vllm

vllm-project / vllm-nccl

zeux / calm