LittleQili

Follow

🎵

Hope music's always there with you.

Yijia Diao LittleQili

🎵

Hope music's always there with you.

Follow

PhD student at CSE, SJTU.

70 followers · 125 following

Shanghai Jiao Tong University
Shanghai, China

Achievements

Achievements

Highlights

Pro

Organizations

Lists (3)

Sort

AI Algo

DL compiler

HW&Lib

Starred repositories

sarchlab / akita

A flexible, high-performance, user-friendly computer architecture simulator engine

Go 74 18 Updated May 22, 2025

sarchlab / mgpusim

A highly-flexible GPU simulator for AMD GPUs.

Go 147 31 Updated May 22, 2025

ChampSim / ChampSim

ChampSim is an open-source trace based simulator maintained at Texas A&M University and through the support of the computer architecture community.

C++ 595 497 Updated May 22, 2025

ucb-bar / chipyard

An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more

Scala 1,857 715 Updated May 19, 2025

chipsalliance / rocket-chip

Rocket Chip Generator

Scala 3,449 1,168 Updated Apr 17, 2025

alibaba / clusterdata

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 1,759 426 Updated Apr 11, 2025

Synthesys-Lab / assassyn

Asynchronous semantics for architectural simulation and synthesis.

Rust 26 4 Updated May 19, 2025

accel-sim / accel-sim-framework

This is the top-level repository for the Accel-Sim framework.

Python 408 144 Updated May 20, 2025

alibaba / open-simulator

K8s cluster simulator for capacity planning

Go 257 53 Updated May 26, 2023

umd-memsys / DRAMsim3

DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator

C++ 370 155 Updated Aug 3, 2024

pilotmoon / Scroll-Reverser

Per-device scrolling prefs on macOS.

Objective-C 2,953 143 Updated Jun 21, 2024

CMU-SAFARI / ramulator2

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 335 83 Updated May 7, 2025

Mellanox / nccl-rdma-sharp-plugins

RDMA and SHARP plugins for nccl library

C 193 34 Updated Apr 8, 2025

openucx / ucc

Unified Collective Communication Library

C 252 110 Updated May 22, 2025

OpenMOSS / VLABench

Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.

Python 223 5 Updated Apr 30, 2025

CMU-SAFARI / ramulator

A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…

C++ 636 214 Updated Aug 29, 2023

Yufeng98 / CENT

Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025

Python 61 8 Updated May 3, 2025

uccl-project / uccl

Ultra | Ultimate | Unified CCL

C++ 71 3 Updated May 23, 2025

tinygrad / open-gpu-kernel-modules

Forked from NVIDIA/open-gpu-kernel-modules

NVIDIA Linux open GPU with P2P support

C 1,147 109 Updated May 5, 2025

microsoft / NPKit

NCCL Profiling Kit

Python 134 12 Updated Jul 1, 2024

uxlfoundation / oneCCL

oneAPI Collective Communications Library (oneCCL)

C++ 234 80 Updated May 20, 2025

hongzhangblaze / CS854-F24

37 3 Updated Nov 1, 2024

conwnet / github1s

One second to read GitHub code with VS Code.

TypeScript 23,058 890 Updated Apr 29, 2025

sjfeng1999 / gpu-arch-microbenchmark

Dissecting NVIDIA GPU Architecture

Cuda 95 29 Updated Jul 11, 2022

zhouchenlin2096 / Awesome-Spiking-Neural-Networks

A paper list of spiking neural networks, including papers, codes, and related websites. 本仓库收集脉冲神经网络相关的顶会顶刊论文和代码，正在持续更新中。

452 40 Updated Apr 27, 2025

MooreThreads / torch_musa

torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.

Python 397 29 Updated May 8, 2025

NVIDIA / nvbandwidth

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 429 37 Updated Apr 15, 2025

deathwings602 / Unified-IR

面向多平台编译优化的深度学习中间表示

10 Updated Oct 28, 2024

ggml-org / llama.cpp

LLM inference in C/C++

C++ 80,718 11,872 Updated May 23, 2025

mirage-project / mirage

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 838 56 Updated May 23, 2025

Starred topics

Algorithm

C++

unreal-engine