LittleQili
🎵
Hope music's always there with you.
  • Shanghai Jiao Tong University
  • Shanghai, China

Highlights

  • Pro

Organizations

@SJTU-CSE

Starred repositories

A flexible, high-performance, user-friendly computer architecture simulator engine

Go 74 18 Updated May 22, 2025

A highly-flexible GPU simulator for AMD GPUs.

Go 147 31 Updated May 22, 2025

ChampSim is an open-source trace based simulator maintained at Texas A&M University and through the support of the computer architecture community.

C++ 595 497 Updated May 22, 2025

An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more

Scala 1,857 715 Updated May 19, 2025

Rocket Chip Generator

Scala 3,449 1,168 Updated Apr 17, 2025

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 1,759 426 Updated Apr 11, 2025

Asynchronous semantics for architectural simulation and synthesis.

Rust 26 4 Updated May 19, 2025

This is the top-level repository for the Accel-Sim framework.

Python 408 144 Updated May 20, 2025

K8s cluster simulator for capacity planning

Go 257 53 Updated May 26, 2023

DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator

C++ 370 155 Updated Aug 3, 2024

Per-device scrolling prefs on macOS.

Objective-C 2,953 143 Updated Jun 21, 2024

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 335 83 Updated May 7, 2025

RDMA and SHARP plugins for nccl library

C 193 34 Updated Apr 8, 2025

Unified Collective Communication Library

C 252 110 Updated May 22, 2025

Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.

Python 223 5 Updated Apr 30, 2025

A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the…

C++ 636 214 Updated Aug 29, 2023

Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025

Python 61 8 Updated May 3, 2025

Ultra | Ultimate | Unified CCL

C++ 71 3 Updated May 23, 2025

NVIDIA Linux open GPU with P2P support

C 1,147 109 Updated May 5, 2025

NCCL Profiling Kit

Python 134 12 Updated Jul 1, 2024

oneAPI Collective Communications Library (oneCCL)

C++ 234 80 Updated May 20, 2025

One second to read GitHub code with VS Code.

TypeScript 23,058 890 Updated Apr 29, 2025

Dissecting NVIDIA GPU Architecture

Cuda 95 29 Updated Jul 11, 2022

A paper list of spiking neural networks, including papers, codes, and related websites. This repository collects papers and code on spiking neural networks from top conferences and journals, and is continuously updated.

452 40 Updated Apr 27, 2025

torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.

Python 397 29 Updated May 8, 2025

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 429 37 Updated Apr 15, 2025

A deep learning intermediate representation for multi-platform compilation optimization

10 Updated Oct 28, 2024

LLM inference in C/C++

C++ 80,718 11,872 Updated May 23, 2025

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 838 56 Updated May 23, 2025