Skip to content
View zxzx9898's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report zxzx9898

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A highly-flexible GPU simulator for AMD GPUs.

Go 147 31 Updated May 22, 2025

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,332 554 Updated Feb 15, 2025

A collection of tools, code, and documentation to understand the host network on real server hardware.

Python 35 3 Updated Dec 1, 2024

DeepSeek-V3/R1 inference performance simulator

Jupyter Notebook 129 17 Updated Mar 27, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,081 374 Updated May 23, 2025

🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.

Python 6,955 577 Updated Apr 18, 2025
Python 65 6 Updated Apr 2, 2025

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】

Jupyter Notebook 10,238 1,249 Updated May 17, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,260 188 Updated May 23, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,117 74 Updated May 22, 2025

Analyze computation-communication overlap in V3/R1.

1,039 142 Updated Mar 21, 2025
Python 45 5 Updated Jun 27, 2024

【三年面试五年模拟】AIGC算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。

1,635 208 Updated May 22, 2025

A curated list of resources for using LLMs to develop more competitive grant applications.

Python 3,556 457 Updated Mar 1, 2024
Python 307 41 Updated Aug 20, 2024

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1,526 96 Updated Apr 4, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.

Python 4,027 277 Updated May 18, 2025

Microsoft Azure Traces

Jupyter Notebook 924 158 Updated Feb 25, 2025

关于2024年CS保研实验室/导师招生广告的汇总。欢迎想要打广告的小伙伴积极PR,资瓷一下互联网精神吼不吼啊?

418 46 Updated Sep 29, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 11,506 888 Updated Mar 11, 2025

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 592 61 Updated Apr 6, 2025

collection of benchmarks to measure basic GPU capabilities

C++ 374 51 Updated Feb 11, 2025
HTML 197 33 Updated Jan 2, 2025
Python 30 5 Updated Jun 7, 2024

2020 PTA History Test Questions

Java 14 19 Updated Feb 7, 2023

A large-scale simulation framework for LLM inference

Python 375 64 Updated Nov 19, 2024

Transformer Encoder PyTorch note

Jupyter Notebook 109 8 Updated Jun 20, 2023

对llama3进行全参微调、lora微调以及qlora微调。

Python 197 16 Updated Oct 4, 2024
Next