Lists (1)
Sort Name ascending (A-Z)
Stars
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…
A collection of tools, code, and documentation to understand the host network on real server hardware.
DeepSeek-V3/R1 inference performance simulator
A Datacenter Scale Distributed Inference Serving Framework
🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
Analyze computation-communication overlap in V3/R1.
【三年面试五年模拟】AIGC算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
A curated list of resources for using LLMs to develop more competitive grant applications.
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.
CS-BAOYAN / CSLabInfo2024
Forked from CS-BAOYAN/CSLabInfo2023关于2024年CS保研实验室/导师招生广告的汇总。欢迎想要打广告的小伙伴积极PR,资瓷一下互联网精神吼不吼啊?
The official GitHub page for the survey paper "A Survey of Large Language Models".
Disaggregated serving system for Large Language Models (LLMs).
collection of benchmarks to measure basic GPU capabilities
A large-scale simulation framework for LLM inference
Transformer Encoder PyTorch note