-
Paddle Public
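Forked from PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)
C++ Apache License 2.0 Updated Oct 7, 2024
-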
parafuser Public
Parallel inference and training of diffusion models (UNet or Transformer backbone) using my custom methods alongside other open-source repositories.
-
LLM-Viewer Public
Forked from hahnyuan/LLM-Viewer
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.
Python MIT License Updated Sep 11, 2024
-
Awesome-SD-Distributed-Inference Public
Forked from DefTruth/Awesome-SD-Inference
📖 A small curated list of Awesome SD/DiT/ViT/Diffusion Distributed/Caching Inference papers with code, such as DistriFusion, PipeFusion, AsyncDiff, DeepCache, etc.
GNU General Public License v3.0 Updated Jul 28, 2024
-
xDiT Public
Forked from xdit-project/xDiT
A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters
Python Apache License 2.0 Updated Jul 27, 2024
-
hello-algo Public
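Forked from krahets/hello-algo
"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Code is available in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.
Java Other Updated Jul 27, 2024
-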
AISystem Public
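Forked from chenzomi12/AISystem
AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.
Jupyter Notebook Apache License 2.0 Updated Jul 14, 2024
-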
pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other Updated Jul 9, 2024
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
-
safetensors Public
Forked from huggingface/safetensors
Simple, safe way to store and distribute tensors
Python Apache License 2.0 Updated Jul 6, 2024
-
AI-System Public
Forked from microsoft/AI-System
System for AI Education Resource.
Python Creative Commons Attribution 4.0 International Updated Jun 21, 2024
-
MeSolver Public
Forked from FuncJ/MeSolver
The repository maintains the source code for the article titled "Characterize and Optimize Dense Linear Solver on Multi-core CPUs."
Updated Jun 18, 2024
-
MeAtten Public
Forked from FuncJ/MeAtten
The repository maintains the source code for the article titled "Optimizing Attention by Exploiting Data Reuse on ARM Multi-core CPUs."
Makefile Updated Jun 13, 2024
-
mnn-Qwen_1.8B Public
AICAS Grand Challenge 2024: Software and Hardware Co-optimization for General Large Language Model Inference on CPU
Python Updated Apr 8, 2024
-
q-diffusion Public
Forked from Xiuyu-Li/q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
Python MIT License Updated Mar 21, 2024
-
sige Public
Forked from lmxyy/sige
[NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
Python Other Updated Mar 18, 2024
-
instant-ngp Public
Forked from NVlabs/instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
Cuda Other Updated Nov 19, 2023
-
paradigms Public
Forked from AndyShih12/paradigms
PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight
Python MIT License Updated Oct 13, 2023
-
trt-samples-for-hackathon-cn Public
Forked from NVIDIA/trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
Python Apache License 2.0 Updated Oct 12, 2023