- BitDistiller Public
  Forked from DD-DuDa/BitDistiller
  [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.
  Python · MIT License · Updated Jan 3, 2025
- composable_kernel Public
  Forked from ROCm/composable_kernel
  Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
  C++ · Other · Updated Oct 9, 2024
- hoi_llama.cpp Public
  Forked from HoiV/llama.cpp
  LLM inference in C/C++
  C++ · MIT License · Updated Jul 19, 2024
- xformers Public
  Forked from facebookresearch/xformers
  Hackable and optimized Transformers building blocks, supporting a composable construction.
  Python · Other · Updated Apr 19, 2024
- cutlass Public
  Forked from NVIDIA/cutlass
  CUDA Templates for Linear Algebra Subroutines
  C++ · Other · Updated Apr 9, 2024
- Welder Public
  Forked from nox-410/Welder
  OSDI 2023 Welder, deeplearning compiler
  Python · Updated Mar 21, 2024
- nnfusion Public
  Forked from microsoft/nnfusion
  A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
  C++ · MIT License · Updated Jan 16, 2024
- nni Public
  Forked from J-shang/nni
  An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
  Python · MIT License · Updated Jun 9, 2022
- TorchSharpExamples Public
  Forked from dotnet/TorchSharpExamples
  Repository for TorchSharp examples and tutorials.
  Jupyter Notebook · MIT License · Updated Apr 8, 2022