- Beijing
-
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedJul 24, 2024 -
litgpt Public
Forked from Lightning-AI/litgptPretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
Python Apache License 2.0 UpdatedJun 24, 2024 -
rocmstat Public
Forked from wookayin/gpustat📊 A simple command-line utility for querying and monitoring GPU status
Python UpdatedApr 15, 2024 -
PaperListTemplate Public
This template makes it easy for you to manage papers.
-
Awesome-Efficient-LLM Public
Forked from horseee/Awesome-Efficient-LLMA curated list for Efficient Large Language Models
UpdatedSep 4, 2023 -
LoRA Public
Forked from microsoft/LoRACode for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Python MIT License UpdatedMay 25, 2023 -
simplenote-android Public
Forked from Automattic/simplenote-androidSimplenote for Android
Java GNU General Public License v2.0 UpdatedMay 12, 2023 -
llama Public
Forked from meta-llama/llamaInference code for LLaMA models
Python GNU General Public License v3.0 UpdatedMar 7, 2023 -
attention-is-all-you-need-paper Public
Forked from brandokoch/attention-is-all-you-need-paperImplementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
Jupyter Notebook MIT License UpdatedJan 16, 2023 -
-
EagleEyeEFF Public
Implement channel pruning using the latest Torch.FX feature !!! && EagleEye reimplementation
-
-
examples-run Public
Forked from pytorch/examplesA set of examples around pytorch in Vision with TRAINING BASH.
-
tutorials Public
Forked from pytorch/tutorialsPyTorch tutorials.
Python BSD 3-Clause "New" or "Revised" License UpdatedMar 13, 2022 -
EagleEye Public
Forked from anonymous47823493/EagleEye(ECCV'2020 Oral)EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning
Python UpdatedMar 9, 2022 -
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedFeb 28, 2022 -
pytorch-cifar Public
Forked from kuangliu/pytorch-cifar95.47% on CIFAR10 with PyTorch
Python MIT License UpdatedJan 18, 2022 -
EfficientPyTorch Public
Forked from wangying-ict/LLSQA PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.
-
pytorch-cifar-models Public
Forked from chenyaofo/pytorch-cifar-modelsPretrained models on CIFAR10/100 in PyTorch
Python BSD 3-Clause "New" or "Revised" License UpdatedNov 30, 2021 -
ABCPruner Public
Forked from lmbxmu/ABCPrunerPytorch implementation of our paper accepted by IJCAI 2020 -- Channel Pruning via Automatic Structure Search
Python UpdatedNov 30, 2021 -
supermariopy Public
Forked from theRealSuperMario/supermariopypython library, scripts and notebooks that are usfull from time to time
Python MIT License UpdatedOct 12, 2021 -
MQBench Public
Forked from ModelTC/MQBenchModel Quantization Benchmark
Python Apache License 2.0 UpdatedOct 9, 2021 -
aimet Public
Forked from quic/aimetAIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Python Other UpdatedSep 29, 2021 -
Asks: Convolution with any-shape kernels for efficient neural networks (Neurocomputing.2021)
Python UpdatedSep 27, 2021 -
BitSplit Public
Forked from peiswang/BitSplitBitSplit Post-trining Quantization
Python Apache License 2.0 UpdatedSep 23, 2021 -
-
awesome-image-transformer Public
Forked from rajatsaini0294/awesome-image-transformerList of all the papers on Transformers for Vision.
Apache License 2.0 UpdatedFeb 12, 2021 -
Dynamic-convolution-Pytorch Public
Forked from kaijieshi7/Dynamic-convolution-PytorchPytorch!!!Pytorch!!!Pytorch!!! Dynamic Convolution: Attention over Convolution Kernels (CVPR-2020)
Python UpdatedJan 6, 2021 -
-
LSQuantization Public
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)