Skip to content
View takakib123's full-sized avatar

Block or report takakib123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Pruning

15 repositories
Python 52 3 Updated May 31, 2024
Python 45 5 Updated Dec 15, 2024

Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.

Python 71 4 Updated Mar 30, 2023

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 928 109 Updated Oct 7, 2024

Prune a model while finetuning or training.

Jupyter Notebook 397 60 Updated Jun 21, 2022

A simple and effective LLM pruning approach.

Python 705 96 Updated Aug 9, 2024

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 759 99 Updated Aug 20, 2024

A curated list for Efficient Large Language Models

Python 1,393 104 Updated Dec 30, 2024

A block pruning framework for LLMs.

Python 15 2 Updated Jun 20, 2024

Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks

Python 33 1 Updated Jun 25, 2024

BlockDrop: Dynamic Inference Paths in Residual Networks

Python 142 40 Updated Dec 12, 2022

Collection of recent methods on (deep) neural network compression and acceleration.

936 132 Updated Dec 3, 2024

GNN-RL Compression: Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning

Python 62 13 Updated Feb 21, 2023
Python 55 12 Updated Feb 11, 2019

RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration

Python 13 Updated Nov 12, 2024