🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 60,089 6,074 Updated Aug 24, 2024

princeton-nlp / SimPO

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 872 61 Updated Feb 16, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 26,367 3,029 Updated Oct 2, 2024

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 252 31 Updated Mar 23, 2025

AMLab-Amsterdam / L0_regularization

Learning Sparse Neural Networks through L0 regularization

Python 240 48 Updated Jul 17, 2020

ModelTC / MQBench

Model Quantization Benchmark

Python 798 142 Updated Apr 12, 2025

YuchuanTian / RethinkTinyLM

[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”

Python 121 7 Updated Jan 14, 2025

pan-x-c / EE-LLM

EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).

Python 57 6 Updated Jun 14, 2024

xlite-dev / Awesome-LLM-Inference

📚 A curated list of awesome LLM/VLM inference papers with code: WINT8/4, FlashAttention, PagedAttention, MLA, parallelism, etc.

Python 3,851 275 Updated Apr 18, 2025

yumath / bertNER

ChineseNER based on BERT, with BiLSTM+CRF layer

Python 450 96 Updated Jun 18, 2021

uzh-rpg / event-based_vision_resources

Event-based Vision Resources. Community effort to collect knowledge on event-based vision technology (papers, workshops, datasets, code, videos, etc)

3,107 681 Updated Apr 18, 2025

IST-DASLab / sparsegpt

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 785 103 Updated Aug 20, 2024

locuslab / wanda

A simple and effective LLM pruning approach.

Python 737 103 Updated Aug 9, 2024

Eric-mingjie / rethinking-network-pruning

Rethinking the Value of Network Pruning (PyTorch) (ICLR 2019)

Python 1,515 293 Updated Jun 7, 2020

horseee / LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Supports Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 999 116 Updated Oct 7, 2024

huawei-noah / VanillaNet

Python 823 55 Updated Oct 19, 2023

jasonwei20 / eda_nlp

Data augmentation for NLP, presented at EMNLP 2019

Python 1,630 316 Updated Mar 19, 2023

xmu-xiaoma666 / External-Attention-pytorch

🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding the papers in more depth. ⭐⭐⭐

Python 11,882 1,965 Updated Dec 6, 2024

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 11,500 1,625 Updated Apr 7, 2025

he-y / Awesome-Pruning

A curated list of neural network pruning resources.

2,437 330 Updated Apr 4, 2024

南雍山野猪骑士 yumath
