An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 6,326 621 Updated Apr 18, 2025

The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".

Python 20 1 Updated Nov 12, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,680 369 Updated Apr 16, 2025

Code for paper "Achieving Sparse Activation in Small Language Models"

Python 6 Updated Sep 2, 2024

Awesome list for LLM pruning.

222 9 Updated Dec 15, 2024

[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"

Python 200 11 Updated Sep 30, 2024

Sparseout: Controlling Sparsity in Deep Networks

Python 2 2 Updated Jun 2, 2019

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 6,939 778 Updated Oct 22, 2024

List of papers related to neural network quantization in recent AI conferences and journals.

592 47 Updated Mar 27, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 60,089 6,074 Updated Aug 24, 2024

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 872 61 Updated Feb 16, 2025

LLM training in simple, raw C/CUDA

Cuda 26,367 3,029 Updated Oct 2, 2024

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 252 31 Updated Mar 23, 2025

Learning Sparse Neural Networks through L0 regularization

Python 240 48 Updated Jul 17, 2020

Model Quantization Benchmark

Python 798 142 Updated Apr 12, 2025

[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”

Python 121 7 Updated Jan 14, 2025

EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).

Python 57 6 Updated Jun 14, 2024

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.

Python 3,851 275 Updated Apr 18, 2025

ChineseNER based on BERT, with BiLSTM+CRF layer

Python 450 96 Updated Jun 18, 2021

Event-based Vision Resources. Community effort to collect knowledge on event-based vision technology (papers, workshops, datasets, code, videos, etc)

3,107 681 Updated Apr 18, 2025

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 785 103 Updated Aug 20, 2024

A simple and effective LLM pruning approach.

Python 737 103 Updated Aug 9, 2024

Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)

Python 1,515 293 Updated Jun 7, 2020

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 999 116 Updated Oct 7, 2024
Python 823 55 Updated Oct 19, 2023

Data augmentation for NLP, presented at EMNLP 2019

Python 1,630 316 Updated Mar 19, 2023

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 11,882 1,965 Updated Dec 6, 2024

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 11,500 1,625 Updated Apr 7, 2025

A curated list of neural network pruning resources.

2,437 330 Updated Apr 4, 2024