Skip to content
View lliai's full-sized avatar

Block or report lliai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • D2MoE Public

    D^2-MoE: Delta Decompression for MoE-based LLMs Compression

    Python 15 Apache License 2.0 Updated Feb 25, 2025
  • Paper survey of efficient computation for large scale models.

    32 Apache License 2.0 Updated Dec 7, 2024
  • DSA Public

    Discovering Sparsity Allocation for Layer-wise Pruning of Large Language Models

    Python Updated Oct 29, 2024
  • ALS Public

    ALS

    1 MIT License Updated Oct 26, 2024
  • Auto-DAS Public

    [ECCV2024] Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search

    Python 2 1 Apache License 2.0 Updated Oct 22, 2024
  • Auto-GAS Public

    [ECCV2024] Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search

    Python 2 Apache License 2.0 Updated Oct 22, 2024
  • VLoRA Public

    Forked from FeipengMa6/VLoRA

    [NeurIPS 2024] Visual Perception by Large Language Model’s Weights

    Python Apache License 2.0 Updated Oct 17, 2024
  • Awesome-Low-Rank-Adaptation

    80 11 Updated Oct 13, 2024
  • OLMoE Public

    Forked from allenai/OLMoE

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook Apache License 2.0 Updated Sep 4, 2024
  • A reading list on LLM based Synthetic Data Generation 🔥

    MIT License Updated Aug 12, 2024
  • SANE Public

    Forked from HSG-AIML/SANE

    Code Repository for the ICML 2024 paper: "Towards Scalable and Versatile Weight Space Learning".

    Python Updated Jul 11, 2024
  • DetKDS Public

    [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors

    Python 8 Apache License 2.0 Updated Jul 11, 2024
  • AttnZero Public

    [ECCV2024] AttnZero: Efficient Attention Discovery for Vision Transformers

    4 Apache License 2.0 Updated Jul 10, 2024
  • Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"

    Python MIT License Updated Jun 26, 2024
  • LGD Public

    Forked from mZhenz/LGD

    Lightweight Model Pre-training via Language Guided Knowledge Distillation

    Jupyter Notebook Updated Jun 24, 2024
  • MATES Public

    Forked from cxcscmu/MATES

    Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models

    Python MIT License Updated Jun 13, 2024
  • DiscoPOP Public

    Forked from luchris429/DiscoPOP

    Code for Discovering Preference Optimization Algorithms with and for Large Language Models

    Python MIT License Updated Jun 13, 2024
  • KAN-EA Public

    Forked from hhyqhh/KAN-EA
    Python Updated May 28, 2024
  • SimDA Public

    Forked from ChenHsing/SimDA

    [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation

    Python Updated May 7, 2024
  • Awesome Knowledge-Distillation for CV

    74 5 Updated Apr 30, 2024
  • Auto-Prox-AAAI24

    Python 11 Apache License 2.0 Updated Apr 30, 2024
  • Python Updated Apr 23, 2024
  • loretta Public

    Forked from yifanycc/loretta

    [NAACL 24] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models

    Python GNU General Public License v3.0 Updated Apr 19, 2024
  • AdaGP Public

    Forked from BaiTheBest/SparseLLM

    Official Repo for Adaptive Global Pruning of LLMs

    Python Apache License 2.0 Updated Apr 15, 2024
  • A curated list for Efficient Large Language Models

    Python Updated Apr 10, 2024
  • T-GATE Public

    Forked from HaozheLiu-ST/T-GATE

    Accelerating Text-to-Image Diffusion Model for Free

    Python MIT License Updated Apr 4, 2024
  • Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

    Python Updated Apr 3, 2024
  • zigma Public

    Forked from CompVis/zigma

    The official implementation of "ZigMa: A DiT-Style Mamba-based Diffusion Model

    Python Apache License 2.0 Updated Apr 2, 2024
  • TFMQ-DM Public

    Forked from ModelTC/TFMQ-DM

    [CVPR 2024] TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models

    Jupyter Notebook Apache License 2.0 Updated Mar 31, 2024
  • APQ-DM Public

    Forked from ChangyuanWang17/APQ-DM

    This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24)

    Python Apache License 2.0 Updated Mar 27, 2024