Skip to content
View lidh15's full-sized avatar

Block or report lidh15

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning

Python 41 3 Updated Dec 2, 2024

This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 58 2 Updated Dec 26, 2024

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 547 39 Updated Dec 31, 2024

Agent S: an open agentic framework that uses computers like a human

Python 731 99 Updated Jan 2, 2025

PyTorch Implementation for Hyperbolic Fine-tuning for LLMs

Python 8 Updated Oct 24, 2024

TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention

Python 25 1 Updated Dec 14, 2024

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 674 45 Updated Oct 1, 2024

Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"

Python 17 2 Updated Dec 13, 2024
Python 84 12 Updated Dec 6, 2024

[ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retention

Python 60 6 Updated Apr 15, 2024

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 236 18 Updated Oct 8, 2024

Distributed LLM and StableDiffusion inference for mobile, desktop and server.

Rust 2,690 144 Updated Oct 23, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,076 64 Updated Jul 14, 2024

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 379 32 Updated Aug 11, 2024

A State-Space Model with Rational Transfer Function Representation.

Assembly 75 3 Updated May 17, 2024

PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers

Python 36 3 Updated Sep 3, 2024

mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless integration of various quantization methods and evaluation techn…

Python 20 3 Updated Nov 28, 2024

Implementation for MatMul-free LM.

Python 2,943 187 Updated Nov 5, 2024

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Python 99 13 Updated Oct 15, 2024

Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.

Python 307 26 Updated Nov 26, 2024

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 749 57 Updated Oct 8, 2024

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,192 393 Updated Jan 4, 2025

The code for the paper "Pre-trained Vision-Language Models Learn Discoverable Concepts"

Python 12 Updated Jun 5, 2024

[ICML 2024]: Official implementation for the paper: "Consistent Diffusion Meets Tweedie"

Python 49 4 Updated Apr 26, 2024

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 14,180 1,918 Updated Jan 3, 2025
Python 5 3 Updated Aug 8, 2023

[COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?

Python 61 4 Updated Nov 29, 2024
Next