Code for the NeurIPS 2024 paper: QuaRot, end-to-end 4-bit inference of large language models.

spcl/QuaRot

QuaRot

This repository contains the code for QuaRot, a method for end-to-end 4-bit inference of large language models.

Usage

Dependencies

  • cmake
  • C++ compiler (GCC/clang/...)
  • nvcc
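
Before installing, it can help to confirm that the tools listed above are on your PATH. The following is a minimal sketch, not part of the repository: the helper names `missing_tools` and `have_any` are hypothetical, and the compiler executables it probes (`g++`, `clang++`, `c++`) are an assumption, since the list above only says "GCC/clang/...".

```shell
# Print each name from the argument list that is not found on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Succeed if at least one of the given commands is on PATH.
have_any() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 && return 0
    done
    return 1
}

# cmake and nvcc come from the dependency list above.
missing=$(missing_tools cmake nvcc)

# Assumed compiler names; adjust to whichever C++ compiler you use.
have_any g++ clang++ c++ || missing="$missing c++-compiler"

if [ -n "$missing" ]; then
    # Left unquoted on purpose so the names print on one line.
    echo "Missing build dependencies:" $missing
else
    echo "All build dependencies found."
fi
```

The script only reports what it finds and never exits with an error, so it is safe to paste into an interactive shell before running the install commands.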

Instructions

git clone https://github.com/sashkboos/QuaRot.git
cd QuaRot
pip install -e .  # or pip install .
