QuaRot This repository contains the code for QuaRot, an end-to-end 4-bit inference of large language models. Usage Dependencies cmake C++ compiler (GCC/clang/...) nvcc Instructions git clone https://github.com/sashkboos/QuaRot.git cd QuaRot pip install -e . # or pip install .