Skip to content

Latest commit

 

History

History
9 lines (6 loc) · 468 Bytes

README.md

File metadata and controls

9 lines (6 loc) · 468 Bytes

AWQ Examples

Here we provide two AWQ examples, applying to:

  • Vicuna-7B, a chatbot with instruction-tuning
  • LLaVA-13B, a visual LM for multi-modal applications like visual reasoning.

Here are some example output from the two demos. You should able to observe memory saving when running the demos in 4-bit. Please check the notebooks for details.

overview