-
-
text-generation-inference Public
Forked from huggingface/text-generation-inferenceLarge Language Model Text Generation Inference
Python Apache License 2.0 UpdatedNov 14, 2024 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
AutoAWQ Public
Forked from casper-hansen/AutoAWQAutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
Python MIT License UpdatedDec 20, 2023