Commit: update README.md

shadowpa0327 committed Aug 1, 2024
1 parent 7509f40 commit dee2d8e
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
@@ -80,6 +80,8 @@ python run_ppl_eval.py \
--lt_hadamard
```

*Note*: `run_ppl_eval.py` does not support multi-GPU evaluation yet. If your machine has multiple GPUs, set `CUDA_VISIBLE_DEVICES` to the desired GPU id.
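
For example, to pin the evaluation to GPU 0 (a minimal sketch; the other flags from the full command above still apply and are elided here):

```
# Restrict run_ppl_eval.py to a single visible GPU (GPU 0).
# The remaining flags from the full command above are elided.
CUDA_VISIBLE_DEVICES=0 python run_ppl_eval.py \
    --lt_hadamard
```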

#### Zero-shot Evaluation
To run zero-shot evaluations of models with compressed KV-Cache, we can use the `run_lm_eval.py` script, which implements a wrapper around the [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness/tree/big-refactor) library.
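
As a rough, assumption-based sketch of what such a wrapper typically looks like (this is not the actual `run_lm_eval.py`; the checkpoint and task names below are placeholders, and the KV-cache compression setup is elided), the lm-evaluation-harness v0.4 API can be driven from Python like this:

```
# Hypothetical wrapper sketch around lm-evaluation-harness (v0.4.x API).
# Not the actual run_lm_eval.py: KV-cache compression is elided and the
# checkpoint/tasks are placeholders.
import lm_eval
from lm_eval.models.huggingface import HFLM

# Wrap a Hugging Face checkpoint so the harness can drive it.
lm = HFLM(pretrained="meta-llama/Llama-2-7b-hf", batch_size=8)

# Run a few zero-shot tasks and print the aggregated metrics.
results = lm_eval.simple_evaluate(
    model=lm,
    tasks=["arc_easy", "arc_challenge", "winogrande"],
    num_fewshot=0,
)
print(results["results"])
```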

@@ -131,8 +133,6 @@ CUDA_VISIBLE_DEVICES=0 python run_latency_kernel.py \
--total_rank 1024 --group_size 4
```



## Reference
If you find this work useful, please consider citing our paper:
