Skip to content

Commit

Permalink
fix qwen vllm
Browse files Browse the repository at this point in the history
  • Loading branch information
KMnO4-zx committed May 12, 2024
1 parent 559e0c4 commit b89e711
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions Qwen1.5/07-Qwen1.5-7B-Chat vLLM 推理部署调用.md
Original file line number Diff line number Diff line change
Expand Up @@ -288,8 +288,8 @@ python benchmark_throughput.py \
--output-len 128 \
--num-prompts 25 \
--seed 2024 \
--dtype float16
--hf_max_batch_size 25
--dtype float16 \
--hf-max-batch-size 25
```
得到的结果如下所示:
```
Expand Down

0 comments on commit b89e711

Please sign in to comment.