Skip to content

Commit

Permalink
update rlhf ing
Browse files Browse the repository at this point in the history
  • Loading branch information
mst272 committed Jul 9, 2024
1 parent 001a446 commit 6f9be79
Showing 1 changed file with 7 additions and 7 deletions.
14 changes: 7 additions & 7 deletions rlhf/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -54,13 +54,13 @@ ds.yaml文件中main_process_port如果被占用则加一个数字即可。错
### 显存实验
res——length为64

| **RLHF** | **deepspeed** | **方式** | **Reward Model** | **SFT Model** | **显存占用** |
|----------|------------------|--------|------------------|---------------|-----------------------|
| RLOO | Zero 3 cpu cpu | Lora | QWEN2(7B) | QWEN2(7B) | 2 x A100(40GB):15~30G |
| RLOO | Zero 3 cpu cpu | Full | QWEN2(7B) | QWEN2(7B) | |
| RLOO | Zero 2 cpu cpu | Qlora | QWEN2(7B) | QWEN2(7B) | 2 x A100(40GB):30~40G |
| | | | | | |
| | | | | | |
| **RLHF** | **deepspeed** | **方式** | **Reward Model** | **SFT Model** | **显存占用** |
|----------|-----------------|--------|------------------|---------------|-----------------------|
| RLOO | Zero 3 cpu cpu | Lora | QWEN2(7B) | QWEN2(7B) | 2 x A100(40GB):15~30G |
| RLOO | Zero 3 cpu cpu | Full | QWEN2(7B) | QWEN2(7B) | 2 x A100(40GB):速度很慢 |
| RLOO | Zero 2 cpu cpu | Qlora | QWEN2(7B) | QWEN2(7B) | 2 x A100(40GB):30~40G |
| | | | | | |
| | | | | | |



Expand Down

0 comments on commit 6f9be79

Please sign in to comment.