forked from mst272/LLM-Dojo
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request mst272#13 from aJupyter/main
update readme and run_example.sh
- Loading branch information
Showing
5 changed files
with
21 additions
and
4 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
deepspeed --include localhost:0,1 main_train.py\ | ||
--train_data_path 数据集路径\ | ||
--model_name_or_path 模型路径\ | ||
--task_type sft\ | ||
--train_mode qlora\ | ||
--output_dir 输出路径 | ||
|
||
# task_type:[pretrain, sft, dpo_multi, dpo_single] | ||
|
||
# python main_train.py --train_data_path 数据集路径 --model_name_or_path 模型路径 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,7 @@ | ||
from .dpo.dpo_config import TrainArgument as dpo_TrainArgument | ||
from .sft.lora_qlora.base import TrainArgument as sft_TrainArgument | ||
|
||
__all__ = [ | ||
"dpo_TrainArgument", | ||
"sft_TrainArgument", | ||
] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters