forked from facebookresearch/fairseq
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Deepspeed can be used outside of fbcode (facebookresearch#1727)
Summary: Pull Request resolved: fairinternal/fairseq-py#1727 Reviewed By: myleott Differential Revision: D27213955 Pulled By: sshleifer fbshipit-source-id: be84e7f7c1c55c407ee7445fad9b3026a79763fb
- Loading branch information
1 parent
5c87bb5
commit 8f77e24
Showing
2 changed files
with
27 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
#!/usr/bin/env bash | ||
rm -rf fsdp_dummy | ||
mkdir -p fsdp_dummy | ||
fairseq-train /private/home/sshleifer/data-bin/stories_mmap \ | ||
--ddp-backend fully_sharded --fp16 --fp16-init-scale 4 \ | ||
--cpu-offload --checkpoint-activations \ | ||
--task language_modeling --tokens-per-sample 256 --batch-size 8 \ | ||
--arch transformer_lm_gpt2_tiny \ | ||
--optimizer cpu_adam --adam-betas "(0.9,0.98)" \ | ||
--lr 0.0001 --lr-scheduler polynomial_decay --warmup-updates 5 --total-num-update 10 \ | ||
--max-update 10 --log-format json --log-interval 1 \ | ||
--save-interval-updates 10 --save-dir fsdp_dummy \ | ||
--restore-file x.pt "$@" |