Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Cuda rng_state_all is used when saving in distributed mode so same sh…
…ould also be used when loading (huggingface#23045) cuda rng state should be all for distributed bc all were saved
- Loading branch information