PyTorch version of memory balanced model parallel
Chinese blog about this repo: http://bindog.github.io/blog/2019/09/05/gpu-memory-balanced-model-parallel/
Updates on APEX fp16 training: http://bindog.github.io/blog/2020/04/12/model-parallel-with-apex/