Do I still need to set kv_store when using MXNet? Why? #80
Comments
You don't need to. The MXNet-BytePS implementation bypasses kvstore.
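As a rough illustration of what "bypasses kvstore" can look like in a Gluon training script, here is a minimal sketch. It assumes the `byteps.mxnet` module exposes `init()` and `DistributedTrainer(params, optimizer, optimizer_params)` as in the project's Gluon examples; the helper name `make_trainer` and its arguments are hypothetical and only for illustration. Check the calls against the BytePS version you have installed.

```python
def make_trainer(net, learning_rate=0.01):
    """Build a Gluon trainer that uses BytePS rather than an MXNet kvstore.

    `make_trainer`, `net`, and `learning_rate` are hypothetical names
    chosen for this sketch; they are not part of BytePS.
    """
    # Imported lazily so this file still loads where BytePS is not installed.
    import byteps.mxnet as bps

    # Start BytePS communication for this worker process.
    bps.init()

    params = net.collect_params()

    # Note that no kvstore argument is passed: gradient exchange is
    # assumed to be handled by BytePS itself.
    return bps.DistributedTrainer(
        params, "sgd", {"learning_rate": learning_rate}
    )
```

The returned object would then be used like an ordinary Gluon trainer (calling `step(batch_size)` after `backward()`), with no `kv_store` setting anywhere in the script.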
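Hello. Suppose I also set up separate scheduler, server, and worker nodes when doing distributed training with mxnet native, each running on its own machine. In that case, would there be any difference in efficiency between training with byteps mxnet and training with mxnet native? If so, where does the difference mainly come from? Thanks.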
There will be a performance difference even with the same setup you describe. We made many performance optimizations in BytePS. For example, compared to mxnet native, BytePS-mxnet eliminates some extra copies. BytePS also supports RDMA, which is faster than the TCP transport used by mxnet native. We plan to publish a technical report covering these optimizations in the future.
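For anyone reproducing the comparison, the scheduler/server/worker layout discussed above is normally described to each process through `DMLC_*` environment variables. The helper below is a small sketch of that idea; the function name `dmlc_env` and its parameters are hypothetical, and the `DMLC_WORKER_ID` variable is something I understand BytePS workers to use, so treat it as an assumption.

```python
def dmlc_env(role, root_uri, root_port, num_worker, num_server, worker_id=None):
    """Return the environment variables for one process in the cluster.

    `dmlc_env` is a hypothetical helper for illustration only.
    """
    if role not in ("scheduler", "server", "worker"):
        raise ValueError("role must be scheduler, server, or worker")

    env = {
        "DMLC_ROLE": role,
        # Address and port of the scheduler, shared by every process.
        "DMLC_PS_ROOT_URI": root_uri,
        "DMLC_PS_ROOT_PORT": str(root_port),
        # Cluster size; keep these identical across both setups being compared.
        "DMLC_NUM_WORKER": str(num_worker),
        "DMLC_NUM_SERVER": str(num_server),
    }

    # Assumption: BytePS workers additionally need a per-machine worker id.
    if role == "worker" and worker_id is not None:
        env["DMLC_WORKER_ID"] = str(worker_id)

    return env


# Example: the environment for the second worker in a 2-worker, 2-server job.
worker_env = dmlc_env("worker", "10.0.0.1", 1234, 2, 2, worker_id=1)
```

Keeping the worker and server counts the same in both runs makes the comparison apples to apples, so any remaining gap comes from the implementation rather than the topology.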
Thank you.
@bobzhuyb Is the number of servers the same for mxnet-native and mxnet-byteps?
Yes. You can try them yourself. The original ps-lite implementation is pretty poor: it is slower than Horovod, let alone BytePS.
Hello, I have one more question. When running distributed training with the original mxnet, I occasionally run into
@ZHAIXINGZHAIYUE I believe you won't have that problem if you configure BytePS correctly. We have never encountered it when using BytePS.