What happens when I create two dist-* kvstores?


#1

I wrote a Python program:

# file: test.py
import mxnet as mx
kv0 = mx.kvstore.create('dist-sync')
kv1 = mx.kvstore.create('dist-sync')
print('something')

I then use launch.py to start distributed training:

${MXNET_PATH}/tools/launch.py -n 3 -H hosts --launcher ssh python test.py

The program hangs, and I don’t know why.


#2

Hi @ZhouJ

Why do you need two kvstores? You should be able to use just a single distributed kvstore. Does the launch.py script work correctly when you create only one kvstore?