Kvstore for distributed multi-gpu training

we (i’m working w/ @owenataws) were training resnet_v2_34 using cifar10 data. our learning rate was too high. we will have new test results soon and give update then.