I am a newbie trying to deploy a distributed environment for data-parallel training, but I am struggling with the EC2 cluster setup. I followed the deeplearning.template and the stack setup described here.
It's probably a bit outdated and a little perplexing for a newbie. Is there a more detailed walkthrough of how to launch a distributed cluster on EC2 and train the examples in a distributed manner?