Weight Decay

http://d2l.ai/chapter_multilayer-perceptrons/weight-decay.html

In train(lambd) as well as train_gluon(wd), animator.add(epoch+1, …) should be changed into animator.add(epoch, …) because epoch starts from 1 in the for loop.