I have a piece of code that fails at the `trainer.step` line with the following error: UserWarning: Gradient of Parameter `resnetv20_conv0_weight` on context gpu(0) has not been updated by backward since last `step`. This could mean a bug in your model that made it only use a subset of the Parameter…

It appears the issue was the `attach_grad` call I make on the outputs before computing the loss: `[x.attach_grad() for x in output]`. That causes the gradient graph to be lost, so the gradients never reach the network parameters. This is an open issue: https://github.com/apache/incubator-mxnet/issues/11865

Implementation of weighted softmax by extending mx.autograd.Function fails

ThomasDelteil September 2, 2019, 12:42am 2

See my other answer to your post: I would recommend using a custom op:


Implementation of weighted softmax by extending mx.autograd.Function fails
