Computing gradients of intermediate values

mouryarishik · March 28, 2019, 4:33am

Does mxnet allow us to compute gradients of intermediate values?
For example let’s we have attached gradient with w1, then under mx.autograd.record() I have calculated z1 = 1 + w1, z2 = z1 + w1, I know how to calculate gradient of z2 wrt w1 (by calling z2.backward(), then getting gradient by w1.grad) but how to calculate gradients of z2 wrt z1???
Because z1 has not gradient attached to it.

thomelane · April 3, 2019, 12:32am

Can I ask what your usecase is for retriving the intermeidate gradients? Usually we’re only interested in the getting the gradients of the ‘leaf nodes’ (e.g. the parameters), and don’t save the intermediate gradients to save memory.

If you’re only interested in the gradient of the intermediate variable, you can attach_grad on z1 after it is defined, but this will effect the gradient calculation for w1. You can alternatively implement a custom backward method and extract the intermediate gradient from there.

mouryarishik · April 3, 2019, 1:48am

Ok I’ll try it out. Actually I am doing a research work so I need to see what is happening to the gradients of hidden layers that’s why.

adelshafiei · April 23, 2019, 8:25pm

@thomelane How can we attach grad to hidden layers of a loaded pre-trained network? My intention is to call forward and backward and then look at the gradients of the hidden layers. I know how find the gradients of the hidden layers wrt input but I am interested in the gradient of one hidden layer wrt another one.

Topic		Replies	Views
How to calculate gradients for the intermediate outputs using symbol?	0	338	June 9, 2019
Gradient fetching Discussion	2	590	May 31, 2018
Adding network gradient to the computational graph Gluon	3	1647	December 17, 2018
How to implement the addtion of grad in the backback-propagating,how to add extra term (which is the gradient to middle net layer output) to the network	2	592	August 18, 2018
Computing per-class gradients	5	651	August 16, 2018

Computing gradients of intermediate values

Related Topics