I'm training an image classification model with the mxnet.sym API. Briefly, my model works like this: besides the prediction and the label, the loss function also takes as input a weights term computed from an intermediate feature.
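Concretely, the loss is a per-sample weighted softmax cross-entropy, roughly like the sketch below (this helper is hypothetical, just to illustrate the shape of the computation, and assumes one scalar weight per sample):

import mxnet as mx

# Hypothetical weighted softmax cross-entropy: each sample's loss is
# scaled by a weight derived from the intermediate feature.
def softmax_loss(prediction, target, weights):
    log_p = mx.sym.log_softmax(prediction)
    nll = -mx.sym.pick(log_p, target)         # per-sample negative log-likelihood
    w = mx.sym.reshape(weights, shape=(-1,))  # one scalar weight per sample
    return mx.sym.MakeLoss(w * nll)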
However, I don't want the gradient to propagate back through the weights variable. In other words, I need weights to act as a static NDArray (just like the label), rather than as a Variable.
So how can I disable gradient back-propagation through weights? Currently I use mx.symbol.BlockGrad():
feature = conv(data)                    # shared intermediate feature
weights = linear1(feature)              # weights derived from the feature
weights = mx.symbol.BlockGrad(weights)  # block gradients flowing back into linear1
prediction = linear2(feature)
loss = softmax_loss(prediction, target, weights)  # the loss also consumes the weights
Is it correct to do so?
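To double-check, I put together a minimal standalone test using the MXNet 1.x symbol API (the layer names fc_w / fc_p and the toy loss are placeholders, not my real model). If BlockGrad behaves the way I hope, the gradient on fc_w's parameters should come out as all zeros:

import mxnet as mx

data = mx.sym.Variable('data')
w = mx.sym.FullyConnected(data, num_hidden=4, name='fc_w')     # stands in for linear1
w = mx.sym.BlockGrad(w)                                        # gradient should stop here
pred = mx.sym.FullyConnected(data, num_hidden=4, name='fc_p')  # stands in for linear2
out = mx.sym.MakeLoss(mx.sym.sum(pred * w))                    # toy loss mixing both branches

exe = out.simple_bind(ctx=mx.cpu(), data=(2, 8))
exe.forward(is_train=True, data=mx.nd.uniform(shape=(2, 8)))
exe.backward()
print(exe.grad_dict['fc_w_weight'].asnumpy())  # expect all zeros if BlockGrad works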