Questions about loss functions

eunseop90 · June 6, 2019, 2:01pm

Greetings,

I am implementing a customized loss function. Doing this thing, I have some question.
In papers, most loss functions are summed.
For examples, if you look at softmaxOutput in mxnet, it just calculates p_i - y_i (where p_i is softmax’s output and y_i is a label), it does not sum up the values.

In my customized loss function, do I implement to only calculate gradient values, not to sum up?

Thank you.

sad · June 7, 2019, 1:27am

Hi,

Can you explain more what you mean and can you point to where you’re looking at in the code where the softmax output is not summed.

Unless you’re referring to the gradients in which case it should not be summed correct?

If you’re looking at this: https://github.com/apache/incubator-mxnet/blob/master/src/operator/softmax_output.cc#L137 then you see that for Softmax output the loss is actually not computed because you only really need the gradient of the loss with respect to softmax and you can compute that without computing the loss.

Topic		Replies	Views
Loss function in Mxnet C++	8	1594	June 22, 2018
Cannot implement customized loss function?	2	1119	March 25, 2018
How to implement custom loss functions without label assignments (unsupervised)? Discussion	10	2800	May 14, 2018
Custom Loss + L2 Regularization Discussion	3	1396	July 6, 2018
Custom loss function from a pre-trained network Discussion	2	831	March 23, 2018

Questions about loss functions

Related Topics