Calculating loss during training

I was training my model using code below:

%%time
for epoch in range(10):
    for features, labels in mxtrainloader:
        with autograd.record():
            output = mxnet(features.as_in_context(ctx))
            loss = mxobjective(output, labels.as_in_context(ctx))
        loss.backward()
        mxoptimizer.step(features.shape[0])
    print('Epoch:', epoch)

That took 1 min 42 sec to complete.

Then I changed the code to:

%%time
for epoch in range(10):
    cum_loss = 0.0
    for batches, (features, labels) in enumerate(mxtrainloader):
        with autograd.record():
            output = mxnet(features.as_in_context(ctx))
            loss = mxobjective(output, labels.as_in_context(ctx))
        loss.backward()
        mxoptimizer.step(features.shape[0])
        cum_loss += loss.mean().asscalar()  # accumulate the mean loss of this batch
    print('Epoch:', epoch, 'Loss:', cum_loss / (batches + 1))  # batches is zero-based

Now it takes 2 min 51 sec.

That's roughly a 1.7× increase in training time.

The only change is that I add loss.mean().asscalar() to cum_loss on every iteration. That's it! That single line accounts for the entire slowdown.
Is there a better way to track the training loss?

Thanks for your time

Hi @mouryarishik,

.asscalar() is a blocking call: it forces the asynchronous execution engine to finish the pending computation and copy the result to the CPU, which limits how much work can run in parallel. I think this is the most likely reason for the slowdown. (Blocking like this can still be useful, for example to keep the engine from queuing up so many operations that you run out of memory.) You could postpone calling .asscalar() until the end of the epoch, at print time, rather than calling it at the end of each batch.
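For example, something along these lines. This is just a sketch based on your snippet, reusing your names (mxnet, mxtrainloader, mxobjective, mxoptimizer, ctx); the running loss is kept as an NDArray on the device, so the only blocking call per epoch is the one at print time:

for epoch in range(10):
    cum_loss = 0.0          # becomes an NDArray after the first batch
    num_batches = 0
    for features, labels in mxtrainloader:
        with autograd.record():
            output = mxnet(features.as_in_context(ctx))
            loss = mxobjective(output, labels.as_in_context(ctx))
        loss.backward()
        mxoptimizer.step(features.shape[0])
        # loss.mean() returns an NDArray, so this only queues more work on the
        # async engine instead of forcing a copy to the CPU every batch
        cum_loss += loss.mean()
        num_batches += 1
    # single blocking .asscalar() call per epoch, at print time
    print('Epoch:', epoch, 'Loss:', cum_loss.asscalar() / num_batches)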

Thanks for the suggestion; the slowdown has gone from roughly 1.7× down to about 1.3×.