When and where should I use `detach()`?

I am training a GAN in which an image pool is used to train the discriminator, but I got a weird error:

Check failed: type_ != nullptr: The any container is empty requested=N5mxnet10Imperative6AGInfoE

Can anyone give me some advice?

Can you share the relevant code?

I am not sure whether the question in your subject line is related to the error.
`detach()` removes an NDArray from the computation graph, so no gradients will be computed through it. In the context of GANs you need this because you update the discriminator first and then the generator. To compute the discriminator loss, you need fake data created by the generator. To avoid backpropagating the discriminator update into the generator, you detach this fake data from the graph. Here is a small example:

with autograd.record():
    output_real, _, _ = discriminator(real_data)
    d_error_real = loss1(output_real, real_label)

    # create fake image and input it to discriminator
    fake_image = generator(g_input)
    output_fake = discriminator(fake_image.detach())
    d_error_fake = loss1(output_fake, fake_label)

    # total discriminator error
    d_error = d_error_real + d_error_fake
d_error.backward()
d_trainer.step(batch_size)

with autograd.record():
    fake_image = generator(g_input)
    output_fake, category_prob, continuous_mean = discriminator(fake_image)
    g_error = loss1(output_fake, real_label)
g_error.backward()
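
If you want to see the effect of `detach()` in isolation, here is a minimal toy sketch (separate from the GAN code above; the names are just for illustration). Only the non-detached branch contributes a gradient:

import mxnet as mx
from mxnet import autograd

x = mx.nd.array([2.0])
x.attach_grad()

with autograd.record():
    y = x * x            # recorded: dy/dx = 2x
    z = y.detach() * 3   # detach() cuts the history, so z acts like a constant
    loss = y + z
loss.backward()

print(x.grad)  # [4.] -- only the y branch contributes; without detach() it would be [16.]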

Sure, @NRauschmayr posted exactly the same code as mine.

Thanks for your great explanations.

But is the graph constructed within `with autograd.record()`, or by `discriminator.hybridize()`? Or is the truth that `discriminator.hybridize()` creates a forward graph, while `with autograd.record()` creates a backward graph, and the calculation of gradients and the update of weights depend on the backward graph?
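
To make the question concrete, here is roughly what I have in mind, as a minimal sketch with a toy `Dense` block (the real code uses the discriminator above):

import mxnet as mx
from mxnet import autograd, gluon

net = gluon.nn.Dense(1)
net.initialize()
net.hybridize()          # does this build the (forward) graph?

x = mx.nd.ones((2, 4))
with autograd.record():  # or is the graph recorded here?
    y = net(x)
y.backward()
print(net.weight.grad())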

I used:

with autograd.record():
    output_real, _, _ = discriminator(real_data)
    d_error_real = loss1(output_real, real_label)

    # create fake image and input it to discriminator
    with autograd.pause():
        fake_image = generator(g_input)
    output_fake = discriminator(fake_image.detach())
    d_error_fake = loss1(output_fake, fake_label)

    # total discriminator error
    d_error = d_error_real + d_error_fake
    autograd.backward(d_error)
d_trainer.step(batch_size)
...
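
My rough understanding of the difference between `autograd.pause()` and `detach()`, as a toy sketch (not the GAN code):

import mxnet as mx
from mxnet import autograd

x = mx.nd.array([2.0])
x.attach_grad()

# pause(): operations inside the scope are never recorded
with autograd.record():
    a = x * 2                # recorded
    with autograd.pause():
        b = x * x            # not recorded, behaves like a constant
    loss = a + b * 3
loss.backward()
print(x.grad)                # [2.] -- only the recorded branch contributes

# detach(): the operation is recorded, then the result is cut from the graph
with autograd.record():
    a = x * 2
    b = (x * x).detach()     # computed with recording on, then severed
    loss = a + b * 3
loss.backward()
print(x.grad)                # [2.] -- same effective gradient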

Lastly, is there any difference between `autograd.backward(d_error)` and `d_error.backward()`?
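
From a quick test on my side, both calls seem to give the same gradients for a single head (a minimal sketch; as far as I can tell, `autograd.backward()` additionally accepts a list of heads and head gradients, which the method form cannot express):

import mxnet as mx
from mxnet import autograd

x = mx.nd.array([3.0])
x.attach_grad()

with autograd.record():
    loss = x * x
loss.backward()            # method form
print(x.grad)              # [6.]

with autograd.record():
    loss = x * x
autograd.backward(loss)    # function form
print(x.grad)              # [6.] -- same result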