I am reading the Gluon part of "Mixed precision training using float16".
In the example provided there, the dtype is not converted back to float32 before the softmax loss layer. In the symbolic part, however, it is, as the tutorial recommends:
"It is advisable to cast the output of the layers before softmax to float32, so that the softmax computation is done in float32. This is because softmax involves large reductions and it helps to keep that in float32 for a more precise answer."
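To illustrate why I am worried, here is a minimal NumPy sketch (not the MXNet example itself, just an assumption-level demo): computing softmax entirely in float16 gives a measurably different result from computing it in float32, because the exp and the sum reduction each round at float16 precision.

```python
import numpy as np

def softmax(x):
    # subtract the max for numerical stability, then normalize;
    # all intermediate results stay in x's dtype
    e = np.exp(x - x.max())
    return e / e.sum()

rng = np.random.default_rng(0)
logits32 = rng.standard_normal(10_000).astype(np.float32)

# same logits, one pass in float32 and one entirely in float16
p32 = softmax(logits32)
p16 = softmax(logits32.astype(np.float16)).astype(np.float32)

# the float16 reduction deviates from the float32 result
print("max abs difference:", np.abs(p32 - p16).max())
```

This is exactly the kind of error the quoted recommendation seems to be guarding against, which is why I expected the Gluon example to cast back to float32 before the loss as well.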
So, will the Gluon fp16 example lose precision? If so, could you please provide a corrected version?