Batch_dot one hot vectors with embeddings results in nan

hankcs · April 14, 2019, 3:38am

Hi there,

I have a one hot tensor like this:

First dim is batch size, second dim is char, third is one hot vector. One batch looks like this:

And an embedding tensor like this:

Each element is in a certain range:

Then I use char_embed = nd.batch_dot(one_hot, bert_embed) to lookup my embedding, the result contains nan which confuses me for a whole day.

If there is no nan in bert_embed, why is nan produced when it is batch_dot with a one hot tensor?

hankcs · April 14, 2019, 5:10pm

The bug only occurs on CPU:

src/operator/nn/./fully_connected-inl.h:200: float16 fully connected layer is currentlyonly supported by CuDNN version.

Well, it works fine on GPU. So, not a big deal.

Topic		Replies	Views
HW 9. How to prevent overflow to NaN Courses	2	597	April 27, 2019
HW 8.3.2 Faster way to obtain embeddings? Courses	2	558	April 16, 2019
Nan in loss after several epochs in SemSeg problem Gluon	4	3297	May 7, 2018
C++ predict out data show -nan(ind)	4	1797	November 15, 2018
Batch Norm And Batch Size 1 Recommendations Discussion	0	371	February 13, 2020