NLP prediction using a CNN pretrained model

cosmincatalin · November 14, 2017, 8:53pm

Hi,

Following the tutorial at https://mxnet.incubator.apache.org/tutorials/nlp/cnn.html, I tried loading the model from the checkpoint so that I can make predictions on single samples of text (in other words, batches of 1 sample). However, because the model was trained with a batch size of 50, binding it with a batch size of 1 fails.

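import mxnet as mx

# Load the symbol and parameters saved at epoch 3 under the prefix 'cnn'
sym, arg_params, aux_params = mx.model.load_checkpoint('cnn', 3)
mod = mx.mod.Module(symbol=sym, context=mx.cpu(), label_names=None)
# Bind for inference with a single sample of 56 tokens
mod.bind(for_training=False, data_shapes=[('data', (1, 56))],
         label_shapes=mod._label_shapes)
mod.set_params(arg_params, aux_params, allow_missing=True)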

The above code breaks with:

data: (1, 56)
Error in operator reshape0: [20:32:53] src/operator/tensor/./matrix_op-inl.h:179: Check failed: oshape.Size() == dshape.Size() Target shape size is different to source. Target: 840000
Source: 16800

This is because the CNN model has several Reshape layers which are configured based on the batch size:

conv_input = mx.sym.Reshape(data=embed_layer, target_shape=(batch_size, 1, sentence_size, num_embed))
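
To make the size mismatch concrete, here is a small pure-Python sketch of the element-count check behind that error. It is not MXNet code: `resolve_target` is a hypothetical helper, and the embedding size of 300 is inferred from the error message (16800 / 56). The -1 case mirrors the convention, also supported by the `shape` argument of `mx.sym.Reshape`, of letting one dimension be inferred from the input, so the batch size need not be hard-coded.

```python
from math import prod

SENTENCE_SIZE = 56  # taken from the data shape (1, 56) in the post
NUM_EMBED = 300     # assumption: inferred from the error message, 16800 / 56


def resolve_target(source_shape, target_shape):
    """Hypothetical helper: check that a reshape preserves the element count.

    A single -1 in target_shape is replaced by whatever size makes the
    element counts match, similar to a batch-agnostic reshape.
    """
    source_size = prod(source_shape)
    if -1 in target_shape:
        known = prod(d for d in target_shape if d != -1)
        if source_size % known != 0:
            raise ValueError("Cannot infer the -1 dimension")
        return tuple(source_size // known if d == -1 else d
                     for d in target_shape)
    target_size = prod(target_shape)
    if target_size != source_size:
        raise ValueError(
            "Target shape size is different to source. "
            f"Target: {target_size} Source: {source_size}"
        )
    return tuple(target_shape)


# Embedding output for one sample: (1, 56, 300) holds 16800 elements.
single = (1, SENTENCE_SIZE, NUM_EMBED)

# A target with the batch size left open adapts to the input.
print(resolve_target(single, (-1, 1, SENTENCE_SIZE, NUM_EMBED)))

# A target hard-coded to batch_size=50 expects 840000 elements and fails.
try:
    resolve_target(single, (50, 1, SENTENCE_SIZE, NUM_EMBED))
except ValueError as err:
    print(err)
```

Whether the checkpointed symbol can be rebuilt this way depends on how the tutorial's network is defined, so treat this only as an illustration of why the bind fails.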

The question is: how can I load the model and use it to predict on a single sample of text? I do not want to train with a batch size of 1, because that is not optimal.

kevinthesun · November 18, 2017, 1:32am

If the network architecture depends on the batch size, you may need to feed in data with the corresponding batch size. A possible solution is to repeat your input data 50 times to compose a (50, 56) data shape.
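
A minimal sketch of that workaround, in plain Python so it runs without MXNet. The names `pad_to_batch`, `predict_single`, and `predict_batch` are hypothetical; `predict_batch` stands in for whatever runs the bound module on a full (50, 56) batch and returns one output per row.

```python
BATCH_SIZE = 50  # batch size the model was trained with, per the post


def pad_to_batch(sample, batch_size=BATCH_SIZE):
    """Repeat one sample so it fills a whole batch."""
    return [list(sample) for _ in range(batch_size)]


def predict_single(sample, predict_batch, batch_size=BATCH_SIZE):
    """Predict for one sample by filling the batch with copies of it.

    All rows are identical, so only the first output is kept.
    """
    batch = pad_to_batch(sample, batch_size)
    outputs = predict_batch(batch)
    return outputs[0]


# Stand-in for the real model (assumption): one output per row.
def fake_predict_batch(batch):
    return [sum(row) for row in batch]


sample = list(range(56))  # one sentence of 56 token ids
print(len(pad_to_batch(sample)), len(pad_to_batch(sample)[0]))
print(predict_single(sample, fake_predict_batch))
```

The cost is 49 redundant forward computations per prediction, which is why this works but feels like brute force.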

cosmincatalin · November 18, 2017, 6:42am

That is indeed a way of solving the issue, and I have actually tried it. Alas, I don’t think it is elegant to brute-force my way into using the model.

blattnem · January 26, 2018, 9:23am

I am facing the same problem. Is there a solution other than repeating the data to match the batch_size?
