Recovery from MXNet checkpoint for wide & deep, accuracy decreases


#1

use example/sparse/wide_deep/train.py to train
e.g. in the first epoch the accuracy is 0.83, like below

2018-06-27 18:48:53,896 epoch 0, accuracy = 0.8382098765432099
2018-06-27 18:48:53,900 Saved checkpoint to "./checkpoint/checkpoint-0000.params"
2018-06-27 18:48:53,902 Saved optimizer state to "./checkpoint/checkpoint-0000.states"
2018-06-27 18:48:53,902 Training completed.

load the saved checkpoint and score with the validation dataset; the accuracy decreases to 0.79
INFO:logger:Finished inference with 16200 images
INFO:logger:Finished with 115108.471217 images per second
INFO:logger:('accuracy', 0.7991358024691358)


#2

@glingyan could you share your entire training and validation code so I can try to understand what happened?

A typical mistake here would be to compare the training accuracy (calculated using the training set you used to train) and the testing accuracy, computed with the unseen testing set.


#3

@ThomasDelteil thanks very much, I found the problem:
wide & deep needs to use the hash() function to preprocess the data before training, and the validation data was not preprocessed the same way before scoring
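To illustrate the kind of mismatch described above, here is a minimal sketch of hash-based feature bucketing (the function name, bucket count, and field names are illustrative, not taken from the wide & deep example). One caveat worth noting: Python's built-in hash() is salted per process via PYTHONHASHSEED, so bucket ids produced at training time can differ from those produced in a fresh inference process unless a stable hash is used.

```python
import hashlib

def hash_bucket(feature_value, num_buckets=1000):
    """Map a categorical string to a stable integer bucket id.

    A stable digest (MD5 here) is used instead of the built-in hash(),
    which is salted per process and would break checkpoint recovery
    across runs.
    """
    digest = hashlib.md5(feature_value.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_buckets

# Every categorical column must be hashed identically at training time
# and at scoring time; otherwise the restored model sees different
# feature ids and accuracy drops, as in this issue.
row = {"occupation": "Tech-support", "education": "Bachelors"}
hashed = {k: hash_bucket(v) for k, v in row.items()}
```

The key property is determinism across processes: scoring a checkpoint in a new process must reproduce exactly the bucket ids the model was trained on.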