Accuracy issue on video classification

aztc · January 6, 2020, 9:05am

Why the accuracy of video classification model printed on gluoncv 's website can’t match with what is in the train log ?

For example, the top-1 accuracy of slowfast_4x16_resnet50_kinetics400 on kenetics-400 datasets is 75.3 but that in the train log is 67.1.

Here’s the URL https://gluon-cv.mxnet.io/model_zoo/action_recognition.html

zhuyi490 · February 12, 2020, 7:12pm

This is because during training, the performance is evaluated on a single video clip from the entire video.

During testing, we evenly select 10 video clips from the entire video, and perform three-crop augmentation technique. This is the standard evaluation technique adopted in the field. Given the fact that we can see more clips (more temporal information) and more spatial crops (more spatial information), we can obtain much better accuracy. Hope this helps.

Topic		Replies	Views
VideoClsCustom's new_lenght parameter when the number of frames in the videos is fewer than new_lenght Gluon	2	537	May 14, 2021
Video classification - transfer learning Gluon	2	413	February 21, 2020
Finetuneing a pretrained ResNet50_v1d in gluoncv Gluon	1	454	December 31, 2018
FasterRCNN Coco pretrained only predicts human (class 0) Gluon	2	391	October 1, 2019
Accuracy issue on VGG	5	603	April 16, 2020

Accuracy issue on video classification

Related Topics