Proper usage of BatchNorm during inference?

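use_global_stats=True only changes behavior during training; at inference time BatchNorm normalizes with the running averages regardless. In general it is better not to set it: with use_global_stats=True the layer normalizes with the stored running statistics instead of the batch statistics and does not update them, so unless you load pre-trained weights, the running mean and variance stay at their initial values of 0 and 1. You can find a very detailed description here: Question about batch normalization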
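
To make the difference concrete, here is a minimal pure-Python sketch of the behavior described above. It is not MXNet's implementation: TinyBatchNorm is a hypothetical single-feature layer without the learnable scale and shift, and the momentum and eps defaults are assumptions chosen for illustration.

```python
from statistics import fmean, pvariance


class TinyBatchNorm:
    """Hypothetical one-feature batch norm, for illustration only."""

    def __init__(self, use_global_stats=False, momentum=0.9, eps=1e-5):
        self.use_global_stats = use_global_stats
        self.momentum = momentum  # assumed default
        self.eps = eps            # assumed default
        self.running_mean = 0.0   # initial estimate of the mean
        self.running_var = 1.0    # initial estimate of the variance

    def __call__(self, batch, training):
        if training and not self.use_global_stats:
            # Normal training: normalize with batch statistics and
            # fold them into the running averages.
            mean = fmean(batch)
            var = pvariance(batch, mean)
            m = self.momentum
            self.running_mean = m * self.running_mean + (1 - m) * mean
            self.running_var = m * self.running_var + (1 - m) * var
        else:
            # Inference, or training with use_global_stats=True:
            # normalize with the stored running statistics, no update.
            mean, var = self.running_mean, self.running_var
        return [(x - mean) / (var + self.eps) ** 0.5 for x in batch]


batch = [10.0, 12.0, 14.0]
default_bn = TinyBatchNorm(use_global_stats=False)
frozen_bn = TinyBatchNorm(use_global_stats=True)

# "Train" both layers from scratch on the same batch.
for _ in range(200):
    default_bn(batch, training=True)
    frozen_bn(batch, training=True)

# The frozen layer never left its initial statistics (0.0 and 1.0),
# so at inference it passes the inputs through almost unchanged.
print(frozen_bn.running_mean, frozen_bn.running_var)
print(frozen_bn(batch, training=False))

# The default layer learned statistics close to those of the data.
print(default_bn.running_mean, default_bn.running_var)
print(default_bn(batch, training=False))
```

In this sketch the only case where use_global_stats=True gives sensible normalization is when the running statistics were filled in beforehand, which corresponds to loading pre-trained weights.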