Hi, I’m training resnet50v2 on FashionMNIST for 5 epochs with a batch size of 1024 per GPU. I’m seeing the following behavior:
- with num_workers=8, the average epoch takes 4.2s
- with num_workers=16, the average epoch takes 6.2s
Is there a principled way to pick the right num_workers? Should it be tuned as part of the hyperparameter search?
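
For reference, here is a rough sketch of how the data-loading time could be measured on its own for a few candidate num_workers values, separately from the training step. It is only an illustration: `time_loader`, `sweep_num_workers`, `fastest`, and the `make_loader` factory are hypothetical names, not part of any library, and the sketch assumes the loader is a plain iterable that yields batches.

```python
import time
from typing import Callable, Dict, Iterable, Sequence


def time_loader(make_loader: Callable[[int], Iterable],
                num_workers: int,
                max_batches: int = 50) -> float:
    """Seconds needed to pull up to max_batches batches from a fresh loader."""
    loader = make_loader(num_workers)  # hypothetical factory supplied by the caller
    start = time.perf_counter()
    for i, _batch in enumerate(loader):
        # Stop early so the sweep stays cheap; worker startup time is included.
        if i + 1 >= max_batches:
            break
    return time.perf_counter() - start


def sweep_num_workers(make_loader: Callable[[int], Iterable],
                      candidates: Sequence[int] = (0, 2, 4, 8, 16),
                      max_batches: int = 50) -> Dict[int, float]:
    """Map each candidate num_workers value to its measured loading time."""
    return {n: time_loader(make_loader, n, max_batches) for n in candidates}


def fastest(timings: Dict[int, float]) -> int:
    """Return the candidate with the smallest measured time."""
    return min(timings, key=timings.get)
```

With PyTorch, `make_loader` could be something like `lambda n: torch.utils.data.DataLoader(train_set, batch_size=1024, num_workers=n)`, where `train_set` stands in for the FashionMNIST dataset object. This only times loading, so it would not capture any interaction with the GPU step.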