I am interested to see where the runtime goes in my forward and backward pass. Specifically, I want to see how much time is spent on executing each symbol in my computation graph. Could someone please give me some hints on where I should put my
toc? Thank you.