The bias corrections always set v_t = (1)*g_t, what’s the point of this?
I don’t see why v_t is always g_t, can you explain?
The bias corrections always set v_t = (1)*g_t, what’s the point of this?
I don’t see why v_t is always g_t, can you explain?