Adam Algorithm

https://d2l.ai/chapter_optimization/adam.html

The bias corrections always set v_t = (1)*g_t, what’s the point of this?

I don’t see why v_t is always g_t, can you explain?