Gluon Implementation in Recurrent Neural Networks

https://en.diveintodeeplearning.org/chapter_recurrent-neural-networks/rnn-gluon.html

Hi @mli , I have a question about the hidden state of RNN with multiple hidden layers.

the hidden state is a list with the length of num_hidden_layers, however, I’m wondering that why the shape of its element is (num_hidden_layers, batch_size, num_hidden_units), instead of (batch_size, num_hidden_units).

Thanks!