I’m porting the Wave-U-Net from tensorflow to mxnet and I need to upsample the 3D audio features with bilinear interpolation. However, there’s no bilinear upsampling operator in mxnet for 1D audio data ( 3d feature map of shape [batch_size, channels, length]
).
First, I use mx.nd.contrib.BilinearResize2D
by reshaping the feature from [B, C, T]
to [B, C, T, 1]
. But the question is, the width and height of BilinearResize2D are in int16_t
and my T
is larger than 65535. Also the mx.init.Bilinear
does not support weight less than 3 dimensions.
How to implement BilinearResize1D in mxnet gluon?