The mx.symbol.sparse package does not support adding a Dropout layer on a sparse input. Well, technically, it does, but the input is being converted to a dense representation slowing down training considerably.
Is there a reason this is not supported? Any alternatives?