This is my first time using MXNet. The problem at hand is a bit challenging, and I have not been able to find any good examples. I need to create a model that can detect text in images. I have tried the tesseract package in R, but the results were not very good. Any help would be appreciated.
I have written a notebook that does OCR on text images with a CNN + biLSTM + CTC model in Gluon. You can check it out here: https://github.com/ThomasDelteil/Gluon_OCR_LSTM_CTC
It still needs some polishing before it can become an official tutorial, but it should get you started.
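To give a rough idea of what the CTC part does at prediction time: the network emits one score vector per time step, and a simple way to turn those into text is greedy (best-path) decoding, which takes the top class at each step, merges consecutive repeats, and drops the blank symbol. The sketch below is plain Python for illustration only and is not taken from the notebook. The function name `ctc_greedy_decode`, the toy alphabet, and the choice of index 0 as the blank are all assumptions; libraries differ on which index is the blank.

```python
from typing import List, Sequence

BLANK = 0  # assumption: class 0 is the CTC blank (some libraries use the last index)


def ctc_greedy_decode(scores: Sequence[Sequence[float]], alphabet: str) -> str:
    """Greedy (best-path) CTC decoding.

    scores:   one row per time step, one score per class (class 0 = blank).
    alphabet: characters for classes 1..len(alphabet).
    """
    # Step 1: pick the highest-scoring class at every time step.
    best_path = [max(range(len(row)), key=row.__getitem__) for row in scores]

    # Steps 2 and 3: skip consecutive repeats, and skip blanks.
    chars: List[str] = []
    previous = None
    for idx in best_path:
        if idx != previous and idx != BLANK:
            chars.append(alphabet[idx - 1])
        previous = idx
    return "".join(chars)


def one_hot(i: int, n: int = 3) -> List[float]:
    """Hypothetical helper: a score row where class i clearly wins."""
    return [1.0 if j == i else 0.0 for j in range(n)]


# Toy example: per-step winners are a, a, blank, a, b, b, blank.
scores = [one_hot(i) for i in [1, 1, 0, 1, 2, 2, 0]]
print(ctc_greedy_decode(scores, "ab"))  # aab
```

Note how the blank between the second and third step is what lets the decoder keep two separate "a" characters instead of merging them into one. Beam search decoding usually gives better results than this greedy version, but the greedy one is enough to sanity-check a model's output.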