This is my first time using MXNet, and the problem at hand is a bit challenging: I need to build a model that can detect text in images, but I have not been able to find any good examples. I have tried the tesseract package in R, but the results were not very good. Any help would be appreciated.