How to use my own images for a CNN with MXNet (ImageRecordIter)?

Exzelsios · April 16, 2018, 11:19am

Hello,

I’ve been stuck for the past few days and don’t really understand where exactly my mistake is.

I basically want to train a CNN with my own images in Python. I’ve been following this tutorial, except for the MNIST dataset.

My images have either a positive or negative label, and I’ve arranged the folders in that way as well. I’ve done as the MXNet IO API said and created three .lst files with im2rec.py (train/test/validation). With those I created the respective .rec and .idx files.

However, I still can’t figure out how to actually use them in my CNN. I’ve had several approaches so far, currently I’m using ImageRecordIter like this:

train_iter = ImageRecordIter(
path_imgrec = ‘./test_train.rec’,
data_shape = (3, 227, 227),
path_imglist = ‘./test_train.lst’,
path_imgidx = ‘./test_train.idx’,
batch_size = 3,
shuffle = True
)

Same for validation and test sets, of course. At first I got an Error at those lines:

for i, (data, label) in enumerate(train_data):
data = data.as_in_context(ctx)
label = label.as_in_context(ctx)

TypeError: ‘DataBatch’ object is not iterable

Error, which I fixed like proposed here.

for i, batch in enumerate(data_iterator):
data = batch.data[0].as_in_context(ctx)
label = batch.label[0].as_in_context(ctx)

Which allowed me to actually run the program, however this is the result I’m getting:

Epoch 0. Loss: 0.0, Train_acc nan, Test_acc nan
Epoch 1. Loss: 0.0, Train_acc nan, Test_acc nan
Epoch 2. Loss: 0.0, Train_acc nan, Test_acc nan
Epoch 3. Loss: 0.0, Train_acc nan, Test_acc nan
Epoch 4. Loss: 0.0, Train_acc nan, Test_acc nan

I do recognize that the error probably is that I’m not correctly iterating through the data, as

def evaluate_accuracy(data_iterator, net):

simply skips over the for loop. Do I have to reset the iterator during every epoch? However, I have no idea how to properly do it and have been stuck for the past few days because of this. I’ve also tried using ImageRecordDataset, RecordFileDataset and the DataLoader in combination with any of these, and couldn’t get it to work. Maybe I’ve even made a mistake at some other step that I don’t know yet. I’d really appreciate any help or pointers.

safrooze · April 19, 2018, 1:07am

You could consider using ImageFolderDataset() which is written for Gluon specifically. But using record files should work as well. Some things to check:

Do the record files actually contain anything (does the file size make sense)?
If you simply iterate through the training data, what does it contain?
Try showing the dataset as an image. Does it look OK?

wilhelm · February 1, 2023, 9:17pm

Did you fix this issue?
I have the same problem

Topic		Replies	Views
Simple network does not learn on own images Performance	0	587	January 5, 2018
Rotating images/arrays	3	805	November 28, 2018
Create a Dataset Using RecordIO Gluon	1	720	March 21, 2019
Predicting image class using CGAN_mnist_R example Discussion	1	348	October 17, 2018
Resnet18 is predicting incorrect label Discussion	1	943	April 21, 2018

How to use my own images for a CNN with MXNet (ImageRecordIter)?

Related Topics