How to add a L2 Penalty function in mxnet's LR model?

janelu9 · June 29, 2018, 6:51am

I cont use gluon’s LR for I have a large data which must be loaded by dataiter of mxnet, but i don’t know if it has a L2 or L1 penalty function , how to deal with it?

ThomasDelteil · July 3, 2018, 4:59pm

@janelu9, can you precise why you cannot use Gluon for loading your data?
Are you aware of the Dataset and DataLoader classes that might help you with that? DataLoader allows the use of multiple workers for asynchronously pre-fetching data effectively.

You can use the weight decay wd parameter of the trainer, this wd parameter is accepted by all optimizers. In most cases you can see it as L2 regularization, and it is precisely true (with a factor 2) when using SGD. More details here: https://bbabenko.github.io/weight-decay/

trainer = gluon.Trainer(net.collect_params(), 'sgd', 
                        {'learning_rate': LEARNING_RATE,
                         'wd':WDECAY,
                         'momentum':MOMENTUM})

Topic		Replies	Views
There are some question during the training process Discussion	1	460	June 1, 2018
Load checkpoint and train Gluon	1	1275	July 19, 2019
Loading sparse data into gluon's DataLoader? Gluon	2	518	December 1, 2019
Gluon: Per-layer learning rate for fine tuning a pretrained network	1	1011	November 27, 2018
Lower accuracy on Cifar10 with multi-gpu implementation	5	599	August 23, 2018

How to add a L2 Penalty function in mxnet's LR model?

Related Topics