Bert Transfer Learning

w_a_r_b_e · October 23, 2019, 5:33pm

Looking for an mxnet implementation of a BERT based transfer learning sample (preferably on multi-gpu), where the end layer is customized for a specific use case. I am interested in using the dataset I have, that contains 10 different classes based on topic/ theme.

TristonC · October 23, 2019, 5:53pm

Do you mean fine-tuning?

w_a_r_b_e · October 23, 2019, 6:12pm

Yes, fine-tuning it for a custom application. Unfortunately, did not find a good sample for mxnet framework.

TristonC · October 23, 2019, 6:14pm

Have you checked the gluon-nlp site? The SQuad example may be a example for you. But I don’t think it uses multiple GPU.

ThomasDelteil · October 23, 2019, 6:26pm

@w_a_r_b_e, you can find:

a tutorial on gluon-nlp for fine-tuning for sentence pair classification: http://gluon-nlp.mxnet.io/examples/sentence_embedding/bert.html
a list of scripts for different type of fine-tuning: https://github.com/dmlc/gluon-nlp/tree/master/scripts/bert
a tutorial I wrote on fine-tuning BERT for sentiment analysis:
- See: https://nbviewer.jupyter.org/gist/ThomasDelteil/029e3995560b8de877cfaf0afc7dc9e9
- Download: https://gist.github.com/ThomasDelteil/029e3995560b8de877cfaf0afc7dc9e9#file-bert_sentiment-ipynb

w_a_r_b_e · October 23, 2019, 11:14pm

Super helpful. Thank you!

Topic		Replies	Views
Documentation Request: Model Parallelism Tutorial Performance	6	1841	March 10, 2018
Evaluate accuracy on multi GPU machine Gluon	5	1403	October 10, 2018
How to train a model written in mxnet/gluon on multiple workstations？ Gluon	1	430	September 19, 2018
Single-machine multi-GPU training, time is not speeding up Gluon	5	2162	November 16, 2018
Best practices for prediction on a machine with multiple GPUs	3	1190	November 8, 2017

Bert Transfer Learning

Related Topics