This repository has been archived by the owner on Jan 15, 2024. It is now read-only.

[Enhancement] Mixed precision support for BERT finetuning #793

Merged
merged 9 commits into from
Jun 27, 2019

Conversation

eric-haibin-lin (Member)

Description

  • clean up and update documentation
  • add a dtype option to the finetuning script, using AMP (a minimal sketch of the AMP setup follows this list)
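
For reference, a minimal sketch of how MXNet's `mxnet.contrib.amp` API is typically wired into a Gluon training loop. This is illustrative only; the model, optimizer settings, and script structure below are placeholders and may differ from the actual finetuning script changed in this PR.

```python
import mxnet as mx
from mxnet import autograd, gluon
from mxnet.contrib import amp

amp.init()  # patch operators for mixed precision; call before building the model

net = gluon.nn.Dense(2)  # stand-in for the BERT classification model
net.initialize(ctx=mx.gpu(0))
trainer = gluon.Trainer(net.collect_params(), 'adam', {'learning_rate': 2e-5})
amp.init_trainer(trainer)  # enable dynamic loss scaling on the trainer

loss_fn = gluon.loss.SoftmaxCrossEntropyLoss()
data = mx.nd.random.uniform(shape=(8, 16), ctx=mx.gpu(0))
label = mx.nd.zeros((8,), ctx=mx.gpu(0))

with autograd.record():
    loss = loss_fn(net(data), label)
    # scale the loss so fp16 gradients do not underflow, then backprop
    with amp.scale_loss(loss, trainer) as scaled_loss:
        autograd.backward(scaled_loss)
trainer.step(data.shape[0])
```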

Checklist

Essentials

  • PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage
  • Code is well-documented

Changes

  • Feature1, tests, (and when applicable, API doc)
  • Feature2, tests, (and when applicable, API doc)

Comments

  • If this change is a backward incompatible change, why must this change be made.
  • Interesting edge cases to note here

@eric-haibin-lin (Member, Author)

@ptrendx FYI

@mli (Member) commented Jun 24, 2019

Job PR-793/1 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-793/1/index.html

@codecov bot commented Jun 25, 2019

Codecov Report

Merging #793 into master will decrease coverage by 0.01%.
The diff coverage is 77.77%.


@@            Coverage Diff             @@
##           master     #793      +/-   ##
==========================================
- Coverage   90.61%   90.59%   -0.02%     
==========================================
  Files          64       64              
  Lines        6295     6303       +8     
==========================================
+ Hits         5704     5710       +6     
- Misses        591      593       +2
Impacted Files                          Coverage Δ
src/gluonnlp/model/attention_cell.py    94.8% <77.77%> (-1.09%) ⬇️

@mli (Member) commented Jun 25, 2019

Job PR-793/2 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-793/2/index.html

@mli (Member) commented Jun 25, 2019

Job PR-793/3 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-793/3/index.html

@mli (Member) commented Jun 26, 2019

Job PR-793/4 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-793/4/index.html

@mli (Member) commented Jun 26, 2019

Job PR-793/5 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-793/5/index.html

@eric-haibin-lin added the release focus label on Jun 26, 2019
@eric-haibin-lin merged commit 7f20127 into dmlc:master on Jun 27, 2019
@eric-haibin-lin (Member, Author)

@ptrendx I think there are multiple places where the range of a scalar passed to an op (an eps, or a large negative value like this one) may be too large or too small for fp16. Is there a better way to fix/truncate them in AMP instead of in user code?
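
For context, a small NumPy illustration of the fp16 range issue being described (the specific constants used in attention_cell.py or the finetuning script may differ):

```python
import numpy as np

print(np.finfo(np.float16).max)   # 65504.0 -- largest finite fp16 value
print(np.finfo(np.float16).tiny)  # ~6.1e-05 -- smallest normal fp16 value

# A large negative scalar (e.g. an additive attention-mask value) overflows:
print(np.float16(-1e18))          # -inf
# A tiny eps chosen for fp32 underflows to zero:
print(np.float16(1e-12))          # 0.0
```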

@eric-haibin-lin deleted the fp16-finetune branch on February 2, 2020, 06:23