[SCRIPT][API] Add RoBERTa fine-tuning scripts, add BERTClassifier to API #931

eric-haibin-lin · 2019-09-16T06:43:51Z

Description

add roberta argument options to the finetune script
move BERTClassifier to the official API

@hhexiy

Checklist

Essentials

PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented

Changes

Feature1, tests, (and when applicable, API doc)
Feature2, tests, (and when applicable, API doc)

Comments

If this change is a backward incompatible change, why must this change be made.
Interesting edge cases to note here

This reverts commit 67139eb.

codecov · 2019-09-16T06:43:53Z

Codecov Report

Merging #931 into master will increase coverage by 0.8%.
The diff coverage is 36.36%.

@@           Coverage Diff            @@
##           master    #931     +/-   ##
========================================
+ Coverage   88.89%   89.7%   +0.8%     
========================================
  Files          67      67             
  Lines        6360    6408     +48     
========================================
+ Hits         5654    5748     +94     
+ Misses        706     660     -46

Impacted Files	Coverage Δ
src/gluonnlp/model/bert.py	`84.95% <19.51%> (-14.51%)`	⬇️
src/gluonnlp/data/transforms.py	`76.92% <85.71%> (-4.68%)`	⬇️
src/gluonnlp/model/parameter.py	`92% <0%> (-8%)`	⬇️
src/gluonnlp/data/corpora/wikitext.py	`94.82% <0%> (-5.18%)`	⬇️
src/gluonnlp/data/batchify/batchify.py	`93.18% <0%> (-3.41%)`	⬇️
src/gluonnlp/data/dataset.py	`97.61% <0%> (-1.59%)`	⬇️
src/gluonnlp/data/word_embedding_evaluation.py	`96.21% <0%> (-0.76%)`	⬇️
src/gluonnlp/vocab/subwords.py	`86.95% <0%> (+2.17%)`	⬆️
src/gluonnlp/model/sequence_sampler.py	`91.63% <0%> (+17.07%)`	⬆️
... and 1 more

mli · 2019-09-16T07:17:15Z

Job PR-931/1 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/1/index.html

mli · 2019-09-16T17:29:41Z

Job PR-931/2 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/2/index.html

mli · 2019-09-16T18:15:50Z

Job PR-931/4 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/4/index.html

scripts/bert/finetune_classifier.py

kaonashi-tyc · 2019-09-19T07:43:16Z

RoBERTa model has a slightly different Classifier structure by default (assuming the fairseq as the official implementation):

https://github.com/pytorch/fairseq/blob/718677ebb044e27aaf1a30640c2f7ab6b8fa8509/fairseq/models/roberta/model.py#L218-L235

Might deserve its own Classifier of sort

mli · 2019-09-19T22:42:14Z

Job PR-931/9 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/9/index.html

mli · 2019-09-19T23:55:29Z

Job PR-931/10 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/10/index.html

mli · 2019-09-20T05:18:53Z

Job PR-931/11 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/11/index.html

mli · 2019-09-20T06:02:33Z

Job PR-931/12 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/12/index.html

szhengac · 2019-09-20T07:00:42Z

What is this RoBERT? Any reference?

mli · 2019-09-20T07:15:43Z

Job PR-931/13 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/13/index.html

eric-haibin-lin · 2019-09-20T21:23:54Z

https://arxiv.org/abs/1907.11692

mli · 2019-09-22T22:17:42Z

Job PR-931/15 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/15/index.html

mli · 2019-09-23T06:12:43Z

Job PR-931/16 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/16/index.html

src/gluonnlp/data/transforms.py

leezu · 2019-09-23T08:33:33Z

src/gluonnlp/data/transforms.py

@@ -1221,17 +1221,29 @@ class BERTSentenceTransform:
        Tokenizer for the sentences.
    max_seq_length : int.
        Maximum sequence length of the sentences.
+    vocab : Vocab or BERTVocab
+        The vocabulary.


It is not clear that different vocabularies are required/handled for different BERT style models. Let's document that cls_token, sep_token is used if available and otherwise fallback to bos_token, eos_token.

To formally specify the expected attributes ( cls_token, sep_token, etc.) one could (eventually) use Structural subtyping https://mypy.readthedocs.io/en/latest/protocols.html

src/gluonnlp/model/bert.py

kaonashi-tyc

Nit

mli · 2019-09-24T01:57:23Z

Job PR-931/17 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-931/17/index.html

Ubuntu added 5 commits September 16, 2019 04:00

update script

67139eb

add roberta finetune

cc193f1

Revert "update script"

5ec376b

This reverts commit 67139eb.

add early stop

0eeedfc

fix bug

41f9c6a

eric-haibin-lin added 2 commits September 16, 2019 09:56

Update finetune_classifier.py

5491e12

Update bert.py

8559cb6

fix lint

325b1bd

hhexiy reviewed Sep 17, 2019

View reviewed changes

scripts/bert/finetune_classifier.py Outdated Show resolved Hide resolved

Ubuntu added 3 commits September 19, 2019 18:04

fix typo

3429eb1

resolve conflcit

25f50dd

move classifier to API

dc3bb85

eric-haibin-lin requested a review from a team as a code owner September 19, 2019 19:47

Ubuntu added 4 commits September 19, 2019 20:01

merge

3f9e2d4

fix lint

a56960d

add patience for early stopping

8c79358

update test

dfe9e2d

eric-haibin-lin changed the title ~~[SCRIPT] Add RoBERTa fine-tuning scripts~~ [SCRIPT][API] Add RoBERTa fine-tuning scripts, add BERTClassifier to API Sep 19, 2019

Update bert.md

4935626

Update finetune_classifier.py

db2ab06

Ubuntu added 3 commits September 20, 2019 04:46

fix bugs

69d05c6

fix transform

61cf309

fix bug

3d01135

Update test_scripts.py

665faed

eric-haibin-lin added 2 commits September 20, 2019 14:24

Merge branch 'master' into rob-finetune

2c2c4f3

Merge remote-tracking branch 'upstream/master' into rob-finetune

01f38a2

Ubuntu added 2 commits September 23, 2019 05:39

add roberta result

bd502ce

Merge remote-tracking branch 'haibin/rob-finetune' into rob-finetune

e8de695

leezu reviewed Sep 23, 2019

View reviewed changes

kaonashi-tyc reviewed Sep 23, 2019

View reviewed changes

src/gluonnlp/model/bert.py Show resolved Hide resolved

kaonashi-tyc reviewed Sep 23, 2019

View reviewed changes

address comments

cf60091

leezu approved these changes Sep 24, 2019

View reviewed changes

eric-haibin-lin merged commit d63abb8 into dmlc:master Sep 24, 2019

eric-haibin-lin deleted the rob-finetune branch February 2, 2020 06:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SCRIPT][API] Add RoBERTa fine-tuning scripts, add BERTClassifier to API #931

[SCRIPT][API] Add RoBERTa fine-tuning scripts, add BERTClassifier to API #931

eric-haibin-lin commented Sep 16, 2019 •

edited

Loading

codecov bot commented Sep 16, 2019 •

edited

Loading

mli commented Sep 16, 2019

mli commented Sep 16, 2019

mli commented Sep 16, 2019

kaonashi-tyc commented Sep 19, 2019 •

edited

Loading

mli commented Sep 19, 2019

mli commented Sep 19, 2019

mli commented Sep 20, 2019

mli commented Sep 20, 2019

szhengac commented Sep 20, 2019

mli commented Sep 20, 2019

eric-haibin-lin commented Sep 20, 2019

mli commented Sep 22, 2019

mli commented Sep 23, 2019

leezu Sep 23, 2019

kaonashi-tyc left a comment

mli commented Sep 24, 2019

[SCRIPT][API] Add RoBERTa fine-tuning scripts, add BERTClassifier to API #931

[SCRIPT][API] Add RoBERTa fine-tuning scripts, add BERTClassifier to API #931

Conversation

eric-haibin-lin commented Sep 16, 2019 • edited Loading

Description

Checklist

Essentials

Changes

Comments

codecov bot commented Sep 16, 2019 • edited Loading

Codecov Report

mli commented Sep 16, 2019

mli commented Sep 16, 2019

mli commented Sep 16, 2019

kaonashi-tyc commented Sep 19, 2019 • edited Loading

mli commented Sep 19, 2019

mli commented Sep 19, 2019

mli commented Sep 20, 2019

mli commented Sep 20, 2019

szhengac commented Sep 20, 2019

mli commented Sep 20, 2019

eric-haibin-lin commented Sep 20, 2019

mli commented Sep 22, 2019

mli commented Sep 23, 2019

leezu Sep 23, 2019

Choose a reason for hiding this comment

kaonashi-tyc left a comment

Choose a reason for hiding this comment

mli commented Sep 24, 2019

eric-haibin-lin commented Sep 16, 2019 •

edited

Loading

codecov bot commented Sep 16, 2019 •

edited

Loading

kaonashi-tyc commented Sep 19, 2019 •

edited

Loading