
[SCRIPT] XLNet squad finetuning scripts #1130

Merged
84 commits merged into dmlc:master on Feb 3, 2020

Conversation

zburning
Contributor

Description

Add XLNet SQuAD finetuning scripts (a usage sketch follows below).
Add corresponding results.
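
For orientation, here is a minimal sketch of how the new script loads the pretrained XLNet backbone, based on the get_model call visible in the CI traceback later in this thread (scripts/language_model/run_squad.py calling model.get_model). The import path, keyword arguments, and context handling shown here are illustrative assumptions, not the script's exact interface.

import mxnet as mx
# 'model' refers to scripts/language_model/transformer/model.py; the import path is
# assumed relative to the scripts/language_model directory, as used by run_squad.py.
from transformer import model

# Pick a context defensively; the real script derives this from its CLI arguments.
ctx = mx.gpu(0) if mx.context.num_gpus() > 0 else mx.cpu()

# get_model returns the XLNet backbone, its vocabulary, and a tokenizer
# (as seen in the traceback below); 'pretrained' and 'ctx' kwargs are assumptions.
xlnet_base, vocab, tokenizer = model.get_model(
    name='xlnet_cased_l12_h768_a12',
    pretrained=True,
    ctx=ctx)

# The tokenizer and vocab are then used to turn SQuAD examples into model inputs
# before finetuning xlnet_base with a span-prediction head.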

Checklist

Essentials

  • PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage
  • Code is well-documented

Changes

  • XLNet SQuAD finetuning script (scripts/language_model/run_squad.py)
  • Corresponding SQuAD finetuning results


@zburning zburning requested a review from a team as a code owner January 30, 2020 05:08
@codecov

codecov bot commented Jan 30, 2020

Codecov Report

Merging #1130 into master will increase coverage by 0.09%.
The diff coverage is 92.3%.


@@            Coverage Diff             @@
##           master    #1130      +/-   ##
==========================================
+ Coverage   88.29%   88.39%   +0.09%     
==========================================
  Files          67       71       +4     
  Lines        6316     6703     +387     
==========================================
+ Hits         5577     5925     +348     
- Misses        739      778      +39
Impacted Files                                   Coverage Δ
src/gluonnlp/data/transforms.py                  83.05% <100%> (ø) ⬆️
src/gluonnlp/data/question_answering.py          100% <100%> (ø) ⬆️
src/gluonnlp/model/bert.py                       92.65% <100%> (ø) ⬆️
src/gluonnlp/data/utils.py                       85.79% <89.58%> (ø) ⬆️
src/gluonnlp/model/train/cache.py                97.67% <0%> (ø) ⬆️
src/gluonnlp/embedding/evaluation.py             95.79% <0%> (ø) ⬆️
src/gluonnlp/data/batchify/language_model.py     96.26% <0%> (ø) ⬆️
src/gluonnlp/model/translation.py                71.87% <0%> (ø) ⬆️
src/gluonnlp/model/train/language_model.py       88.51% <0%> (ø) ⬆️
src/gluonnlp/model/language_model.py             98.49% <0%> (ø) ⬆️
... and 21 more

@mli
Member

mli commented Jan 30, 2020

Job PR-1130/3 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/3/index.html

@eric-haibin-lin
Member

@leezu what do you think of adding 'XLNet' to the gluonnlp.model API, and having a folder for SQuAD finetuning shared by BERT and XLNet? We don't necessarily need to expose all intermediate block APIs (e.g. the attention cell).
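
For concreteness, one possible shape of that proposal, mirroring how BERT models are already exposed through gluonnlp.model.get_model. The XLNet call below is hypothetical; it does not exist in this PR, which keeps XLNet under scripts/ instead.

import mxnet as mx
import gluonnlp as nlp

# Existing gluonnlp.model usage for BERT:
bert, bert_vocab = nlp.model.get_model(
    'bert_12_768_12',
    dataset_name='book_corpus_wiki_en_uncased',
    pretrained=True, ctx=mx.cpu())

# Hypothetical equivalent if 'XLNet' were registered in gluonnlp.model
# (name and return values assumed to follow the scripts/ implementation):
# xlnet, xlnet_vocab, tokenizer = nlp.model.get_model(
#     'xlnet_cased_l12_h768_a12', pretrained=True, ctx=mx.cpu())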

@mli
Member

mli commented Jan 31, 2020

Job PR-1130/4 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/4/index.html

@mli
Member

mli commented Jan 31, 2020

Job PR-1130/5 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/5/index.html

@mli
Member

mli commented Feb 1, 2020

Job PR-1130/6 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/6/index.html

@mli
Member

mli commented Feb 2, 2020

Job PR-1130/7 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/7/index.html

@mli
Member

mli commented Feb 2, 2020

Job PR-1130/8 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/8/index.html

@mli
Member

mli commented Feb 2, 2020

Job PR-1130/9 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/9/index.html

@mli
Member

mli commented Feb 2, 2020

Job PR-1130/10 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/10/index.html

@leezu
Contributor

leezu commented Feb 3, 2020

There seems to be a bug

Traceback (most recent call last):
  File "./scripts/language_model/run_squad.py", line 200, in <module>
    xlnet_base, vocab, tokenizer = model.get_model(**get_model_params)
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/scripts/language_model/transformer/model.py", line 62, in get_model
    return models[name](**kwargs)
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/scripts/language_model/transformer/model.py", line 193, in xlnet_cased_l12_h768_a12
    ignore_extra=not kwargs.get('use_decoder', True))
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/src/gluonnlp/model/utils.py", line 281, in _load_pretrained_params
    net.load_parameters(model_file, ctx=ctx, ignore_extra=ignore_extra, allow_missing=allow_missing)
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/conda/gpu/py3/lib/python3.5/site-packages/mxnet/gluon/block.py", line 555, in load_parameters
    params[name]._load_init(loaded[name], ctx, cast_dtype=cast_dtype, dtype_source=dtype_source)
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/conda/gpu/py3/lib/python3.5/site-packages/mxnet/gluon/parameter.py", line 310, in _load_init
    self._init_impl(data, ctx)
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/conda/gpu/py3/lib/python3.5/site-packages/mxnet/gluon/parameter.py", line 359, in _init_impl
    self._data = [data.copyto(ctx) for ctx in self._ctx_list]
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/conda/gpu/py3/lib/python3.5/site-packages/mxnet/gluon/parameter.py", line 359, in <listcomp>
    self._data = [data.copyto(ctx) for ctx in self._ctx_list]
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/conda/gpu/py3/lib/python3.5/site-packages/mxnet/ndarray/ndarray.py", line 2632, in copyto
    return _internal._copyto(self, out=hret)
  File "<string>", line 27, in _copyto
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/conda/gpu/py3/lib/python3.5/site-packages/mxnet/_ctypes/ndarray.py", line 107, in _imperative_invoke
    ctypes.byref(out_stypes)))
  File "/var/lib/jenkins/workspace/gluon-nlp-gpu-py3/conda/gpu/py3/lib/python3.5/site-packages/mxnet/base.py", line 255, in check_call
    raise MXNetError(py_str(_LIB.MXGetLastError()))
mxnet.base.MXNetError: [20:03:01] src/engine/threaded_engine.cc:333: Check failed: exec_ctx.dev_id < device_count_ (1 vs. 1) : Invalid GPU Id: 1, Valid device id should be less than device_count: 1
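
The check fails because a parameter load is directed at mx.gpu(1) on a CI machine that exposes only one GPU. This is not the fix adopted in this PR, just a sketch of the usual guard for this failure mode; the safe_contexts helper below is hypothetical.

import mxnet as mx

def safe_contexts(requested_gpu_ids):
    """Keep only requested GPU ids that actually exist; fall back to CPU."""
    available = mx.context.num_gpus()
    ctxs = [mx.gpu(i) for i in requested_gpu_ids if i < available]
    return ctxs if ctxs else [mx.cpu()]

# e.g. a --gpu 0,1 style option running on a single-GPU machine:
ctx = safe_contexts([0, 1])   # -> [gpu(0)] rather than failing on gpu(1)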

@mli
Member

mli commented Feb 3, 2020

Job PR-1130/11 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-1130/11/index.html

@leezu leezu merged commit 1788c35 into dmlc:master Feb 3, 2020