[SCRIPT] - Add static BERT base export script (for use with MXNet Module API) #672

gigasquid · 2019-04-20T23:31:34Z

Description

This will export the base BERT model for use with the MXNet Module API.

It was adapted from the static_export_squad.py

Use cases can include fine tuning for Clojure and Scala APIs.

Have used it successfully to reproduce https://gluon-nlp.mxnet.io/examples/sentence_embedding/bert.html in Clojure - PR in MXNet will be coming shortly

Checklist

Essentials

PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented

Comments

Python is not my usual language - so feedback is welcome :)

codecov · 2019-04-20T23:42:56Z

Codecov Report

Merging #672 into master will increase coverage by 0.1%.
The diff coverage is n/a.

@@            Coverage Diff            @@
##           master     #672     +/-   ##
=========================================
+ Coverage   90.94%   91.04%   +0.1%     
=========================================
  Files          64       64             
  Lines        5887     5887             
=========================================
+ Hits         5354     5360      +6     
+ Misses        533      527      -6

Impacted Files	Coverage Δ
src/gluonnlp/data/dataloader.py	`88.79% <0%> (+5.17%)`	⬆️

gigasquid · 2019-04-21T00:20:04Z

Nice bot 🤖

mli · 2019-04-21T00:37:23Z

Job PR-672/1 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/1/index.html

szha

Thanks for the contribution, @gigasquid (and glad to see you here!). Do you intend to use such model for feature extraction with mxnet module?

gigasquid · 2019-04-22T00:35:31Z

@szha The intent is to use of BERT to more than just inference in the Clojure/Scala MXNet apis. Initially, I'm interested in the fine tuning tasks like classification, but feature extraction would be cool too. Basically everything in this http://jalammar.github.io/illustrated-bert/

Thanks so much for the static export feature! It opens up a whole new BERT world for the JVM MXNet langs 💯

@haven-jeon - Good point on the parameters. I'll double check to see if they are all used and remove the ones that are not.

gigasquid · 2019-04-25T23:55:09Z

I was able to put together a walkthrough Clojure jupyter notebook and then export it to markdown 😸
https://github.com/gigasquid/incubator-mxnet/blob/new-bert-example-with-finetuning/contrib/clojure-package/examples/bert/fine-tune-bert.md

The PR for MXNet is here apache/mxnet#14769

I'll plan on spending some time tomorrow to double check this script and the params.

gigasquid · 2019-04-26T18:04:19Z

@haven-jeon - I cleaned up some unneeded args and checked the rest.
Tested with python static_export_base.py --seq_length 128

Please take another look when you have a chance.

mli · 2019-04-30T20:55:37Z

Job PR-672/5 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/5/index.html

scripts/bert/staticbert/static_export_base.py

szha · 2019-05-06T00:19:20Z

@gigasquid we recently upgraded the CI setup. It will work once you rebase the PR to latest master.

mli · 2019-05-06T17:43:27Z

Job PR-672/1 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/1/index.html

mli · 2019-05-06T18:06:07Z

Job PR-672/3 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/3/index.html

add docs add test adjust params

mli · 2019-05-06T18:12:40Z

Job PR-672/4 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/4/index.html

mli · 2019-05-06T18:27:30Z

Job PR-672/5 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/5/index.html

mli · 2019-05-06T19:05:56Z

Job PR-672/7 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/7/index.html

mli · 2019-05-06T19:16:30Z

Job PR-672/8 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/8/index.html

mli · 2019-05-06T19:50:49Z

Job PR-672/10 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-672/10/index.html

gigasquid · 2019-05-07T00:10:37Z

tests & docs adds and CI is green 💚 - please take another look when you get a chance

eric-haibin-lin

nice work!

…ule API) (#672) * Add static BERT base export (for using with MXNet Module API) add docs add test adjust params * remove unused out variable * add test and tweak doc

…ule API) (dmlc#672) * Add static BERT base export (for using with MXNet Module API) add docs add test adjust params * remove unused out variable * add test and tweak doc

gigasquid requested a review from szha as a code owner April 20, 2019 23:31

szha reviewed Apr 21, 2019

View reviewed changes

szha approved these changes Apr 23, 2019

View reviewed changes

szha requested a review from haven-jeon April 23, 2019 18:57

gigasquid mentioned this pull request Apr 26, 2019

[Clojure] Add Fine Tuning Sentence Pair Classification BERT Example apache/mxnet#14769

Merged

4 tasks

eric-haibin-lin reviewed May 6, 2019

View reviewed changes

scripts/bert/staticbert/static_export_base.py Show resolved Hide resolved

gigasquid force-pushed the static-export-bert-base branch 2 times, most recently from 48bdfc3 to 965e799 Compare May 6, 2019 17:10

gigasquid force-pushed the static-export-bert-base branch from 4538d52 to ff8a1f4 Compare May 6, 2019 18:02

Add static BERT base export (for using with MXNet Module API)

54dfc1d

add docs add test adjust params

gigasquid force-pushed the static-export-bert-base branch from ff8a1f4 to 54dfc1d Compare May 6, 2019 18:08

remove unused out variable

fdc502b

add test and tweak doc

e50f320

gigasquid force-pushed the static-export-bert-base branch from 57d393a to e50f320 Compare May 6, 2019 18:55

eric-haibin-lin approved these changes May 7, 2019

View reviewed changes

eric-haibin-lin merged commit 5216840 into dmlc:master May 7, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SCRIPT] - Add static BERT base export script (for use with MXNet Module API) #672

[SCRIPT] - Add static BERT base export script (for use with MXNet Module API) #672

gigasquid commented Apr 20, 2019

codecov bot commented Apr 20, 2019 •

edited

Loading

gigasquid commented Apr 21, 2019

mli commented Apr 21, 2019

szha left a comment

gigasquid commented Apr 22, 2019

gigasquid commented Apr 25, 2019 •

edited

Loading

gigasquid commented Apr 26, 2019

mli commented Apr 30, 2019

szha commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

gigasquid commented May 7, 2019

eric-haibin-lin left a comment

[SCRIPT] - Add static BERT base export script (for use with MXNet Module API) #672

[SCRIPT] - Add static BERT base export script (for use with MXNet Module API) #672

Conversation

gigasquid commented Apr 20, 2019

Description

Checklist

Essentials

Comments

codecov bot commented Apr 20, 2019 • edited Loading

Codecov Report

gigasquid commented Apr 21, 2019

mli commented Apr 21, 2019

szha left a comment

Choose a reason for hiding this comment

gigasquid commented Apr 22, 2019

gigasquid commented Apr 25, 2019 • edited Loading

gigasquid commented Apr 26, 2019

mli commented Apr 30, 2019

szha commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

mli commented May 6, 2019

gigasquid commented May 7, 2019

eric-haibin-lin left a comment

Choose a reason for hiding this comment

codecov bot commented Apr 20, 2019 •

edited

Loading

gigasquid commented Apr 25, 2019 •

edited

Loading