
Example how to pretrain lm + introduction of config_name #57

Open · wants to merge 3 commits into base: master

Conversation

PiotrCzapla
Member

I've added the ability to limit the training set so we can use a test configuration `multifit_mini_test` that executes in ~20 seconds to verify that the scripts are working.
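The idea of capping the training set for a fast smoke test can be sketched as follows (a minimal illustration with a hypothetical helper name; the PR's actual implementation may differ):

```python
# Hypothetical sketch: a "mini test" config caps the number of training
# examples so the whole pipeline runs in seconds instead of hours.
def limit_dataset(texts, limit=None):
    """Return at most `limit` examples; `None` means use the full set."""
    return texts if limit is None else texts[:limit]
```

A smoke-test config would pass a small `limit`, while the full training config leaves it as `None`.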

Why config_name?

I've added it so we know which training parameters to load for the finetune-lm and classifier steps. These parameters aren't stored along with the language model; only the parameters used to build the model are saved.
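The mechanism can be sketched as a registry of named training configs (hypothetical names and values below; only `multifit_paper_version` and `multifit_mini_test` appear in this PR, and the actual parameter values are assumptions):

```python
# Hypothetical sketch: training parameters keyed by config_name, so the
# finetune-lm and classifier stages can recover settings that are NOT
# saved alongside the pretrained language model itself.
CONFIGS = {
    "multifit_paper_version": {"bs": 64, "bptt": 70, "limit": None},
    "multifit_mini_test": {"bs": 4, "bptt": 10, "limit": 100},  # fast smoke test
}

def load_train_config(config_name):
    """Look up training parameters by config_name."""
    return CONFIGS[config_name]
```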

```diff
@@ -280,7 +288,7 @@ def train_(self, dataset_or_path, tokenizer=None, **train_config):
         print("Language model saved to", self.experiment_path)

     def validate(self):
-        raise NotImplementedError("The validation on the language model is not implemented.")
+        return "not implemented"
```
Collaborator

Do we really just want to return a string here?
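For context, the two behaviors under discussion can be sketched side by side (standalone toy classes, not the project's actual class):

```python
class ReturnsSentinel:  # hypothetical stand-in for the class in the diff
    def validate(self):
        # As in the diff: returns a sentinel string the caller must check,
        # which can silently be mistaken for a real result.
        return "not implemented"

class FailsLoudly:  # hypothetical alternative
    def validate(self):
        # Raising makes any accidental call fail immediately and visibly.
        raise NotImplementedError(
            "Validation on the language model is not implemented.")
```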

From the command line:
```
$ bash prepare_wiki.sh de
$ python -W ignore -m multifit new multifit_paper_version replace_ --name my_lm - train_ --pretrain-dataset data/wiki/de-100
```
Collaborator

@sebastianruder sebastianruder Nov 17, 2019


Looks like there's a superfluous space between - and train-. Why do we use train_ here? What is the difference between train_ and train?

Collaborator

@sebastianruder sebastianruder left a comment


Hi Piotr, thanks for adding this. Looks good in general. I've added a few comments about minor things. In general, do you think it'd be possible to add a few short docstrings to explain things like bs, bptt, limit in load_lm_databunch for people not familiar with the library?
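As a concrete illustration of the requested docstrings, a sketch assuming the conventional AWD-LSTM meanings of these parameters (the signature and defaults below are assumptions, not the project's actual code):

```python
def load_lm_databunch(path, bs=64, bptt=70, limit=None):
    """Load a language-model databunch (hypothetical signature).

    Args:
        path: directory containing the preprocessed corpus.
        bs: batch size, i.e. how many sequences are processed in parallel.
        bptt: back-propagation-through-time window, i.e. how many tokens
            the model sees per step before the gradient is truncated.
        limit: if set, keep only the first `limit` examples; used by the
            `multifit_mini_test` config to make a fast smoke test.
    """
    ...  # actual loading logic lives in the library
```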
