
Add fine-tune presets to ModelTrainer #2439

Merged 4 commits into master from fine-tune on Sep 16, 2021
Conversation

alanakbik
Collaborator

This is a first step in refactoring the ModelTrainer to make fine-tuning more convenient. Essentially, it adds a fine_tune routine that sets the default parameters typically used for fine-tuning (AdamW optimizer, small learning rate, few epochs, cyclic learning rate scheduling, etc.).
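
As a sketch of the preset idea (fine_tune is the real method name, but the exact parameter names and default values below are assumptions inferred from this description, not copied from the diff), the routine essentially forwards to the regular training loop with fine-tuning defaults filled in:

from torch.optim.lr_scheduler import OneCycleLR

# hypothetical sketch only: defaults are assumptions based on the description above
def fine_tune(trainer, base_path, **kwargs):
    # fill in fine-tuning presets unless the caller overrides them
    kwargs.setdefault('learning_rate', 5.0e-5)            # small learning rate
    kwargs.setdefault('max_epochs', 10)                   # few epochs
    kwargs.setdefault('scheduler', OneCycleLR)            # cyclic learning rate scheduling
    kwargs.setdefault('embeddings_storage_mode', 'none')
    kwargs.setdefault('weight_decay', 0.)
    # the AdamW optimizer preset would be applied analogously (assumed)
    return trainer.train(base_path, **kwargs)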

New syntax with the fine_tune method:

from flair.data import Corpus
from flair.datasets import TREC_6
from flair.embeddings import TransformerDocumentEmbeddings
from flair.models import TextClassifier
from flair.trainers import ModelTrainer

# 1. get the corpus
corpus: Corpus = TREC_6()

# 2. what label do we want to predict?
label_type = 'question_class'

# 3. create the label dictionary
label_dict = corpus.make_label_dictionary(label_type=label_type)

# 4. initialize transformer document embeddings (many models are available)
document_embeddings = TransformerDocumentEmbeddings('distilbert-base-uncased', fine_tune=True)

# 5. create the text classifier
classifier = TextClassifier(document_embeddings, label_dictionary=label_dict, label_type=label_type)

# 6. initialize trainer
trainer = ModelTrainer(classifier, corpus)

# 7. run training with fine-tuning
trainer.fine_tune('resources/taggers/question-classification-with-transformer',
                  learning_rate=5.0e-5,
                  mini_batch_size=4,
                  )
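
Presets can still be overridden by passing them explicitly, as with learning_rate and mini_batch_size above. For instance, assuming extra keyword arguments such as max_epochs are forwarded to the underlying training routine, a longer run would look like:

trainer.fine_tune('resources/taggers/question-classification-with-transformer',
                  learning_rate=5.0e-5,
                  mini_batch_size=4,
                  max_epochs=20,  # overrides the preset epoch count
                  )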

Old syntax (from the tutorial):

import torch
from torch.optim.lr_scheduler import OneCycleLR

from flair.data import Corpus
from flair.datasets import TREC_6
from flair.embeddings import TransformerDocumentEmbeddings
from flair.models import TextClassifier
from flair.trainers import ModelTrainer

# 1. get the corpus
corpus: Corpus = TREC_6()

# 2. what label do we want to predict?
label_type = 'question_class'

# 3. create the label dictionary
label_dict = corpus.make_label_dictionary(label_type=label_type)

# 4. initialize transformer document embeddings (many models are available)
document_embeddings = TransformerDocumentEmbeddings('distilbert-base-uncased', fine_tune=True)

# 5. create the text classifier
classifier = TextClassifier(document_embeddings, label_dictionary=label_dict, label_type=label_type)

# 6. initialize trainer with AdamW optimizer
trainer = ModelTrainer(classifier, corpus, optimizer=torch.optim.AdamW)

# 7. run training with fine-tuning
trainer.train('resources/taggers/question-classification-with-transformer',
              learning_rate=5.0e-5,
              mini_batch_size=4,
              max_epochs=10,
              scheduler=OneCycleLR,
              embeddings_storage_mode='none',
              weight_decay=0.,
              )
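
Both snippets train the same model: fine_tune simply bundles the settings that previously had to be passed by hand (the AdamW optimizer, the OneCycleLR scheduler, embeddings_storage_mode='none', weight_decay=0., and a small epoch count).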

alanakbik merged commit 911d9d6 into master on Sep 16, 2021.
alanakbik deleted the fine-tune branch on September 16, 2021 at 12:12.