num_labels parameter needed in from_pretrained to load certain bert m… #80

oliverjarvis · 2020-10-23T12:44:43Z

Pull request in order to fix a mismatch in model and weight size when loading the bert models.
Specifically this error appeared when loading at least BertTone
RuntimeError: Error(s) in loading state_dict for BertForSequenceClassification: size mismatch for classifier.weight: copying a param with shape torch.Size([3, 768]) from checkpoint, the shape in current model is torch.Size([2, 768]). size mismatch for classifier.bias: copying a param with shape torch.Size([3]) from checkpoint, the shape in current model is torch.Size([2]).

Furthermore a duplicate load_bert_tone_model was found.

…odels num_label variables switched Added correct num_labels to pre_trained in order to correctly load models

AmaliePauli · 2020-11-02T14:16:36Z

Thanks @saxogrammaticus

num_labels parameter needed in from_pretrained to load certain bert m…

4a0a167

…odels num_label variables switched Added correct num_labels to pre_trained in order to correctly load models

AmaliePauli merged commit 6f107c9 into alexandrainst:master Nov 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

num_labels parameter needed in from_pretrained to load certain bert m… #80

num_labels parameter needed in from_pretrained to load certain bert m… #80

oliverjarvis commented Oct 23, 2020

AmaliePauli commented Nov 2, 2020

num_labels parameter needed in from_pretrained to load certain bert m… #80

num_labels parameter needed in from_pretrained to load certain bert m… #80

Conversation

oliverjarvis commented Oct 23, 2020

AmaliePauli commented Nov 2, 2020