Hi, I get the following error when I try to load the fine-tuned temporal checkpoint for the ViT-Large model:

RuntimeError: Error(s) in loading state_dict for VisionTransformer: size mismatch for pos_embed: copying a param with shape torch.Size([1, 197, 640]) from checkpoint, the shape in current model is torch.Size([1, 197, 1024]).

I get similar shape mismatch errors when I try to load other saved checkpoints as well.
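Since the last axis of `pos_embed` is the transformer's embedding width, one way to diagnose this class of error is to read that dimension out of the checkpoint and map it to the matching ViT variant before building the model. A minimal sketch, assuming the standard ViT widths (768 for Base, 1024 for Large, 1280 for Huge); the variant names and the helper itself are illustrative, not part of this repo:

```python
# Map standard ViT embedding widths to variant names (assumed, illustrative).
VIT_VARIANTS = {768: "vit_base", 1024: "vit_large", 1280: "vit_huge"}

def infer_variant(embed_dim):
    """Return the ViT variant whose embedding width matches the checkpoint."""
    try:
        return VIT_VARIANTS[embed_dim]
    except KeyError:
        raise ValueError(f"no known ViT variant with embed_dim={embed_dim}")

# pos_embed has shape (1, num_tokens, embed_dim); only the last axis matters.
# e.g. taken from checkpoint["model"]["pos_embed"].shape after torch.load(...)
pos_embed_shape = (1, 197, 768)
print(infer_variant(pos_embed_shape[-1]))  # -> vit_base
```

A checkpoint width of 640, as in the error above, matches none of the standard variants, which suggests the checkpoint was produced with a custom or reduced model configuration.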
RuntimeError: Error(s) in loading state_dict for GroupChannelsVisionTransformer:
size mismatch for cls_token: copying a param with shape torch.Size([1, 1, 768]) from checkpoint, the shape in current model is torch.Size([1, 1, 1024]).
size mismatch for patch_embed.0.proj.weight: copying a param with shape torch.Size([768, 4, 8, 8]) from checkpoint, the shape in current model is torch.Size([1024, 4, 8, 8]).
I looked into the documentation; this error was resolved by using ViT-Large instead of ViT-Base. I believe the ViT-Base configuration has changed in size since this repo was written against it.
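To confirm the model really matches the checkpoint before calling `load_state_dict`, the parameter shapes on both sides can be compared up front, which turns the opaque size-mismatch traceback into an explicit list of offending parameters. A minimal sketch using plain shape tuples (the two example entries are taken from the error messages above; with real objects you would use `{k: tuple(v.shape) for k, v in state_dict.items()}` on each side):

```python
def find_shape_mismatches(ckpt_shapes, model_shapes):
    """Return {param_name: (checkpoint_shape, model_shape)} for every
    parameter present on both sides whose shapes disagree."""
    return {
        name: (ckpt_shapes[name], model_shapes[name])
        for name in ckpt_shapes
        if name in model_shapes and ckpt_shapes[name] != model_shapes[name]
    }

# Shapes reproduced from the GroupChannelsVisionTransformer error above:
ckpt = {"cls_token": (1, 1, 768), "patch_embed.0.proj.weight": (768, 4, 8, 8)}
model = {"cls_token": (1, 1, 1024), "patch_embed.0.proj.weight": (1024, 4, 8, 8)}
for name, (c, m) in find_shape_mismatches(ckpt, model).items():
    print(f"{name}: checkpoint {c} vs model {m}")
```

An empty result means the shapes line up and loading should succeed; any entries indicate the model was built with the wrong variant (here, Large where the checkpoint expects Base).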