[torchvision][Bug-fix] ignore state dict error on transfer learning tasks + use PythonLogger default logger #1455

KSGulin · 2023-03-17T16:08:58Z

When loading a pre-trained torchvision model, an error occurs if the number of classes in the target dataset doesn't match the number of classes in the pre-trained model, e.g. when training on a smaller subset of the original dataset. This PR fixes the issue by ignoring the classification head in the loaded model state dict. Note that this will still fail for some models (such as Inception), because their classification head doesn't follow the standard naming pattern.
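For reviewers who want a concrete picture of the idea, here is a minimal sketch of dropping classification-head entries whose shapes no longer match. It is not the code in this PR: the helper name `drop_mismatched_head`, the `head_prefixes` values, and the stand-in objects (used instead of real tensors so the snippet runs without PyTorch) are all assumptions for illustration.

```python
from types import SimpleNamespace


def drop_mismatched_head(checkpoint_state, model_state,
                         head_prefixes=("classifier.", "fc.")):
    """Return (kept, skipped): checkpoint entries minus head entries whose
    shape differs from the model's.

    Hypothetical helper; `head_prefixes` is an assumed naming pattern, which
    is exactly what breaks for models that name their head differently.
    """
    kept, skipped = {}, []
    for name, tensor in checkpoint_state.items():
        target = model_state.get(name)
        is_head = name.startswith(head_prefixes)
        if (is_head and target is not None
                and tuple(tensor.shape) != tuple(target.shape)):
            # Class-count mismatch: leave this entry at its fresh initialization.
            skipped.append(name)
            continue
        kept[name] = tensor
    return kept, skipped


# Stand-ins for tensors: only `.shape` is needed for the comparison.
checkpoint = {
    "features.conv0.weight": SimpleNamespace(shape=(64, 3, 7, 7)),
    "classifier.weight": SimpleNamespace(shape=(1000, 1024)),  # 1000 classes
    "classifier.bias": SimpleNamespace(shape=(1000,)),
}
model = {
    "features.conv0.weight": SimpleNamespace(shape=(64, 3, 7, 7)),
    "classifier.weight": SimpleNamespace(shape=(10, 1024)),  # 10 classes
    "classifier.bias": SimpleNamespace(shape=(10,)),
}

filtered, skipped = drop_mismatched_head(checkpoint, model)
print(skipped)
```

With real PyTorch objects, the filtered dict would then typically be passed to `model.load_state_dict(filtered, strict=False)` so that the missing head keys are tolerated.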

Test plan
sparseml.image_classification.train --checkpoint-path verizon_dense.pt --arch-key densenet121 --dataset-path /network/datasets/imagenette-160/imagenette-160 --pretrained True

src/sparseml/pytorch/torchvision/train.py

* comments

…asks + use PythonLogger default logger (#1455) * Remove cf from native torchvision models * * do not pass default logger to PythonLogger * comments --------- Co-authored-by: Damian <damian@neuralmagic.com> Co-authored-by: Benjamin <ben@neuralmagic.com>

…ansfer learning tasks + use PythonLogger default logger #1455 (#1460) * [torchvision][Bug-fix] ignore state dict error on transfer learning tasks + use PythonLogger default logger (#1455) * Remove cf from native torchvision models * * do not pass default logger to PythonLogger * comments --------- Co-authored-by: Damian <damian@neuralmagic.com> Co-authored-by: Benjamin <ben@neuralmagic.com> * [torchvision] add ignore error tensors back to optional checkpoint load (#1459) --------- Co-authored-by: Konstantin Gulin <66528950+KSGulin@users.noreply.github.com> Co-authored-by: Damian <damian@neuralmagic.com>

…ansfer learning tasks + use PythonLogger default logger #1455 (#1461) * [torchvision][Bug-fix] ignore state dict error on transfer learning tasks + use PythonLogger default logger (#1455) * Remove cf from native torchvision models * * do not pass default logger to PythonLogger * comments --------- Co-authored-by: Damian <damian@neuralmagic.com> Co-authored-by: Benjamin <ben@neuralmagic.com> * [torchvision] add ignore error tensors back to optional checkpoint load (#1459) --------- Co-authored-by: Konstantin Gulin <66528950+KSGulin@users.noreply.github.com> Co-authored-by: Damian <damian@neuralmagic.com>

KSGulin added the mle-team label Mar 17, 2023

KSGulin requested review from corey-nm and a team March 17, 2023 16:08

KSGulin self-assigned this Mar 17, 2023

KSGulin requested review from tdg5 and abhinavnmagic and removed request for a team March 17, 2023 16:09

rahul-tuli previously approved these changes Mar 17, 2023


dbogunowicz previously approved these changes Mar 17, 2023


corey-nm reviewed Mar 17, 2023


src/sparseml/pytorch/torchvision/train.py (outdated, resolved)

bfineran previously approved these changes Mar 17, 2023


KSGulin dismissed stale reviews from bfineran, dbogunowicz, and rahul-tuli via 9c156cd March 17, 2023 19:11

KSGulin force-pushed the ic_class_fix branch 2 times, most recently from 0f66a06 to 2fd4854 Compare March 17, 2023 19:17

Remove cf from native torchvision models

48b3e91

KSGulin force-pushed the ic_class_fix branch from 2fd4854 to 48b3e91 Compare March 17, 2023 19:19

Merge branch 'main' into ic_class_fix

0fd0227

bfineran reviewed Mar 17, 2023


src/sparseml/pytorch/torchvision/train.py (outdated, resolved)

* do not pass default logger to PythonLogger

e1e02c1

* comments

bfineran approved these changes Mar 17, 2023


bfineran changed the title ~~[Bug-fix] Don't override num_classes for pre-trained torchvision models~~ [torchvision][Bug-fix] ignore state dict error on transfer learning tasks + use PythonLogger default logger Mar 17, 2023

anmarques approved these changes Mar 17, 2023


bfineran merged commit 7ee620b into main Mar 17, 2023

bfineran deleted the ic_class_fix branch March 17, 2023 19:54

bfineran restored the ic_class_fix branch March 17, 2023 19:55

bfineran mentioned this pull request Mar 17, 2023

[torchvision] add ignore error tensors back to optional checkpoint load #1459

Merged
