Distillation support for torchvision script #1310
Conversation
Changes look good, but we need to call manager.update_loss (or whatever the call is) to actually use the distillation loss.
Great catch, updated!
Left one comment, otherwise LGTM!
Looks great @rahul-tuli
Looks like loss_update returns the new loss to use! So close 😀
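For reference, a minimal sketch of the pattern this review thread is converging on, assuming SparseML's `ScheduledModifierManager.loss_update` signature and hypothetical local variable names (`outputs`, `inputs`, `steps_per_epoch`); the exact call site in `train.py` may differ:

```python
# Sketch only; assumes SparseML's ScheduledModifierManager API and
# hypothetical variable names from a typical training loop.
loss = criterion(outputs, targets)
if manager is not None:
    # loss_update RETURNS the (possibly distillation-adjusted) loss,
    # so the return value must be assigned back, not discarded.
    loss = manager.loss_update(
        loss,
        model,
        optimizer,
        epoch,
        steps_per_epoch,
        student_outputs=outputs,
        teacher_inputs=inputs,
    )
loss.backward()
```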
🚀 LETS GOOOOOO
Commit summary:

* Add support for `self` distillation and `disable` (see the sketch after this list)
* Pull out model creation into a method
* Add support to distill with another model
* Add modifier loss update before backward pass
* Bugfix: set loss
* Update src/sparseml/pytorch/torchvision/train.py

Co-authored-by: corey-nm <109536191+corey-nm@users.noreply.github.com>
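A minimal sketch of the teacher dispatch described in the first bullet; the helper and variable names are assumed for illustration, not taken from the PR diff:

```python
from copy import deepcopy

import torch


def resolve_teacher(distill_teacher: str, student: torch.nn.Module):
    """Hypothetical helper: map the --distill-teacher CLI value to a teacher model."""
    if distill_teacher == "disable":
        # "disable" turns distillation off even if the recipe
        # contains distillation modifiers
        return None
    if distill_teacher == "self":
        # "self" distills the student from a frozen copy of itself
        return deepcopy(student).eval()
    # any other value is treated as a checkpoint path or SparseZoo stub
    # for a separate teacher model (loading elided in this sketch)
    raise NotImplementedError(f"load teacher from {distill_teacher!r}")
```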
The goal of this PR is to add distillation support to our pytorch/torchvision integration.

Test recipe: `distillation.yaml` (an illustrative sketch follows).
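The recipe contents aren't shown in this thread; the following is an illustrative sketch of a SparseML distillation recipe, with hyperparameter values assumed rather than taken from the actual `distillation.yaml`:

```yaml
# Illustrative sketch only; not the actual distillation.yaml from this PR.
modifiers:
  - !EpochRangeModifier
    start_epoch: 0.0
    end_epoch: 10.0

  - !DistillationModifier
    start_epoch: 0.0
    hardness: 0.5     # weighting between distillation loss and task loss
    temperature: 2.0  # softmax temperature applied to teacher/student logits
```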
Test commands (run manually):
Self-distillation (`resnet50` as its own teacher):

```bash
sparseml.image_classification.train \
  --recipe distillation.yaml --pretrained True --pretrained-dataset imagenette \
  --arch-key resnet50 --dataset-path /home/rahul/datasets/imagenette/imagenette-160 \
  --batch-size 128 --opt SGD --output-dir ./training-runs/image_classification-pretrained \
  --distill-teacher self
```
Vanilla training (no `--distill-teacher` specified):

```bash
sparseml.image_classification.train \
  --recipe distillation.yaml --pretrained True --pretrained-dataset imagenette \
  --arch-key resnet50 --dataset-path /home/rahul/datasets/imagenette/imagenette-160 \
  --batch-size 128 --opt SGD --output-dir ./training-runs/image_classification-pretrained
```
Disable distillation:

```bash
sparseml.image_classification.train \
  --recipe distillation.yaml --pretrained True --pretrained-dataset imagenette \
  --arch-key resnet50 --dataset-path /home/rahul/datasets/imagenette/imagenette-160 \
  --batch-size 128 --opt SGD --output-dir ./training-runs/image_classification-pretrained \
  --distill-teacher disable
```
Distill `mobilenet` using a `resnet50` teacher from SparseZoo:

```bash
sparseml.image_classification.train \
  --recipe distillation.yaml --pretrained True --pretrained-dataset imagenette \
  --arch-key mobilenet --dataset-path /home/rahul/datasets/imagenette/imagenette-160 \
  --batch-size 128 --opt SGD --output-dir ./training-runs/image_classification-pretrained \
  --distill-teacher zoo:cv/classification/resnet_v1-50/pytorch/sparseml/imagenet/base-none \
  --pretrained-teacher-dataset imagenet --teacher-arch-key resnet50
```