
Unify algorithms part 2: mixup, cutmix, label smoothing #658

Merged · 39 commits into dev from davis/unify-functional-2 · Mar 8, 2022

Conversation

@dblalock (Contributor) commented on Mar 3, 2022

Follow-up to #524 to continue addressing #343. This is mostly cleaning up the docs to be consistent across methods + match the code's behavior, fixing the doctests, and modifying the docstrings in accordance with some slight API changes (mostly param renaming). These changes are:

Mixup:

  • n_classes -> num_classes in mixup_batch, for consistency with torch and the rest of composer rather than sklearn.
  • interpolation_lambda -> mixing in mixup_batch
  • x -> input in mixup_batch for consistency with torch
  • y -> target in mixup_batch for consistency with torch
  • Removed the wrapper properties for attributes in the algorithm. These weren't documented and were seemingly (?) only there so that the tests didn't access private attrs. (A minimal sketch of the renamed mixup_batch-style signature follows this list.)
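
For illustration, here is a minimal standalone sketch of mixup using the renamed arguments (input, target, num_classes, mixing). This is only a sketch of the technique under assumed shapes; the function name, exact signature, and return values are assumptions, not composer's actual mixup_batch:

```python
import torch
import torch.nn.functional as F

def mixup_sketch(input: torch.Tensor, target: torch.Tensor,
                 num_classes: int, mixing: float = 0.2):
    """Illustrative mixup: blend each example with a permuted partner.

    `mixing` plays the role of the old `interpolation_lambda`; the real
    mixup_batch may differ in signature and return values.
    """
    permutation = torch.randperm(input.shape[0])
    # Interpolate inputs with their permuted partners.
    x_mix = (1.0 - mixing) * input + mixing * input[permutation]
    # Interpolate one-hot targets the same way.
    y_onehot = F.one_hot(target, num_classes).float()
    y_mix = (1.0 - mixing) * y_onehot + mixing * y_onehot[permutation]
    return x_mix, y_mix, permutation
```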

Cutmix:

  • n_classes -> num_classes again
  • X -> input in cutmix_batch for consistency with torch
  • y -> target in cutmix_batch for consistency with torch
  • cutmix_lambda -> length in cutmix_batch, matching cutout's length argument. As part of this change, I also changed the logic to 1) correctly compute the internal cutmix_lambda when bbox is provided, and 2) raise if both length and bbox are provided (see the sketch after this list).
  • cutmix_batch now returns the permutation it used for consistency with mixup_batch. @coryMosaicML , do we need to return these? We apparently didn't here, but seemingly did in mixup_batch?
  • Removed the wrapper properties for attributes for same reasons as in mixup
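
As above, a minimal standalone sketch of cutmix with the renamed arguments, showing the length/bbox exclusivity and how cutmix_lambda is recomputed from the box area. The function name, signature, and (N, C, H, W) input shape are illustrative assumptions, not composer's cutmix_batch:

```python
import torch
import torch.nn.functional as F

def cutmix_sketch(input: torch.Tensor, target: torch.Tensor, num_classes: int,
                  length: float = None, bbox: tuple = None):
    """Illustrative cutmix: paste a box from a permuted batch into `input`."""
    if length is not None and bbox is not None:
        raise ValueError("Provide either `length` or `bbox`, not both.")
    H, W = input.shape[-2], input.shape[-1]
    if bbox is not None:
        x1, y1, x2, y2 = bbox
    else:
        frac = 0.5 if length is None else length  # default box size for this sketch
        cut_h, cut_w = int(frac * H), int(frac * W)
        cy, cx = torch.randint(H, (1,)).item(), torch.randint(W, (1,)).item()
        y1, y2 = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, H)
        x1, x2 = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, W)
    # Recompute the mixing coefficient from the (possibly user-provided) box area.
    cutmix_lambda = 1.0 - ((y2 - y1) * (x2 - x1)) / (H * W)
    permutation = torch.randperm(input.shape[0])
    x_cutmix = input.clone()
    x_cutmix[..., y1:y2, x1:x2] = input[permutation][..., y1:y2, x1:x2]
    y_onehot = F.one_hot(target, num_classes).float()
    y_cutmix = cutmix_lambda * y_onehot + (1.0 - cutmix_lambda) * y_onehot[permutation]
    return x_cutmix, y_cutmix, permutation
```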

Label smoothing:

  • alpha -> interpolation in smooth_labels and algorithm
  • targets -> target in smooth_labels for consistency with torch (minimal sketch after this list)
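
A minimal sketch of label smoothing with the renamed arguments (target, interpolation); the standalone function below is illustrative and may not match composer's smooth_labels exactly:

```python
import torch
import torch.nn.functional as F

def smooth_labels_sketch(target: torch.Tensor, interpolation: float = 0.1,
                         num_classes: int = 10) -> torch.Tensor:
    """Illustrative label smoothing: blend one-hot targets toward uniform.

    `interpolation` plays the role of the old `alpha`.
    """
    one_hot = F.one_hot(target, num_classes).float()
    uniform = torch.full_like(one_hot, 1.0 / num_classes)
    return (1.0 - interpolation) * one_hot + interpolation * uniform
```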

Parts I'm not sure about (updated after slack discussion):

  • Whether to return the permutation in functional forms (see above)
  • How best to handle cutmix allowing 3 different ways to specify the box to use (alpha, length / cutmix_lambda, and bbox). Right now alpha is just ignored if others are provided, while the other two now raise a ValueError if both are not None. Seems like the least bad way to do this, but a little inconsistent. Maybe we could remove bbox as a parameter?
  • For removed wrapper properties, we'd ideally just not have the tests depend on class internals. I went with having tests access the private attrs rather than cluttering the code + public API, but we could also just make these attrs public and document them. Probably should have put this change in a separate PR, but I was already renaming the attrs and I was on a roll / changing a lot anyway, so I just went with it.

NOTE: still need to fix the Jenkins error and double-check the docstring output in the docs, but this is ready for feedback, especially regarding the API questions above.

@hanlint (Contributor) commented on Mar 3, 2022

I like where this is going! Agreed on interpolation... how strictly do we want to maintain cross-functional consistency versus following the original paper's/authors' terminology, e.g. alpha for label smoothing?

dblalock marked this pull request as ready for review on March 3, 2022, 20:44
@coryMosaicML (Contributor) commented, quoting the PR description:
cutmix_batch now returns the permutation it used for consistency with mixup_batch. @coryMosaicML , do we need to return these? We apparently didn't here, but seemingly did in mixup_batch?

I think this behavior in mixup_batch is a holdover from the old repo that got ported over. I don't think the original reason for doing that exists in composer, and not returning the permutation should be fine. It makes the functional interface feel a lot cleaner to me. However, two possible reasons to return permutations: 1) If someone wants to do further analysis/debugging of what mixup_batch and cutmix_batch are doing to the inputs, this could be handy, and 2) returning the permutation makes it possible to use the loss interpolation trick people often do to use mixup and cutmix with index labels after using mixup_batch and cutmix_batch if they so choose.
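
For context, the loss-interpolation trick mentioned here looks roughly like the following. This is a hedged sketch: the names logits, target, permutation, and mixing refer to the illustrative functions above, not necessarily composer's actual return values:

```python
import torch
import torch.nn.functional as F

def mixup_index_label_loss(logits: torch.Tensor, target: torch.Tensor,
                           permutation: torch.Tensor, mixing: float) -> torch.Tensor:
    """Interpolate the losses instead of the labels, so plain index labels work."""
    return ((1.0 - mixing) * F.cross_entropy(logits, target)
            + mixing * F.cross_entropy(logits, target[permutation]))
```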

Resolved review threads on: composer/algorithms/cutmix/cutmix.py, composer/algorithms/mixup/mixup.py, tests/algorithms/test_mixup.py
@hanlint (Contributor) left a comment:

I noticed two instances of significant test setup code to create the model and dataloaders. Are those needed, given that some of these are auto-loaded in https://github.com/mosaicml/composer/blob/dev/docs/source/doctest_fixtures.py ?

Resolved review threads on composer/algorithms/cutmix/cutmix.py
@hanlint (Contributor) commented on Mar 7, 2022

Also -- update yamls?

@dblalock (Contributor, Author) commented on Mar 8, 2022, quoting the question above:

Also -- update yamls?

The algorithm yamls? The label_smoothing one is updated, but cutmix and mixup fortunately don't have to change anything since they only use alpha and num_classes, and these are unchanged.

@hanlint (Contributor) left a comment:

LGTM

dblalock merged commit 7bfac7a into dev on Mar 8, 2022
dblalock deleted the davis/unify-functional-2 branch on March 8, 2022, 20:00
dblalock added a commit referencing this pull request on Mar 10, 2022:
Addresses #343, or at least well enough for present purposes.

Changes:

* colout.py: rename X -> input
* cutout.py: rename X -> input, n_holes -> num_holes, make cutout_batch length fractional-only to match cutmix.py (see discussion in #658; sketch at the end of this message)
* account for the n_holes -> num_holes rename, as well as the removal of integer lengths, in:
    * test_cutout.py
    * algorithms/hparams.py
    * yamls/algorithms/cutout.yaml
    * test_algorithm_registry.py
    * test_load.py
    * test_trainer.py
    * examples/adding_custom_models.py
    * notebooks/medical_image_segmentation_composer.ipynb
* progressive_resizing.py: rename X -> input, y -> target
* selective_backprop.py: rename X -> input, y -> target

Minor docstring cleanup:
- augmix_image now has correct input type shown in docstring
- randaugment_image now has correct input type shown in docstring
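
To illustrate the cutout change mentioned above, here is a minimal standalone sketch with the renamed num_holes argument and a fractional length. The function name and signature are assumptions for illustration, not composer's cutout_batch:

```python
import torch

def cutout_sketch(input: torch.Tensor, num_holes: int = 1,
                  length: float = 0.5) -> torch.Tensor:
    """Illustrative cutout: zero out `num_holes` square regions.

    `length` is a fraction of the image side (matching cutmix), not pixels.
    """
    out = input.clone()
    H, W = out.shape[-2], out.shape[-1]
    cut_h, cut_w = int(length * H), int(length * W)
    for _ in range(num_holes):
        cy, cx = torch.randint(H, (1,)).item(), torch.randint(W, (1,)).item()
        y1, y2 = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, H)
        x1, x2 = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, W)
        out[..., y1:y2, x1:x2] = 0.0  # zero out the hole
    return out
```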