DeepLabv3 + ADE20k benchmark #107
Conversation
Would you be able to use monkeypatch to mock out the actual download during the tests (and instead have it return uninitialized weights)?
The pretrained weight download is done within torchvision, so I thought the easiest way to avoid the long download was to manually change
cc: @A-Jacobson or @coryMosaicML for a review here (vision team)
After some thought and discussion, I think it makes the most sense to make the initial benchmark as close as possible to other benchmarks. This means reverting back to the original batch size, no decoupled weight decay, and using a polynomial LR schedule instead of cosine decay. @ravi-mosaicml and/or @A-Jacobson, could y'all skim through these recent changes today or tomorrow?
It would be amazing if I could get this in tomorrow, thank you!!
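For reference, the polynomial decay rule mentioned above is commonly implemented as below. This is a sketch, not composer's `PolynomialLR` scheduler; the power of 0.9 is the value typically used for semantic segmentation.

```python
def polynomial_lr(base_lr: float, step: int, max_steps: int,
                  power: float = 0.9) -> float:
    """Polynomial LR decay: lr = base_lr * (1 - step / max_steps) ** power.

    Decays the learning rate from base_lr at step 0 down to 0 at
    max_steps. power=1.0 recovers plain linear decay.
    """
    frac = min(step, max_steps) / max_steps
    return base_lr * (1.0 - frac) ** power
```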
Overall LGTM. Two main things:
- I don't think many of the type ignores are required. I tried to comment on how to remove them. If they are required, please add the pyright error message as a comment next to the type ignore. I realize they can get annoying; happy to help debug these!
- Can you move the ImageNet normalization parameters out of the normalization_fn class and into the ImageNet dataset file? The normalization function should be dataset-generic.
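One way the requested split could look: the dataset-specific statistics live as module-level constants in the dataset file, and the normalization function takes them as arguments. A sketch with illustrative names; the constants shown are the standard ImageNet channel statistics.

```python
from typing import Sequence, Tuple

# Candidate constants for the ImageNet dataset file; the normalization
# function itself stays dataset-generic. (Names are illustrative.)
IMAGENET_CHANNEL_MEAN: Tuple[float, ...] = (0.485, 0.456, 0.406)
IMAGENET_CHANNEL_STD: Tuple[float, ...] = (0.229, 0.224, 0.225)

def normalize_channels(values: Sequence[float],
                       mean: Sequence[float],
                       std: Sequence[float]) -> Tuple[float, ...]:
    """Per-channel (x - mean) / std; callers pass dataset-specific stats."""
    return tuple((v - m) / s for v, m, s in zip(values, mean, std))
```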
resnet.model_urls[backbone_arch] = "https://download.pytorch.org/models/resnet101-cd907fc2.pth"
else:
    raise ValueError(f"backbone_arch must be one of ['resnet50', 'resnet101'] not {backbone_arch}")
backbone = resnet.__dict__[backbone_arch](pretrained=is_backbone_pretrained,
I presume this works but am a bit confused, as I don't see resnet50 or resnet101 in https://github.com/pytorch/vision/blob/main/torchvision/models/regnet.py
Ah, I took this line from pytorch. The link you posted was for regnet.py; resnet.py is here: https://github.com/pytorch/vision/blob/main/torchvision/models/resnet.py
self.val_ce = CrossEntropyLoss(ignore_index=-1)

def forward(self, batch: Batch):
    x = batch[0]  # type: ignore
Suggested change:
x = batch[0]  # type: ignore
x = composer.utils.types.as_batch_pair(batch)[0]
I just changed the input type to BatchPair, but it feels weird for forward to take a BatchPair since the label shouldn't be necessary to run the forward pass. This is a discussion for another time...
composer/datasets/ade20k.py
Outdated
if np.random.randint(2):
    hue_factor = np.random.uniform(-self.hue, self.hue)
    image = TF.adjust_hue(image, hue_factor)  # type: ignore
ditto
composer/datasets/ade20k.py
Outdated
if contrast_mode == 0 and np.random.randint(2):
    contrast_factor = np.random.uniform(1 - self.contrast, 1 + self.contrast)
    image = TF.adjust_contrast(image, contrast_factor)  # type: ignore
ditto
image = self.image_transforms(image)

if self.split in ['train', 'val']:
    return image, target  # type: ignore
ditto
)
image_transforms = torch.nn.Sequential(
    PhotometricDistoration(brightness=32. / 255, contrast=0.5, saturation=0.5, hue=18. / 255),
    PadToSize(size=(self.final_size, self.final_size),
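The hue and contrast branches earlier in the diff follow the same coin-flip jitter pattern. In isolation it can be sketched as below (an illustrative helper using Python's stdlib random rather than numpy; the default bound mirrors the brightness=32/255 value in the diff).

```python
import random

def maybe_jitter_factor(bound: float = 32. / 255,
                        rng: random.Random = None) -> float:
    """mmseg-style coin-flip jitter: with probability 1/2 return a
    uniform random additive factor in [-bound, bound], otherwise 0.0
    (a no-op). Sketch only, not composer's PhotometricDistoration."""
    rng = rng or random.Random()
    if rng.randint(0, 1):
        return rng.uniform(-bound, bound)
    return 0.0
```

Seeding the RNG makes the augmentation reproducible in tests.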
can you add a comment on where these magic values came from?
Added!
Okay, I think I addressed all of @ravi-mosaicml's comments, but let me know if more is needed! Otherwise, I will merge!
I'm good with merging this as is. Though I would like to better formalize what constitutes a "baseline" as you've reached/surpassed mmseg's reported IoU with about 3 different recipes.
* First commit ade20k and deeplabv3
* Allow ImageNet pretrained backbone
* Allow background class to be ignored
* Remove cross entropy metric (temp)
* Add mmseg photometric augmentations
* Add option to sync bn
* Remove dropout and extra 3x3 conv
* Use new resnet50 weights
* Use 3x3 conv before classification
* Add dropout2d
* Remove dropout (again)
* Select pretrained model, ability to randomly initialize, pytorch's syncBN
* Fix LR schedule params
* Update with recent merge and add resnet101
* Refactor ade20k pt. 1
* Missed hflips
* Initial ignore_class refactor
* Remove initial resize for base size in random scale
* Remove cityscapes
* Polynomial LR schedule
* Add ade20k docstrings and some refactoring
* Another iteration on ade20k and partial deeplabv3 refactor
* Change permissions
* total -> train batch size
* Decoupled SGDW
* Cleanup ade20k code and docstrings
* Fix dataset test
* Move preprocessing and collate; add defaults
* Collate docstring, minor name changes, and ade20k synthetic dataset
* Add mosaicml copyright
* Fix formatting
* Format pt. 2
* Remove RANDOM_INT from synthetic datasets
* Format pt. 3
* Update yaml
* Monkeypatch model tests to skip pretrained weights
* mIoU -> MIoU
* PolyLR and no DWD
* Add initializers
* Only initialize head when using pretrained backbone
* Add PolynomialLR docstring and fix scheduler tests
* Add mIoU and seg transformation tests
* Reorder imports
* Reorder imports pt. 2...
* Address type ignores, move imagenet norm params, other comments
* Get tests to pass
* Formatting
Pull request to add the DeepLabv3 model and the ADE20k dataset as a new benchmark for semantic segmentation tasks.
Some results with 4 seeds for each experiment:
Here is a convergence run.
mmsegmentation reports 44.08 mIoU and 45.00 mIoU for 64 and 127 epochs, respectively (here). I think these numbers are the final results for a single run.
Before Merging