Cityscapes AutoLabelling dataset #1000

lkdci · 2023-05-14T14:37:58Z

Cityscapes AutoLabelled dataset were introduced by NVIDIA research group.
paper: Hierarchical Multi-Scale Attention for Semantic Segmentation", https://arxiv.org/abs/2005.10821
Official repo: https://github.com/NVIDIA/semantic-segmentation

This PR includes:

CityscapesConcatDataset to support combination of cityscapes subsets.
example ddrnet recipe with AL dataset.

dagshub · 2023-05-14T14:38:00Z

Join the discussion on DagsHub!

Louis-Dupont

Looks good, if you can just add a test for this dataloader, just to make sure that it runs
https://github.com/Deci-AI/super-gradients/blob/master/tests/unit_tests/dataloader_factory_test.py

Thanks 🙏

src/super_gradients/training/datasets/segmentation_datasets/cityscape_segmentation.py

lkdci · 2023-05-15T10:34:24Z

Hi @Louis-Dupont there is a design conflict about the Dataloader creation.

A dataloader - dataset creation strategy can be done in two different way,

First Approach - dataloader factory:

train_dataloader: cityscapes_train

This approach is problematic since it hinder loading default parameters from a default yaml file defined in code. Then when passing the dataset_params through the config recipe, we force default value that we might not want to include, but they are injected within the code, which contradict the yaml approach for building configs.

see this example:

dataloader factory method:

def my_dataset_train(...):
   return get_data_loader(config_name="my_dataset_default", dataset_cls=MyDataset)

my_dataset_default.yaml:

...
train_dataloader_params:
  sampler: "my_data_sampler"

then in main_recipe_config.yaml:

train_dataloader: my_dataset_train

Following this examples we are not able to initiate my dataset without the sampler field, and we might easily miss is injected in the first place into the dataloader params. (This issue was reported before for the coco dataset with infinite sampler in previous versions.)

Seccond Approach - dataset factory:

Explicitly define the dataset type to use without using the wrapper dataloader factory:

Following the previous example, my_dataset_default.yaml, we add the dataset key:

my_dataset_default.yaml:

...
train_dataloader_params:
  dataset: MyDataset
  sampler: "my_data_sampler"

In contrary to the previous approach we are not bounded by the above default params, and we can set a different dataset params file:

my_dataset_custom_params.yaml:

...
train_dataloader_params:
  dataset: MyDataset

Then in then in main_recipe_config.yaml explicitly choose the required dataset_params to use:

defaults:
  - dataset_params: my_dataset_custom_params

IMO this approach is preferable with better visibility, and doesn't involves hidden behavior within the dataloader factory code.

Why not supporting both approaches?

Both approaches are supported within SG, but there is bug to use both for a given dataset, and the following error is raised:

Error
Traceback (most recent call last):
  File "/home/lior.kadoch/PycharmProjects/super-gradients/tests/unit_tests/dataloader_factory_test.py", line 286, in test_cityscapes_al_train_creation
    dl_train = cityscapes_auto_labelling_train()
  File "/home/lior.kadoch/PycharmProjects/super-gradients/src/super_gradients/training/dataloaders/dataloaders.py", line 548, in cityscapes_auto_labelling_train
    return get_data_loader(
  File "/home/lior.kadoch/PycharmProjects/super-gradients/src/super_gradients/training/dataloaders/dataloaders.py", line 80, in get_data_loader
    dataloader = DataLoader(dataset=dataset, **dataloader_params)
TypeError: type object got multiple values for keyword argument 'dataset'

…nto feature/ALG-1373_cityscapes_auto_label

Louis-Dupont

LGTM

* CityscapesConcatDataset * documentation * ddrnet recipe * unit test * docs * add to init

lkdci added 3 commits May 14, 2023 14:08

CityscapesConcatDataset

646ed1e

documentation

deb9811

ddrnet recipe

add4b24

lkdci requested review from shaydeci, ofrimasad, BloodAxe and Louis-Dupont as code owners May 14, 2023 14:37

Louis-Dupont reviewed May 15, 2023

View reviewed changes

src/super_gradients/training/datasets/segmentation_datasets/cityscape_segmentation.py Outdated Show resolved Hide resolved

lkdci added 3 commits May 15, 2023 16:24

unit test

7f330d3

docs

9539dd7

Merge branch 'master' of https://github.com/Deci-AI/super-gradients i…

a807292

…nto feature/ALG-1373_cityscapes_auto_label

lkdci requested a review from Louis-Dupont May 15, 2023 13:31

add to init

4c5215b

Louis-Dupont approved these changes May 15, 2023

View reviewed changes

Louis-Dupont merged commit b4608f6 into master May 15, 2023

Louis-Dupont deleted the feature/ALG-1373_cityscapes_auto_label branch May 15, 2023 14:06

avideci pushed a commit that referenced this pull request May 23, 2023

Cityscapes AutoLabelling dataset (#1000)

4276428

* CityscapesConcatDataset * documentation * ddrnet recipe * unit test * docs * add to init

avideci pushed a commit that referenced this pull request May 23, 2023

Cityscapes AutoLabelling dataset (#1000)

4892bfd

* CityscapesConcatDataset * documentation * ddrnet recipe * unit test * docs * add to init

geoffrey-g-delhomme pushed a commit to geoffrey-g-delhomme/super-gradients that referenced this pull request May 26, 2023

Cityscapes AutoLabelling dataset (Deci-AI#1000)

2f6db75

* CityscapesConcatDataset * documentation * ddrnet recipe * unit test * docs * add to init

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cityscapes AutoLabelling dataset #1000

Cityscapes AutoLabelling dataset #1000

lkdci commented May 14, 2023

dagshub bot commented May 14, 2023

Louis-Dupont left a comment •

edited

Loading

lkdci commented May 15, 2023

Louis-Dupont left a comment

Cityscapes AutoLabelling dataset #1000

Cityscapes AutoLabelling dataset #1000

Conversation

lkdci commented May 14, 2023

dagshub bot commented May 14, 2023

Louis-Dupont left a comment • edited Loading

Choose a reason for hiding this comment

lkdci commented May 15, 2023

First Approach - dataloader factory:

Seccond Approach - dataset factory:

Why not supporting both approaches?

Louis-Dupont left a comment

Choose a reason for hiding this comment

Louis-Dupont left a comment •

edited

Loading