
Add validator for classification & detection tasks #160

Merged: 8 commits merged from add-validator-classification-detection into develop on Mar 19, 2021

Conversation

seongjun-park-dl
Contributor

Summary

This PR includes:

  • Validator for classification and detection tasks.
  • CLI support for validator.

How to test

python -m unittest -v tests/test_validator.py

Checklist

License

  • I submit my code changes under the same MIT License that covers the project.
    Feel free to contact the maintainers if that's a concern.
  • I have updated the license header for each file (see an example below)
# Copyright (C) 2020 Intel Corporation
#
# SPDX-License-Identifier: MIT

@zhiltsov-max
Contributor

  1. Generally, looks ok. I've tried to validate VOC and this is what I got:
"summary": {
        "errors": 319372,
        "warnings": 81169
    },
  2. The other thing I found is:
{
            "anomaly_type": "MissingBboxAnnotation",
            "description": "Item needs one or more bounding box annotations, but not found.",
            "item_id": "2007_000032",
            "severity": "warning"
        },

I think it would be useful to add the item subset as well (or group results by subset).

  3. This check
{
            "anomaly_type": "FarFromLabelMean",
            "description": "Bounding box annotation '11' in the item has a value of bounding box 'y' that is too far from the label average. (mean of 'person' label: 108.82, got '286.0').",
            "item_id": "2008_004948",
            "severity": "warning"
        },

looks quite strange for the 'x' and 'y' bbox attributes - why should images in a dataset have anything in common there? 'width' and 'height' are more or less understandable, but it might be better to check against clusters instead of a single average value. Another thought is to allow setting acceptable boundaries.
Also, 'long' and 'short' are hard to understand in a bbox context.

  4. This check
{
            "anomaly_type": "UndefinedAttribute",
            "description": "Item has the attribute 'difficult' for the label 'person' which is not defined in metadata.",
            "item_id": "2011_002650",
            "severity": "error"
        },

should consider attributes in LabelCategories.attributes as general, i.e. applicable to all labels. Label-specific ones are written in each label.

  5. As a further step, I'd recommend thinking about providing an option to control which checks are enabled.
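To illustrate the cluster idea from the FarFromLabelMean point above: a single global mean penalizes datasets whose bbox properties are naturally multimodal, while flagging a value only when it is far from every cluster avoids that. This is just a sketch, not Datumaro's actual implementation; the function name, the tolerance parameter, and the toy data are all made up.

```python
from statistics import mean

def far_from_clusters(value, cluster_centers, tolerance):
    """Flag a value only if it is far from *every* cluster center.

    `tolerance` stands in for the user-set "acceptable boundaries"
    suggested in the review; `cluster_centers` could come from any
    clustering of the per-label values (e.g. k-means).
    """
    return all(abs(value - c) > tolerance for c in cluster_centers)

# Toy example: 'person' box widths form two clusters, close-up (~50 px)
# and distant (~300 px). A global mean (~177) would flag both groups.
small = [48, 52, 55]
large = [290, 305, 310]
centers = [mean(small), mean(large)]

assert not far_from_clusters(60, centers, tolerance=20)   # near the small cluster
assert far_from_clusters(170, centers, tolerance=20)      # far from both clusters
```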

@jihyeonyi
Contributor

For the 2nd comment (grouping missing bbox cases), your idea seems really nice, but I don't think we can apply the grouping function in this PR.
For now, the concept of the validator is just to list all the warnings verbosely, so the user has to group or filter items by parsing the validation results file.
I can't think of a nice interface (or process) for that yet.
We could implement it as a filtering function, a transformer, or a utility function; I don't know which is best.
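Until such a helper exists, users can do the grouping themselves when parsing the results file. A minimal sketch, assuming anomaly entries shaped like the examples pasted in this thread and carrying the proposed "subset" field (the helper name is hypothetical):

```python
from collections import defaultdict

def group_by_subset(anomalies):
    """Group anomaly entries by their 'subset' field.

    A hypothetical post-processing helper; entries lacking the
    field fall into a 'default' bucket.
    """
    grouped = defaultdict(list)
    for anomaly in anomalies:
        grouped[anomaly.get("subset", "default")].append(anomaly)
    return dict(grouped)

# Entries shaped like the examples in this conversation:
anomalies = [
    {"anomaly_type": "MissingBboxAnnotation", "item_id": "2007_000032",
     "subset": "train", "severity": "warning"},
    {"anomaly_type": "UndefinedAttribute", "item_id": "2011_002650",
     "subset": "val", "severity": "error"},
]
by_subset = group_by_subset(anomalies)
# by_subset["train"] holds one warning, by_subset["val"] one error
```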

@jihyeonyi
Contributor

For the 3rd comment, I agree with you.
As for short (= min(w, h)) and long (= max(w, h)), some researchers consider them important for the detection task, so we included those terms.

For the 4th comment, we didn't know that. Seongjun will fix it.

And for the last comment, we'll consider it later, because it is a design issue.

@seongjun-park-dl
Contributor Author

I've addressed the 3rd and 4th comments in my most recent commit.

@zhiltsov-max
Contributor

zhiltsov-max commented Mar 18, 2021

> For now, the concept of validator is just listing all the warnings verbosely, so the user should group or filter items by parsing the validation results file.

I mean, it would simplify/clarify things if error messages contained a "subset" field, because a single id can (theoretically) be found in several subsets. And even if subsets do not share item ids, it still makes error localization simpler.

{
    "anomaly_type": "...",
    "description": "...",
    "item_id": "2007_000032",
    "subset": "train",
    "severity": "..."
},
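With a "subset" field in the entry, resolving an anomaly to its dataset item becomes a direct lookup instead of a search across subsets. A toy sketch of the ambiguity (plain dictionaries standing in for a real Datumaro dataset):

```python
# The same id can (theoretically) appear in several subsets, so the
# id alone would match two items here; the (id, subset) pair is unique.
items = {
    ("2007_000032", "train"): {"annotations": []},
    ("2007_000032", "val"): {"annotations": ["bbox"]},
}

def locate(anomaly, items):
    """Resolve an anomaly entry to its dataset item via (id, subset)."""
    return items[(anomaly["item_id"], anomaly["subset"])]

anomaly = {"anomaly_type": "MissingBboxAnnotation",
           "item_id": "2007_000032", "subset": "train",
           "severity": "warning"}
assert locate(anomaly, items)["annotations"] == []  # the offending train item
```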

@jihyeonyi
Contributor

> For now, the concept of validator is just listing all the warnings verbosely, so the user should group or filter items by parsing the validation results file.
>
> I mean, it would simplify/clarify things, if error messages contained a "subset" field, because a single id can (theoretically) be found in several subsets. And even if subsets do not have same item ids, it still makes the process of error localization simpler.
>
> {
>     "anomaly_type": "...",
>     "description": "...",
>     "item_id": "2007_000032",
>     "subset": "train",
>     "severity": "..."
> },

Oh, I understand now. It's not difficult to add the "subset" information, and it seems useful.

@seongjun-park-dl
Contributor Author

seongjun-park-dl commented Mar 19, 2021

> For now, the concept of validator is just listing all the warnings verbosely, so the user should group or filter items by parsing the validation results file.
>
> I mean, it would simplify/clarify things, if error messages contained a "subset" field, because a single id can (theoretically) be found in several subsets. And even if subsets do not have same item ids, it still makes the process of error localization simpler.
>
> {
>     "anomaly_type": "...",
>     "description": "...",
>     "item_id": "2007_000032",
>     "subset": "train",
>     "severity": "..."
> },

Added the subset field to the error messages!

zhiltsov-max previously approved these changes Mar 19, 2021
@zhiltsov-max zhiltsov-max merged commit cdd5184 into develop Mar 19, 2021
@zhiltsov-max zhiltsov-max deleted the add-validator-classification-detection branch March 24, 2021 11:04