Hard coded 'val' key in val.py #4635

robin-maillot · 2021-09-01T09:20:54Z

Before submitting a bug report, please be aware that your issue must be reproducible with all of the following,
otherwise it is non-actionable, and we can not help you:

Current repo: run git fetch && git status -uno to check and git pull to update repo
Common dataset: coco.yaml or coco128.yaml
Common environment: Colab, Google Cloud, or Docker image. See https://github.com/ultralytics/yolov5#environments

If this is a custom dataset/training question you must include your train*.jpg, val*.jpg and results.png
figures, or we can not help you. You can generate these with utils.plot_results().

🐛 Bug

A clear and concise description of what the bug is.

When running val.py:run() the is_coco variable is calculated based on the val key and not the task key.

This means when running python val.py --data data/coco128.yaml --task test

To Reproduce (REQUIRED)

Input:

modify coco128.yaml to : (I could not add the yaml file directly for some reason sorry)

# YOLOv5 🚀 by Ultralytics, GPL-3.0 license
# COCO128 dataset https://www.kaggle.com/ultralytics/coco128 (first 128 images from COCO train2017)
# Example usage: python train.py --data coco128.yaml
# parent
# ├── yolov5
# └── datasets
#     └── coco128  ← downloads here


# Train/val/test sets as 1) dir: path/to/imgs, 2) file: path/to/imgs.txt, or 3) list: [path/to/imgs1, path/to/imgs2, ..]
path: ../datasets/coco128  # dataset root dir
train: images/train2017  # train images (relative to 'path') 128 images
#val: images/train2017  # val images (relative to 'path') 128 images
test: images/train2017 # test images (optional)

# Classes
nc: 80  # number of classes
names: ['person', 'bicycle', 'car', 'motorcycle', 'airplane', 'bus', 'train', 'truck', 'boat', 'traffic light',
        'fire hydrant', 'stop sign', 'parking meter', 'bench', 'bird', 'cat', 'dog', 'horse', 'sheep', 'cow',
        'elephant', 'bear', 'zebra', 'giraffe', 'backpack', 'umbrella', 'handbag', 'tie', 'suitcase', 'frisbee',
        'skis', 'snowboard', 'sports ball', 'kite', 'baseball bat', 'baseball glove', 'skateboard', 'surfboard',
        'tennis racket', 'bottle', 'wine glass', 'cup', 'fork', 'knife', 'spoon', 'bowl', 'banana', 'apple',
        'sandwich', 'orange', 'broccoli', 'carrot', 'hot dog', 'pizza', 'donut', 'cake', 'chair', 'couch',
        'potted plant', 'bed', 'dining table', 'toilet', 'tv', 'laptop', 'mouse', 'remote', 'keyboard', 'cell phone',
        'microwave', 'oven', 'toaster', 'sink', 'refrigerator', 'book', 'clock', 'vase', 'scissors', 'teddy bear',
        'hair drier', 'toothbrush']  # class names


# Download script/URL (optional)
download: https://github.com/ultralytics/yolov5/releases/download/v1.0/coco128.zip

Run : python val.py --data data/coco128.yaml --task test

Output:

Traceback (most recent call last):
  File "val.py", line 354, in <module>
    main(opt)
  File "val.py", line 329, in main
    run(**vars(opt))
  File "D:\Nanovare\dev\.yolov5-venv\lib\site-packages\torch\autograd\grad_mode.py", line 28, in decorate_context
    return func(*args, **kwargs)
  File "val.py", line 137, in run
    is_coco = type(data['val']) is str and data['val'].endswith('coco/val2017.txt')  # COCO dataset
KeyError: 'val'

Expected behavior

A clear and concise description of what you expected to happen.

is_coco should be calculated based on task key and not hard coded val key

Environment

If applicable, add screenshots to help explain your problem.

OS: Windows10
GPU: NVIDIA GeForce RTX 2060

Additional context

Add any other context about the problem here.

The text was updated successfully, but these errors were encountered:

glenn-jocher · 2021-09-01T14:30:20Z

@robin-maillot good news 😃! Your original issue may now be fixed ✅ in PR #4642. This PR uses the safer .get() method to retrieve the val key, and will return None by default if val key is missing, setting is_coco=False.

To receive this update:

Git – git pull from within your yolov5/ directory or git clone https://github.com/ultralytics/yolov5 again
PyTorch Hub – Force-reload with model = torch.hub.load('ultralytics/yolov5', 'yolov5s', force_reload=True)
Notebooks – View updated notebooks
Docker – sudo docker pull ultralytics/yolov5:latest to update your image

Thank you for spotting this issue and informing us of the problem. Please let us know if this update resolves the issue for you, and feel free to inform us of any other issues you discover or feature requests that come to mind. Happy trainings with YOLOv5 🚀!

robin-maillot · 2021-09-01T14:41:57Z

@glenn-jocher seems good but in the case that we are not training should it not be the task key instead of val?

Because later on we use the data['task'] to create the dataloader:

    # Dataloader
    if not training:
        if device.type != 'cpu':
            model(torch.zeros(1, 3, imgsz, imgsz).to(device).type_as(next(model.parameters())))  # run once
        task = task if task in ('train', 'val', 'test') else 'val'  # path to train/val/test images
        dataloader = create_dataloader(data[task], imgsz, batch_size, gs, single_cls, pad=0.5, rect=True,
                                       prefix=colorstr(f'{task}: '))[0]

I feel like in this case it makes sense to use something like:

is_coco = isinstance(data.get('task'), str) and data['val'].endswith('coco/val2017.txt') # COCO dataset

Since I do not use coco anyways the fix you provided solves my issues, thanks :)

I can submit a PR directly if you prefer to review it that way?

glenn-jocher · 2021-09-01T14:46:44Z

@robin-maillot well the data dict for coco.yaml will always have a val key regardless of task = 'train|val|test', so in this line we are only trying to establish if the dataset is the official COCO dataset. The user should still be able to run python val.py --task test and everything will work correctly I think.

glenn-jocher · 2021-09-01T14:48:07Z

@robin-maillot also remember sometimes task can also be set to speed or study to reproduce README plots and tables.

yolov5/val.py

Line 303 in fad57c2

    
           parser.add_argument('--task', default='val', help='train, val, test, speed or study')

robin-maillot · 2021-09-01T14:51:33Z

@glenn-jocher makes sense, I was thinking of transfer learning cases where one might want to train/validate on some other dataset but test on coco, this might be an edge case that is better solved using the speed or study options like you mention.

Thanks for taking the time to solve the bug :)

robin-maillot added the bug Something isn't working label Sep 1, 2021

glenn-jocher linked a pull request Sep 1, 2021 that will close this issue

Fix is_coco on missing data['val'] key #4642

Merged

glenn-jocher closed this as completed in #4642 Sep 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hard coded 'val' key in val.py #4635

Hard coded 'val' key in val.py #4635

robin-maillot commented Sep 1, 2021

glenn-jocher commented Sep 1, 2021

robin-maillot commented Sep 1, 2021 •

edited

Loading

glenn-jocher commented Sep 1, 2021

glenn-jocher commented Sep 1, 2021

robin-maillot commented Sep 1, 2021

Hard coded 'val' key in val.py #4635

Hard coded 'val' key in val.py #4635

Comments

robin-maillot commented Sep 1, 2021

🐛 Bug

To Reproduce (REQUIRED)

Expected behavior

Environment

Additional context

glenn-jocher commented Sep 1, 2021

robin-maillot commented Sep 1, 2021 • edited Loading

glenn-jocher commented Sep 1, 2021

glenn-jocher commented Sep 1, 2021

robin-maillot commented Sep 1, 2021

robin-maillot commented Sep 1, 2021 •

edited

Loading