test image label shift #28

Closed
lucasjinreal opened this issue Jun 9, 2020 · 13 comments

@lucasjinreal

My training image labels look normal, but the test image labels are shifted. How is that possible? I have exactly the same number of images and labels:

(attached screenshot of a test batch)

Some are normal, some are shifted, while the training labels are all normal. What could the problem be?

lucasjinreal added the bug label on Jun 9, 2020
@glenn-jocher
Member

@jinfagang unless you can reproduce this on COCO, your labels are the problem.

glenn-jocher removed the bug label on Jun 9, 2020
@lucasjinreal
Author

(attached screenshot)

128

@glenn-jocher
Member

You should clone the repo again; something is wrong with your local version.

@glenn-jocher
Member

Also, the Colab notebook shows easy examples of training.

@lucasjinreal
Author

@glenn-jocher Hi, I found it was caused by the images and labels directories not having the same number of files.

What should I do if I have multiple batches of data? I placed all images in a single folder and each batch's labels in a separate directory, and every time I use a different batch, I soft-link that batch's labels into the labels folder.

In this case, it cannot load properly since the counts are not the same. How can I make this workable if I don't want to make many copies of the images (which wastes disk space and is hard to organize)?
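For context, here is a minimal sketch of the soft-link setup described above, assuming one images folder plus per-batch label folders (the directory names and the link_batch_labels helper are illustrative, not part of the repo):

import os
from pathlib import Path

def link_batch_labels(batch_dir, labels_dir='labels'):
    # symlink every *.txt label file from one batch directory into labels/
    labels_dir = Path(labels_dir)
    labels_dir.mkdir(exist_ok=True)
    for old in labels_dir.glob('*.txt'):  # drop links left over from the previously active batch
        if old.is_symlink():
            old.unlink()
    for src in Path(batch_dir).glob('*.txt'):
        os.symlink(src.resolve(), labels_dir / src.name)

# example: activate the labels of batch 1
# link_batch_labels('labels_batch1')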

@glenn-jocher
Copy link
Member

@jinfagang yes I understand. The repo caches labels into *.npy files for faster loading. Perhaps a rework of this is in order for greater flexibility.

A quick fix is to delete the *.npy files and use text labels instead, or to update the dataloader to ignore *.npy files:

yolov5/utils/datasets.py

Lines 332 to 341 in c14368d

np_labels_path = str(Path(self.label_files[0]).parent) + '.npy'  # saved labels in *.npy file
if os.path.isfile(np_labels_path):
    s = np_labels_path  # print string
    x = np.load(np_labels_path, allow_pickle=True)
    if len(x) == n:
        self.labels = x
        labels_loaded = True
else:
    s = path.replace('images', 'labels')
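For reference, a minimal sketch of the "delete the *.npy files" option; the cache is written as '<labels_dir>.npy' next to the labels folder (see np_labels_path above), and the path below is illustrative:

from pathlib import Path

labels_dir = Path('../coco128/labels/train2017')  # adjust to your labels folder
cache = Path(str(labels_dir) + '.npy')            # mirrors np_labels_path in datasets.py
if cache.is_file():
    cache.unlink()
    print('removed', cache)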

@balag59

balag59 commented Jul 8, 2020

@glenn-jocher Thank you for your work! I seem to be facing the same issue currently. Has any fix been attempted to resolve this? I've commented out the part in datasets.py where l is created from the .npy file when labels_loaded=True, so that l is created directly from the .txt files. However, the visualized training batches 0 and 1 still seem to suffer from this issue.

Will this affect only the visualizations, or will the predictions be affected too?
Is there anything else I can do to ensure this doesn't happen?

@glenn-jocher
Member

@balag59 ha, the visualizations are not there to look pretty; they show you the actual data used for training. If they are not correct, your training won't be correct. If you delete the *.npy files and are still seeing problems, then your actual labels are likely problematic. I'm working on a major upgrade of label caching and loading, which should be done this week or next.

In the meantime, start from the tutorial, which works well, and make sure your custom dataset follows exactly the same format and directory structure as coco128. You can also inspect coco128 on Kaggle:
https://www.kaggle.com/ultralytics/coco128
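For reference, the coco128 layout looks roughly like this; a custom dataset should mirror it, with each image matched by a label file of the same stem:

coco128/
├── images/
│   └── train2017/
│       ├── 000000000009.jpg
│       └── ...
└── labels/
    └── train2017/
        ├── 000000000009.txt   # one 'class x_center y_center width height' row per object, normalized 0-1
        └── ...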

@balag59

balag59 commented Jul 8, 2020

@glenn-jocher Thank you for the prompt response! I have used your yolov3 implementation extensively on custom datasets before and it worked great, and the preparation here is very similar except for the yml part. This dataset is a collection of four datasets (one of them being COCO), and there seems to be a weird issue where the image shown and its displayed name don't match. I've added a prefix to each of the other three datasets and none to COCO, so I know where each image is from. Could there be a mismatch between the images and labels? The numbers of images and labels in the train folder are not the same. I'm using a single class only (the people class). Here is what the visualizations look like:
(attached: train_batch0, train_batch1, train_batch2 mosaic images)

@glenn-jocher
Member

Yes, it may be related to a different number of images and labels. The yolov5 repo has updates that remove the need to create a list of images used for training: you can now simply supply an /images folder in your data.yaml file, and it will look for labels in a corresponding /labels folder. This is how coco128 works.
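If it helps, here is a quick sketch (paths are illustrative) to check that every image has a matching label file in the corresponding /labels folder before training:

from pathlib import Path

img_dir = Path('datasets/custom/images/train')  # illustrative path
lbl_dir = Path('datasets/custom/labels/train')  # found by swapping 'images' -> 'labels'

images = {p.stem for p in img_dir.glob('*') if p.suffix.lower() in ('.jpg', '.jpeg', '.png', '.bmp')}
labels = {p.stem for p in lbl_dir.glob('*.txt')}

print(len(images), 'images,', len(labels), 'labels')
print('images without labels:', sorted(images - labels)[:10])
print('labels without images:', sorted(labels - images)[:10])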

@balag59

balag59 commented Jul 8, 2020

I'm aware, and that is what I'm doing right now. I've just given the path to the images folder in the yml file, and this is still happening. Would creating a list make any difference, or is there something else I need to try?

@MarioDuran

Hi @balag59, if your custom dataset has fewer than 1000 images and labels, there is no problem, but if you have more than 1000 images, you will need to pad your custom dataset with new or repeated data so that its size is a multiple of 1000.

At least this workaround is working for me. Hope it helps.

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
