HUB dataset_stats() error reporting #8192

glenn-jocher · 2022-06-13T10:35:05Z

🛠️ PR Summary

_{Made with ❤️ by Ultralytics Actions}

🌟 Summary

Enhancing dataset yaml loading robustness in dataloaders.py.

📊 Key Changes

Wrapped yaml loading inside a try block.
Added an except block to raise a specific exception on yaml loading failure.

🎯 Purpose & Impact

Purpose: To provide a more clear and user-friendly error message if there are issues loading the dataset yaml file.
Impact: Users will encounter a more descriptive error when yaml loading fails, making debugging easier and improving overall user experience. 🛠️

for more information, see https://pre-commit.ci

glenn-jocher · 2022-06-13T13:14:40Z

@kalenmike how's this?

kalenmike · 2022-06-13T15:42:20Z

@glenn-jocher It will work great to check for YAML parse errors, but can we handle for the other errors as well? Like the paths don't exist for example.

glenn-jocher · 2022-06-13T22:27:44Z

@kalenmike hmm, most of the finer checks are run by the default YOLOv5 function here:

yolov5/utils/dataloaders.py

Line 1031 in 6a67594

check_dataset(data, autodownload) # download dataset if missing

It looks like only the val path is specifically checked though. We could expand to all parts of the dataset: train, val, test:
i.e. L485

yolov5/utils/general.py

Lines 472 to 487 in 6a67594

    
           # Resolve paths 
        
           path = Path(extract_dir or data.get('path') or '')  # optional 'path' default to '.' 
        
           if not path.is_absolute(): 
        
               path = (ROOT / path).resolve() 
        
           for k in 'train', 'val', 'test': 
        
               if data.get(k):  # prepend path 
        
                   data[k] = str(path / data[k]) if isinstance(data[k], str) else [str(path / x) for x in data[k]] 
        
           # Parse yaml 
        
           train, val, test, s = (data.get(x) for x in ('train', 'val', 'test', 'download')) 
        
           if val: 
        
               val = [Path(x).resolve() for x in (val if isinstance(val, list) else [val])]  # val path 
        
               if not all(x.exists() for x in val): 
        
                   LOGGER.info(emojis('\nDataset not found ⚠, missing paths %s' % [str(x) for x in val if not x.exists()])) 
        
                   if not s or not autodownload: 
        
                       raise Exception(emojis('Dataset not found ❌'))

glenn-jocher · 2022-06-17T16:37:44Z

@kalenmike PR merged to avoid letting it linger. Will update HUB tag to master and we can work on additional error reporting next week.

* HUB dataset_stats() error reporting * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Update dataloaders.py Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

awetzel-lionpower · 2023-03-14T19:52:57Z

I am currently getting the generic "Failed: Processing Error". Wondering if the more detailed error reporting has been finished. My data works fine with yolov5.

glenn-jocher · 2023-11-15T15:52:15Z

@awetzel-lionpower glad your data works with YOLOv5! We've enhanced error reporting with the latest changes. Please update to the latest version and provide the specific error message if the issue persists. Always happy to help!

glenn-jocher and others added 3 commits June 13, 2022 12:34

HUB dataset_stats() error reporting

886e4ca

[pre-commit.ci] auto fixes from pre-commit.com hooks

3d5bbd7

for more information, see https://pre-commit.ci

Update dataloaders.py

4748a0b

glenn-jocher assigned kalenmike and glenn-jocher Jun 13, 2022

glenn-jocher requested a review from kalenmike June 13, 2022 13:14

glenn-jocher mentioned this pull request Jun 13, 2022

Dataset upload fails with large number of images ultralytics/hub#42

Closed

1 task

Merge branch 'master' into update/hub_errors

263902a

glenn-jocher merged commit d605138 into master Jun 17, 2022

glenn-jocher deleted the update/hub_errors branch June 17, 2022 16:37

Hojland mentioned this pull request Oct 17, 2022

feat/bump Go-Autonomous/yolov5#15

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HUB dataset_stats() error reporting #8192

HUB dataset_stats() error reporting #8192

glenn-jocher commented Jun 13, 2022 •

edited by UltralyticsAssistant

Loading

glenn-jocher commented Jun 13, 2022

kalenmike commented Jun 13, 2022

glenn-jocher commented Jun 13, 2022

glenn-jocher commented Jun 17, 2022

awetzel-lionpower commented Mar 14, 2023

glenn-jocher commented Nov 15, 2023

HUB dataset_stats() error reporting #8192

HUB dataset_stats() error reporting #8192

Conversation

glenn-jocher commented Jun 13, 2022 • edited by UltralyticsAssistant Loading

🛠️ PR Summary

🌟 Summary

📊 Key Changes

🎯 Purpose & Impact

glenn-jocher commented Jun 13, 2022

kalenmike commented Jun 13, 2022

glenn-jocher commented Jun 13, 2022

glenn-jocher commented Jun 17, 2022

awetzel-lionpower commented Mar 14, 2023

glenn-jocher commented Nov 15, 2023

glenn-jocher commented Jun 13, 2022 •

edited by UltralyticsAssistant

Loading