Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added user option to specify dataset download path #4774

Closed
wants to merge 21 commits into from
Closed

Added user option to specify dataset download path #4774

wants to merge 21 commits into from

Conversation

kalenmike
Copy link
Member

@kalenmike kalenmike commented Sep 13, 2021

Added the option for the user to pass the --sandbox='custom/path/to' argument which ensures YOLOv5 unzips the dataset into this custom folder. The --project path is overwritten to match what is passed into --sandbox this ensures both the dataset and runs are kept in the same folder.

The idea is that this becomes a sandbox option and all training-specific files are kept in the same user-defined folder to ensure that runs do not mix or overwrite each other.

πŸ› οΈ PR Summary

Made with ❀️ by Ultralytics Actions

🌟 Summary

Enhanced dataset handling and configuration flexibility in YOLOv5 training.

πŸ“Š Key Changes

  • Commented out the Pillow requirement in requirements.txt.
  • Added a --sandbox flag to train.py for alternative file storage, affecting dataset checks and path settings.
  • Simplified image and label verification logic in utils/datasets.py, making error processes more transparent.
  • Updated check_dataset function in utils/general.py to handle custom dataset root paths via the --sandbox flag and clarified autodownload prompts.

🎯 Purpose & Impact

  • πŸ›  Removing the strict dependency on the Pillow library could help in environments where this library is problematic or conflicts with other packages.
  • πŸ—ƒ The new --sandbox option allows all training-related files, including datasets, to be saved in a specified directory, streamlining project organization and improving user flexibility.
  • πŸ“ˆ Simplification of the verification process in dataset utilities enhances code maintainability and could lead to fewer silent failures during dataset loading.
  • πŸ”„ The updates to check_dataset support clearer dataset download feedback and better management of dataset paths, potentially making YOLOv5 more robust in various usage scenarios.

@github-actions
Copy link
Contributor

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions YOLOv5 πŸš€ and Vision AI ⭐.

@github-actions github-actions bot added the Stale label Nov 18, 2021
@github-actions github-actions bot closed this Nov 25, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant