Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why is background FP so high? #12859

Closed
1 task done
a41497254 opened this issue Mar 28, 2024 · 5 comments
Closed
1 task done

Why is background FP so high? #12859

a41497254 opened this issue Mar 28, 2024 · 5 comments
Labels
question Further information is requested Stale

Comments

@a41497254
Copy link

Search before asking

Question

I am using my customized data set, and the training result background FP is very high. The following are the relevant training results. What problem did I encounter?
confusion_matrix (5)
results (5)

Additional

No response

@a41497254 a41497254 added the question Further information is requested label Mar 28, 2024
Copy link
Contributor

👋 Hello @a41497254, thank you for your interest in YOLOv5 🚀! Please visit our ⭐️ Tutorials to get started, where you can find quickstart guides for simple tasks like Custom Data Training all the way to advanced concepts like Hyperparameter Evolution.

If this is a 🐛 Bug Report, please provide a minimum reproducible example to help us debug it.

If this is a custom training ❓ Question, please provide as much information as possible, including dataset image examples and training logs, and verify you are following our Tips for Best Training Results.

Requirements

Python>=3.8.0 with all requirements.txt installed including PyTorch>=1.8. To get started:

git clone https://github.com/ultralytics/yolov5  # clone
cd yolov5
pip install -r requirements.txt  # install

Environments

YOLOv5 may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):

Status

YOLOv5 CI

If this badge is green, all YOLOv5 GitHub Actions Continuous Integration (CI) tests are currently passing. CI tests verify correct operation of YOLOv5 training, validation, inference, export and benchmarks on macOS, Windows, and Ubuntu every 24 hours and on every commit.

Introducing YOLOv8 🚀

We're excited to announce the launch of our latest state-of-the-art (SOTA) object detection model for 2023 - YOLOv8 🚀!

Designed to be fast, accurate, and easy to use, YOLOv8 is an ideal choice for a wide range of object detection, image segmentation and image classification tasks. With YOLOv8, you'll be able to quickly and accurately detect objects in real-time, streamline your workflows, and achieve new levels of accuracy in your projects.

Check out our YOLOv8 Docs for details and get started with:

pip install ultralytics

@glenn-jocher
Copy link
Member

@a41497254 hello there! 😊 It looks like you're facing an issue with high false positives (FP) for background in your custom dataset. This can be a bit tricky, but let's tackle it together.

A high background FP usually indicates that the model is having difficulty distinguishing between background and foreground classes. This might be due to a few reasons:

  1. Imbalanced Data: Ensure your dataset has a balanced number of instances for each class, including diverse backgrounds.
  2. Data Quality: Verify that your images and annotations are of high quality. Low resolution or poorly annotated images can confuse the model.
  3. Model Complexity: For complex datasets, you might need to adjust the model architecture or increase the depth / width of your model to better capture features.
  4. Training Parameters: Experiment with different learning rates, batch sizes, or augmentation techniques. Sometimes tweaking these can lead to significant improvements.

If you've not done it yet, I recommend checking our documentation on best practices for training on custom datasets here: https://docs.ultralytics.com/yolov5/. It's filled with insights that can help improve your model's performance.

Let's make your YOLOv5 journey successful together! And remember, the road to a perfectly trained model might take some experimentation. Stay curious! 🌟

@a41497254
Copy link
Author

a41497254 commented Mar 28, 2024

Hello @glenn-jocher , I want to ask about data imbalance. This is a picture of my data distribution. I have the most "W" types. Why is high background FP the most serious after the model is trained?
labels

@glenn-jocher
Copy link
Member

Hello @a41497254! 👋 Data imbalance can indeed cause models, like YOLOv5, to have higher false positives, especially in categories with fewer examples. This happens because the model becomes biased towards the classes with more data—like "W" in your case, and might over-predict them, leading to more background being incorrectly classified.

A few strategies to mitigate this include:

  • Data Augmentation: Increase the diversity of your smaller classes through methods such as flipping, scaling, or color variation.
  • Weighted Loss: Adjust the loss function to weigh errors on minority classes more than errors on majority classes.
  • Oversampling: Manually or automatically oversample the minority classes to balance the dataset.

Remember, every dataset and problem is unique, so it might take some experimentation to find the right approach for your specific case. Happy training! 💪

Copy link
Contributor

github-actions bot commented May 9, 2024

👋 Hello there! We wanted to give you a friendly reminder that this issue has not had any recent activity and may be closed soon, but don't worry - you can always reopen it if needed. If you still have any questions or concerns, please feel free to let us know how we can help.

For additional resources and information, please see the links below:

Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLO 🚀 and Vision AI ⭐

@github-actions github-actions bot added the Stale label May 9, 2024
@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale May 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested Stale
Projects
None yet
Development

No branches or pull requests

2 participants