Add QFocalLoss() #1482
Conversation
@yxNONG very interesting, thanks for the PR. It is very much the case in YOLOv5 that objectness target values are between 0 and 1, since they are set to the achieved IoU (the YOLOv3 paper set them to 1.0). This means that objectness targets start near 0 at the beginning of training and trend towards 1 as the regression output becomes more accurate in the later stages of training. Fractional objectness target values like 0.7 are very common.
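The target-assignment scheme described here can be sketched as follows. This is a simplified illustration, not the actual YOLOv5 code: the function name and the `gr` blending ratio are assumptions for the sketch, under the stated assumption that each matched prediction's objectness target is the IoU it achieved.

```python
# Hedged sketch of how fractional objectness targets arise: each matched
# prediction's objectness target is set to the IoU it achieved, optionally
# blended with 1.0 via a ratio `gr` (gr=1.0 means target == IoU).
def objectness_targets(ious, gr=1.0):
    """target = (1 - gr) + gr * iou, clamped so negative IoUs map to (1 - gr)."""
    return [(1.0 - gr) + gr * max(iou, 0.0) for iou in ious]

# Early in training IoUs are low, so targets start near 0 and trend
# toward 1 as box regression improves; values like 0.7 are common.
print(objectness_targets([0.12, 0.70, 0.95]))  # [0.12, 0.7, 0.95]
```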
@glenn-jocher I noticed that the targets can be values other than 0 and 1, so I submitted the PR. I will double-check that the implementation is correct later.
/rebase
e2ed25b to 3dfb3ae
/rebase
@yxNONG is this PR ready to merge?
18f700d to 93120f2
@glenn-jocher, yes, I think it works correctly now (see the targets, normal_loss, focal_loss, and qfocal_loss outputs).
@yxNONG great! I will merge this PR then. Thank you for your contributions.
Hi, it seems that this QFL implementation does not expose an API for us to use it?
In the paper I see only one hyperparameter, gamma, but your implementation has two hyperparameters, alpha and gamma. What is the difference, and how should these two hyperparameters be set?
@nikbobo, the Quality Focal Loss implementation does have two hyperparameters: gamma, which controls the balance between easy and hard examples, and alpha, which controls the class balance. Both can be adjusted to tune the behavior of the loss function. To use QFocalLoss, please refer to the documentation at https://docs.ultralytics.com/yolov5/. Thank you for bringing this to our attention.
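For reference, a minimal sketch of a quality-focal-loss wrapper along these lines, showing where gamma and alpha enter. The defaults gamma=1.5 and alpha=0.25 are illustrative assumptions; check utils/loss.py in the repository for the merged code.

```python
import torch
import torch.nn as nn

class QFocalLoss(nn.Module):
    """Sketch of a quality-focal-loss wrapper around nn.BCEWithLogitsLoss.

    gamma down-weights easy examples (small |target - pred_prob|);
    alpha re-weights positive vs. negative examples.
    """

    def __init__(self, loss_fcn, gamma=1.5, alpha=0.25):
        super().__init__()
        self.loss_fcn = loss_fcn  # expected: nn.BCEWithLogitsLoss()
        self.gamma = gamma
        self.alpha = alpha
        self.reduction = loss_fcn.reduction
        self.loss_fcn.reduction = "none"  # modulate elementwise, reduce at the end

    def forward(self, pred, true):
        loss = self.loss_fcn(pred, true)          # elementwise BCE on logits
        pred_prob = torch.sigmoid(pred)
        alpha_factor = true * self.alpha + (1 - true) * (1 - self.alpha)
        # Key QFL difference: |target - prob|**gamma instead of (1 - p_t)**gamma,
        # so a prediction matching a fractional target gets near-zero loss.
        modulating_factor = torch.abs(true - pred_prob) ** self.gamma
        loss = loss * alpha_factor * modulating_factor
        if self.reduction == "mean":
            return loss.mean()
        if self.reduction == "sum":
            return loss.sum()
        return loss
```

Usage is the same as the plain focal-loss wrapper: `obj_loss = QFocalLoss(nn.BCEWithLogitsLoss())`.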
Implements the quality focal loss, a more general case of focal loss;
more detail in https://arxiv.org/abs/2006.04388.
In the obj loss (or the cls loss with label smoothing), the targets are no longer strictly 0 or 1 (they can be, e.g., 0.7), and in this case the normal focal loss does not behave correctly.
Quality focal loss behaves the same as focal loss when the target equals 0 or 1, and also behaves correctly when the target lies in (0, 1).
example:
targets:
tensor([[0.6225, 0.0000, 0.0000],
[0.9000, 0.0000, 0.0000],
[1.0000, 0.0000, 0.0000]])
pred_prob:
tensor([[0.6225, 0.2689, 0.1192],
[0.7773, 0.5000, 0.2227],
[0.8176, 0.8808, 0.1978]])
focal_loss
tensor([[0.0937, 0.0328, 0.0039],
[0.0166, 0.1838, 0.0199],
[0.0039, 1.3186, 0.0145]])
qfocal_loss
tensor([[7.5373e-08, 3.2768e-02, 3.9179e-03],
[4.8601e-03, 1.8380e-01, 1.9857e-02],
[3.9233e-03, 1.3186e+00, 1.4545e-02]])
We can see that targets[0][0] = 0.6225 matches pred_prob[0][0] = 0.6225 exactly,
while targets[1][0] = 0.9 is greater than pred_prob[1][0] = 0.7773 by 0.1227.
However, focal_loss[0][0] = 0.0937 is larger than focal_loss[1][0] = 0.0166, which goes against the purpose of focal loss.
Quality focal loss correctly handles the case where targets are not equal to 0 or 1.
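The comparison above can be checked by hand. Below is a small framework-free sketch of both per-element losses, assuming gamma=1.5 and alpha=0.25 (which reproduce the table entries); the function names are illustrative.

```python
import math

def bce(p, t):
    # elementwise binary cross-entropy on probabilities
    return -(t * math.log(p) + (1 - t) * math.log(1 - p))

def focal(p, t, gamma=1.5, alpha=0.25):
    # standard focal loss: modulates BCE by (1 - p_t)**gamma
    p_t = t * p + (1 - t) * (1 - p)
    alpha_f = t * alpha + (1 - t) * (1 - alpha)
    return bce(p, t) * alpha_f * (1 - p_t) ** gamma

def qfocal(p, t, gamma=1.5, alpha=0.25):
    # quality focal loss: modulates BCE by |t - p|**gamma instead
    alpha_f = t * alpha + (1 - t) * (1 - alpha)
    return bce(p, t) * alpha_f * abs(t - p) ** gamma

# Entry [0][0]: pred 0.6225 vs target 0.6225 (near-perfect prediction).
# Entry [1][0]: pred 0.7773 vs target 0.9 (prediction off by 0.1227).
print(focal(0.6225, 0.6225), focal(0.7773, 0.9))    # ~0.0937, ~0.0166 (wrong ordering)
print(qfocal(0.6225, 0.6225), qfocal(0.7773, 0.9))  # ~0.0,    ~0.0049 (correct ordering)
```

Focal loss penalizes the near-perfect prediction more than the worse one; quality focal loss restores the intended ordering.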
🛠️ PR Summary
Made with ❤️ by Ultralytics Actions
🌟 Summary
Implementation of Quality Focal Loss (QFL) in the YOLOv5 loss computation.
📊 Key Changes
QFocalLoss added as a wrapper around the existing loss function.
🎯 Purpose & Impact