How to find the best.pt is the result of which epoch? #8701

xiaohangguo · 2022-07-24T11:44:04Z

Search before asking

I have searched the YOLOv5 issues and discussions and found no similar questions.

Question

1.After iterating many times, I trained a model and produced a "best.pt" file. I know its meaning. My question is: how do I know which training result it is? In other words, can I find the data result of which training it is in result.csv?
2.During the experiment, I found that after the training model is completed, it may break inexplicably, but yolov5 will count the experimental results at the end of the training, draw the f1/p/pr/r/result curve, and produce a train_ batch val_ batch val_ PRED... What should I do if this happens? The training has been completed, but the visualization results have not been counted. I only found the code for drawing several images on the Internet, which is the code calling yolov5, but I can't get all these images.what should I do？

Additional

This is the situation I described. Every time I solve it, I practice it again. This is the most direct but stupid way

This is a normal result

github-actions · 2022-07-24T11:44:47Z

👋 Hello @xiaohangguo, thank you for your interest in YOLOv5 🚀! Please visit our ⭐️ Tutorials to get started, where you can find quickstart guides for simple tasks like Custom Data Training all the way to advanced concepts like Hyperparameter Evolution.

If this is a 🐛 Bug Report, please provide screenshots and minimum viable code to reproduce your issue, otherwise we can not help you.

If this is a custom training ❓ Question, please provide as much information as possible, including dataset images, training logs, screenshots, and a public link to online W&B logging if available.

For business inquiries or professional support requests please visit https://ultralytics.com or email support@ultralytics.com.

Requirements

Python>=3.7.0 with all requirements.txt installed including PyTorch>=1.7. To get started:

git clone https://github.com/ultralytics/yolov5  # clone
cd yolov5
pip install -r requirements.txt  # install

Environments

YOLOv5 may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):

Google Colab and Kaggle notebooks with free GPU:
Google Cloud Deep Learning VM. See GCP Quickstart Guide
Amazon Deep Learning AMI. See AWS Quickstart Guide
Docker Image. See Docker Quickstart Guide

Status

If this badge is green, all YOLOv5 GitHub Actions Continuous Integration (CI) tests are currently passing. CI tests verify correct operation of YOLOv5 training (train.py), validation (val.py), inference (detect.py) and export (export.py) on macOS, Windows, and Ubuntu every 24 hours and on every commit.

glenn-jocher · 2022-07-25T16:32:12Z

@xiaohangguo best.pt is saved on every maximum fitness epoch. For detection models fitness is essentially mAP@0.5:0.95:

yolov5/utils/metrics.py

Lines 15 to 19 in b367860

    
           def fitness(x): 
        
               # Model fitness as a weighted combination of metrics 
        
               w = [0.0, 0.0, 0.1, 0.9]  # weights for [P, R, mAP@0.5, mAP@0.5:0.95] 
        
               return (x[:, :4] * w).sum(1)

I don't understand your other question, but you can validate trained models easily using val.py which will create all output images like confusion matrices, PR curves, etc.

python val.py --data ... --weights ...

xiaohangguo · 2022-07-25T16:39:44Z

1.thank you for your help. And your mean that the best.ptis the bigest of mAP@[0.5:0.95]?
2.ok,i will try it ,I think it should be that the computer stuck when saving the drawing result image caused the drawing to fail, I mean that the final image was not exported in the default way, but these seem to be unimportant, haha

xiaohangguo · 2022-07-25T16:52:01Z

Can you give me some suggestions on the training of super parameter learning? I understand the meaning of "weight_caution"'warmupepochs "and other parameters. What I want to ask is, if I want to do some experiments and optimize the model by adjusting the size of super parameters, how should I start? Do you have any suggestions?

glenn-jocher · 2022-07-25T17:09:33Z

@xiaohangguo you can try to tune hyperparameters manually, or you can evolve hyperparameters. Evolution takes a lot of time and resources but is a good solution that requires little human oversight. See Hyperparameter Evolution tutorial to get started.

YOLOv5 Tutorials

Train Custom Data 🚀 RECOMMENDED
Tips for Best Training Results ☘️ RECOMMENDED
Weights & Biases Logging 🌟 NEW
Roboflow for Datasets, Labeling, and Active Learning 🌟 NEW
Multi-GPU Training
PyTorch Hub ⭐ NEW
TFLite, ONNX, CoreML, TensorRT Export 🚀
Test-Time Augmentation (TTA)
Model Ensembling
Model Pruning/Sparsity
Hyperparameter Evolution
Transfer Learning with Frozen Layers ⭐ NEW
Architecture Summary ⭐ NEW

Good luck 🍀 and let us know if you have any other questions!

xiaohangguo · 2022-07-25T17:17:05Z

Thanks for the tutorial, I would like to ask why you have not posted a paper, haha

glenn-jocher · 2022-07-25T17:17:59Z

No time

xiaohangguo · 2022-07-25T17:19:03Z

When I was surfing the Internet, I saw that you said you would eat a hat if you didn't send papers

xiaohangguo · 2022-07-25T18:00:11Z

bro,Can I download yolov3.pt file to modify the parameters of "yolov5" --weights "to train the model

glenn-jocher · 2022-07-25T20:04:05Z

This is true. No time to do that either.

You can get YOLOv3 weights at https://github.com/ultralytics/yolov3

AgusRaharja69 · 2022-11-20T15:41:58Z

in the plots.py I see the way you get the best final epoch for best training results with this equation, can you explain the equation?
index = np.argmax(0.9 * data.values[:, 8] + 0.1 * data.values[:, 7] + 0.9 * data.values[:, 12] + 0.1 * data.values[:, 11])

xiaohangguo · 2022-11-20T16:03:03Z

Do you have any additional information for calculating evaluation metrics, such as what data corresponds to the headers of these 7, 8, 11, and 12 columns? I haven't looked at the yolo code for a long time, where do you mean the code from? Any other information? However, for the equation you gave, I estimate that I have weighted the sum according to the maximum values in these columns to obtain a certain evaluation indicator.

in the plots.py I see the way you get the best final epoch for best training results with this equation, can you explain the equation? index = np.argmax(0.9 * data.values[:, 8] + 0.1 * data.values[:, 7] + 0.9 * data.values[:, 12] + 0.1 * data.values[:, 11])

guptasaumya · 2022-12-07T15:43:55Z

@glenn-jocher , Is top1_acc, the fitness measure for best.pt?

glenn-jocher · 2022-12-07T18:04:01Z

@guptasaumya for classification models yes!

SaraDadjouy · 2023-06-26T12:17:15Z

@glenn-jocher Hi.
In the detection task, best.pt is chosen based on what?

glenn-jocher · 2023-06-26T14:47:47Z

@SaraDadjouy hello!

best.pt is the checkpoint file that has the best validation loss during training. It is selected based on the best overall performance of the model on the validation dataset.

I hope this helps! Let me know if you have any further questions.

Deemowe · 2023-09-21T17:32:01Z

After I run my model, how can I see the mAP@0.5 for the best.pt epoch?

glenn-jocher · 2023-09-21T21:13:16Z

Hello! To evaluate the mAP @Deemowe.5 for the best.pt epoch, you can use the test.py script provided in the YOLOv5 repository.

Here is an example command to run the evaluation:

python3 test.py --data your_data.yaml --weights path/to/best.pt --img-size 640 --iou-thres 0.5 --task test

Make sure to replace your_data.yaml with the path to your data configuration file, and path/to/best.pt with the actual path to your best.pt checkpoint file.

This command will evaluate the model on the test dataset using an IoU threshold of 0.5, which is the default for mAP calculation.

Let me know if you have any more questions!

Wang-taoshuo · 2024-04-29T06:43:57Z

@SaraDadjouy hello!

best.pt is the checkpoint file that has the best validation loss during training. It is selected based on the best overall performance of the model on the validation dataset.

I hope this helps! Let me know if you have any further questions.
hi @glenn-jocher
In the segmentation mode of YOLOv8, which metric is used to select the best.pt?
this val/seg_loss?

glenn-jocher · 2024-04-29T09:12:57Z

Hello @Wang-taoshuo!

In segmentation mode for YOLOv8, best.pt is typically selected based on a combination of metrics, with a significant emphasis on the segmentation loss (val/seg_loss) on the validation dataset. This ensures that the chosen model checkpoint has demonstrated the most effective performance in segmenting the validation data.

If you have more questions or need further clarification, feel free to ask! 😊

Wang-taoshuo · 2024-04-29T11:50:08Z

How do I know which epoch is the best for my best.pt

glenn-jocher · 2024-04-29T17:02:12Z

Hi there! 👋

To find out which epoch corresponds to your best.pt file, you can check the results.csv file that's saved during training. This file logs metrics like precision, recall, mAP, and val loss for each epoch. Look for the epoch with the best performance (usually the lowest validation loss or highest mAP, depending on what best.pt was selected on) to identify the epoch your best.pt model corresponds to.

If you're still not sure, you can also re-evaluate each saved epoch using the test.py script with your validation set and compare the results manually.

Hope this helps! Let me know if you have other questions. 😊

xiaohangguo added the question Further information is requested label Jul 24, 2022

xiaohangguo closed this as completed Jul 26, 2022

IgorGoldberg mentioned this issue Jan 16, 2024

How is best.pt selected ultralytics/ultralytics#7603

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to find the best.pt is the result of which epoch? #8701

How to find the best.pt is the result of which epoch? #8701

xiaohangguo commented Jul 24, 2022

github-actions bot commented Jul 24, 2022 •

edited by glenn-jocher

Loading

glenn-jocher commented Jul 25, 2022

xiaohangguo commented Jul 25, 2022

xiaohangguo commented Jul 25, 2022

glenn-jocher commented Jul 25, 2022 •

edited

Loading

xiaohangguo commented Jul 25, 2022

glenn-jocher commented Jul 25, 2022

xiaohangguo commented Jul 25, 2022

xiaohangguo commented Jul 25, 2022

glenn-jocher commented Jul 25, 2022

AgusRaharja69 commented Nov 20, 2022

xiaohangguo commented Nov 20, 2022

guptasaumya commented Dec 7, 2022

glenn-jocher commented Dec 7, 2022

SaraDadjouy commented Jun 26, 2023

glenn-jocher commented Jun 26, 2023

Deemowe commented Sep 21, 2023

glenn-jocher commented Sep 21, 2023

Wang-taoshuo commented Apr 29, 2024 •

edited

Loading

glenn-jocher commented Apr 29, 2024

Wang-taoshuo commented Apr 29, 2024 •

edited

Loading

glenn-jocher commented Apr 29, 2024

How to find the best.pt is the result of which epoch? #8701

How to find the best.pt is the result of which epoch? #8701

Comments

xiaohangguo commented Jul 24, 2022

Search before asking

Question

Additional

github-actions bot commented Jul 24, 2022 • edited by glenn-jocher Loading

Requirements

Environments

Status

glenn-jocher commented Jul 25, 2022

xiaohangguo commented Jul 25, 2022

xiaohangguo commented Jul 25, 2022

glenn-jocher commented Jul 25, 2022 • edited Loading

YOLOv5 Tutorials

xiaohangguo commented Jul 25, 2022

glenn-jocher commented Jul 25, 2022

xiaohangguo commented Jul 25, 2022

xiaohangguo commented Jul 25, 2022

glenn-jocher commented Jul 25, 2022

AgusRaharja69 commented Nov 20, 2022

xiaohangguo commented Nov 20, 2022

guptasaumya commented Dec 7, 2022

glenn-jocher commented Dec 7, 2022

SaraDadjouy commented Jun 26, 2023

glenn-jocher commented Jun 26, 2023

Deemowe commented Sep 21, 2023

glenn-jocher commented Sep 21, 2023

Wang-taoshuo commented Apr 29, 2024 • edited Loading

glenn-jocher commented Apr 29, 2024

Wang-taoshuo commented Apr 29, 2024 • edited Loading

glenn-jocher commented Apr 29, 2024

github-actions bot commented Jul 24, 2022 •

edited by glenn-jocher

Loading

glenn-jocher commented Jul 25, 2022 •

edited

Loading

Wang-taoshuo commented Apr 29, 2024 •

edited

Loading

Wang-taoshuo commented Apr 29, 2024 •

edited

Loading