Transfer Learning - Freezing Parameters #679

Closed
ska6845 opened this issue Aug 9, 2020 · 25 comments · Fixed by #1239

@ska6845

ska6845 commented Aug 9, 2020

Does yolov5 support transfer learning?

While training models, is there a possibility to use pretrained weights and modify last few layers?

@ska6845 ska6845 added the question Further information is requested label Aug 9, 2020
@github-actions
Contributor

github-actions bot commented Aug 9, 2020

Hello @shubhamag01, thank you for your interest in our work! Please visit our Custom Training Tutorial to get started, and see our Jupyter Notebook, Docker Image, and Google Cloud Quickstart Guide for example environments.

If this is a bug report, please provide screenshots and minimum viable code to reproduce your issue; otherwise we cannot help you.

If this is a custom model or data training question, please note Ultralytics does not provide free personal support. As a leader in vision ML and AI, we do offer professional consulting, from simple expert advice up to delivery of fully customized, end-to-end production solutions for our clients, such as:

  • Cloud-based AI systems operating on hundreds of HD video streams in real time.
  • Edge AI integrated into custom iOS and Android apps for real-time 30 FPS video inference.
  • Custom data training, hyperparameter evolution, and model exportation to any destination.

For more information please visit https://www.ultralytics.com.

@glenn-jocher
Member

@shubhamag01 this is the default behavior when a pretrained model is specified:
python train.py --weights yolov5s.pt

@ska6845
Author

ska6845 commented Aug 9, 2020

@glenn-jocher is it possible to remove the last few layers of a pretrained model, add new layers, and then train with the rest of the untouched layers frozen?

@glenn-jocher
Member

@shubhamag01 you can do whatever you want

@karen-gishyan

@glenn-jocher is there a difference between

python3 train.py --data coco1cls.data --cfg yolov3-spp.cfg --weights weights/yolov3-spp.pt

and

python3 train.py --data coco1cls.data --cfg yolov3-spp.cfg --weights weights/yolov3-spp.pt --transfer

I assumed that using pre-trained weights was the idea behind transfer learning, but then I found the Transfer Learning tutorial with the --transfer flag specified.
Thanks in advance.

@glenn-jocher
Member

@karen-gishyan the --transfer argument does not exist in train.py. See the argparser arguments at the end of train.py for a list of available arguments.

@karen-gishyan

thanks @glenn-jocher.

@ska6845
Author

ska6845 commented Aug 11, 2020

@glenn-jocher I made a custom yolov5 model and ran python train.py --img 640 --batch 16 --epochs 100 --data '../data.yaml' --cfg ./models/custom_yolov5s.yaml --weights yolov5s.pt --nosave --cache. I modified the last few layers of yolov5s.yaml to create custom_yolov5s.yaml. Now I want to freeze the layers that were not modified from yolov5s and train only the remaining ones. How do I do that?

glenn-jocher added a commit that referenced this issue Aug 11, 2020
@glenn-jocher
Member

@ska6845 I've been asked this multiple times, so I've added a section to train.py that handles freezing parameters:

yolov5/train.py

Lines 76 to 83 in e71fd0e

# Freeze
freeze = ['', ]  # parameter names to freeze (full or partial)
if any(freeze):
    for k, v in model.named_parameters():
        if any(x in k for x in freeze):
            print('freezing %s' % k)
            v.requires_grad = False

You can add any parameters you want to this list, with full or partial names, to freeze them before training starts. This code freezes all weights, leaving only biases with active gradients:

    # Freeze
    model.info()
    freeze = ['.weight', ]  # parameter names to freeze (full or partial)
    if any(freeze):
        for k, v in model.named_parameters():
            if any(x in k for x in freeze):
                print('freezing %s' % k)
                v.requires_grad = False
    model.info()

Output:

Model Summary: 191 layers, 7.46816e+06 parameters, 7.46816e+06 gradients
freezing model.0.conv.conv.weight
freezing model.0.conv.bn.weight
freezing model.1.conv.weight
freezing model.1.bn.weight
...
Model Summary: 191 layers, 7.46816e+06 parameters, 11453 gradients

@glenn-jocher glenn-jocher changed the title from Transfer learning to Transfer Learning - Freezing Parameters Aug 11, 2020
@glenn-jocher glenn-jocher added the documentation Improvements or additions to documentation label Aug 11, 2020
@glenn-jocher glenn-jocher self-assigned this Aug 11, 2020
@glenn-jocher
Member

@glenn-jocher TODO: update this to act correctly with optimizer parameter grouping (pg0-pg2). The loop below currently sets requires_grad = True on every parameter, which undoes any freezing applied earlier:

yolov5/train.py

Lines 89 to 91 in e71fd0e

pg0, pg1, pg2 = [], [], []  # optimizer parameter groups
for k, v in model.named_parameters():
    v.requires_grad = True
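
A sketch of the intended fix (hypothetical grouping logic, not the actual train.py code; it assumes the model and freeze variables from the snippets above) would skip frozen parameters when building the groups:

    # Sketch: respect the freeze list instead of re-enabling every gradient
    pg0, pg1, pg2 = [], [], []  # optimizer parameter groups
    for k, v in model.named_parameters():
        v.requires_grad = not (any(freeze) and any(x in k for x in freeze))
        if not v.requires_grad:
            continue  # frozen parameters are excluded from all groups
        if '.bias' in k:
            pg2.append(v)  # biases
        elif '.bn.weight' in k:
            pg0.append(v)  # BatchNorm gains (typically no weight decay)
        else:
            pg1.append(v)  # remaining weights (weight decay applied)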

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@glenn-jocher
Member

Removing TODO as this fix is incorporated in PR #1239.

Layer freezing functionality now operates correctly in all cases. To freeze layers, simply add their names to the freeze list in train.py:

yolov5/train.py

Lines 83 to 90 in 187f7c2

# Freeze
freeze = []  # parameter names to freeze (full or partial)
for k, v in model.named_parameters():
    v.requires_grad = True  # train all layers
    if any(x in k for x in freeze):
        print('freezing %s' % k)
        v.requires_grad = False
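
For example (a hypothetical entry; parameter names follow the model.N. pattern shown in the output above), freezing the first module would look like:

    freeze = ['model.0.']  # matches model.0.conv.conv.weight, model.0.conv.bn.weight, ...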

@glenn-jocher glenn-jocher removed the TODO label Nov 2, 2020
@pngmafia

pngmafia commented Nov 3, 2020

@glenn-jocher I wanted to freeze the backbone part of the yolov5l configuration. Could you please tell me how to do it? Also, will freezing the layers help in any way other than decreased training time? I am using a COCO pretrained model for a logo-detection problem.
Thanks

@glenn-jocher
Member

glenn-jocher commented Nov 3, 2020

@pngmafia freezing layers will reduce your mAP. You can add the names of the parameters you'd like to freeze to the freeze list. You can verify which layers are frozen by printing the model info:

print(model.info(verbose=True))
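
For instance, a minimal sketch of freezing the backbone (assuming the default yolov5 yaml layout, where modules 0-9 form the backbone; verify the indices against your own cfg):

    # Freeze backbone modules 0-9; the trailing dot keeps 'model.1.' from matching 'model.10.'
    freeze = ['model.%s.' % x for x in range(10)]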

@pngmafia

pngmafia commented Nov 3, 2020

@glenn-jocher I don't mind a decrease in mAP, but I need recall to be high. Is there also a way to give recall more weight than mAP?
Thanks

@glenn-jocher
Member

glenn-jocher commented Nov 3, 2020

@pngmafia recall is not a universal metric; it depends on your confidence threshold. If you want maximum recall, all you need to do is set conf_thres to zero; then you will have 100% recall.
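
A toy illustration of the point, with made-up scores and true-positive flags (not YOLOv5 code): lowering the threshold can only keep or raise recall, and at zero every detection is kept:

    import numpy as np

    scores = np.array([0.9, 0.7, 0.4, 0.2])  # hypothetical detection confidences
    is_tp = np.array([1, 1, 0, 1])           # 1 = detection matches a ground-truth box
    n_gt = 3                                 # total ground-truth objects

    for t in (0.5, 0.25, 0.0):
        kept = scores >= t
        print('conf_thres=%.2f -> recall=%.2f' % (t, is_tp[kept].sum() / n_gt))
    # conf_thres=0.50 -> recall=0.67
    # conf_thres=0.25 -> recall=0.67
    # conf_thres=0.00 -> recall=1.00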

@pngmafia

pngmafia commented Nov 3, 2020

@glenn-jocher can't I use the fitness function in utils and give different weights to those four metrics (P, R, mAP@0.5, and mAP@0.5:0.95)? I see it's 0, 0, 0.1, and 0.9 now.

@glenn-jocher
Member

glenn-jocher commented Nov 3, 2020

@pngmafia sure, you can customize the hyperparameter evolution fitness function as you see fit. See the Hyperparameter Evolution tutorial at https://docs.ultralytics.com/yolov5.

@pngmafia

pngmafia commented Nov 3, 2020

@glenn-jocher Does changing those weights to 0, 0.8, 0.1, 0.1 give me better recall compared to 0, 0, 0.1, 0.9, assuming I use the same confidence threshold for both experiments?

@glenn-jocher
Member

glenn-jocher commented Nov 3, 2020

@pngmafia hyperparameter evolution maximizes the fitness function here:

yolov5/utils/general.py

Lines 926 to 930 in 187f7c2

def fitness(x):
    # Returns fitness (for use with results.txt or evolve.txt)
    w = [0.0, 0.0, 0.1, 0.9]  # weights for [P, R, mAP@0.5, mAP@0.5:0.95]
    return (x[:, :4] * w).sum(1)

Normal training minimizes loss on your training dataset, and is unrelated to hyperparameter evolution.
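
To bias evolution toward recall, the weight vector could be edited as the question above suggests (a hypothetical re-weighting, shown here with made-up result values):

    import numpy as np

    def fitness(x):
        # Recall-heavy variant of the weights for [P, R, mAP@0.5, mAP@0.5:0.95]
        w = [0.0, 0.8, 0.1, 0.1]
        return (x[:, :4] * w).sum(1)

    results = np.array([[0.60, 0.90, 0.55, 0.35]])  # made-up [P, R, mAP@0.5, mAP@0.5:0.95] row
    print(fitness(results))  # [0.81]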

@pngmafia

pngmafia commented Nov 4, 2020

@glenn-jocher so this fitness function is only used when we train with the --evolve option? If I'm training normally on my dataset, it really doesn't matter what weights I give in that function, right?

@glenn-jocher
Member

@pngmafia that's correct.

burglarhobbit pushed a commit to burglarhobbit/yolov5 that referenced this issue Jan 1, 2021
@glenn-jocher glenn-jocher removed the Stale and documentation (Improvements or additions to documentation) labels Jan 23, 2021
KMint1819 pushed a commit to KMint1819/yolov5 that referenced this issue May 12, 2021
@vedal

vedal commented Jun 1, 2021

@glenn-jocher thanks for including these changes to freeze params.
How would you suggest going about freezing all but the last layer using your code, as would be done in a classical transfer learning setting (as suggested in https://docs.ultralytics.com/yolov5/tutorials/tips_for_best_training_results)? Which layers would you avoid freezing in this case?

@glenn-jocher
Member

glenn-jocher commented Jun 1, 2021

@vedal

vedal commented Jun 1, 2021

@glenn-jocher can't believe I missed this! Thanks a lot! :)
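
For reference, a minimal sketch of the setup vedal describes (assuming the standard yolov5s layout, where module 24 is the final Detect head; check your own yaml before relying on the index):

    # Freeze every module except the final Detect head (model.24 in yolov5s)
    freeze = ['model.%s.' % x for x in range(24)]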

BjarneKuehl pushed a commit to fhkiel-mlaip/yolov5 that referenced this issue Aug 26, 2022