Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Checkpoint from PyTorch-trained model? #24

Closed
nirbenz opened this issue Oct 8, 2018 · 3 comments
Closed

Checkpoint from PyTorch-trained model? #24

nirbenz opened this issue Oct 8, 2018 · 3 comments

Comments

@nirbenz
Copy link

nirbenz commented Oct 8, 2018

Thank you for this awesome repository.
Due to some changes in the training scheme v.s. original darknet code, I wonder if the provided PyTorch weights are converted from the original .weights file - or rather, results of a fresh training session in PyTorch.
If it's the former, it would be wonderful of the results of PyTorch training could be provided as well!

Thanks!

@glenn-jocher
Copy link
Member

@nirbenz these are the official yolov3 weights trained with darknet. Pytorch training is still under development with outstanding issues (see Issue #22).

@nirbenz
Copy link
Author

nirbenz commented Oct 10, 2018

Thanks for the quick answer. I might join the training effort real soon :)

@glenn-jocher
Copy link
Member

@nirbenz no problem. I forgot to mention, pytorch training can be done from scratch or starting from the official yolov3 weights by doing -resume = True. In this case you can train all the parameters or transfer learn on a subset of the parameters (like the yolo layers only, or all the layers after darknet-53 for example).

If your application is similar to coco then it should be faster to begin from the official weights, however we see quick drops in mAP when doing this, indicating a potential problem somewhere in the training code. This is where things remain stuck at at the moment unfortunately.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants