Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

training crash #27

Closed
zidanexu opened this issue Jun 9, 2020 · 2 comments
Closed

training crash #27

zidanexu opened this issue Jun 9, 2020 · 2 comments
Labels
bug Something isn't working Stale

Comments

@zidanexu
Copy link

zidanexu commented Jun 9, 2020

  • Python
  • PyTorch 1.4
  • tesla P40
  • centos
    the command line: python train.py --data data/coco.yaml --cfg models/yolov5s.yaml --weights '' --batch-size 128 --resume --device='0,1,2,3,4,5,6,7'
    image
    btw, this case has also happened at 46 epochs
@zidanexu zidanexu added the bug Something isn't working label Jun 9, 2020
@glenn-jocher
Copy link
Member

@zidanexu I've not seen this error before, so I'm sorry but I don't think I can help you.

I might recommend training in a common environment, such as our docker container to avoid any potential environment issues. For example environments see https://github.com/ultralytics/yolov5/wiki

@github-actions
Copy link
Contributor

github-actions bot commented Aug 1, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Stale
Projects
None yet
Development

No branches or pull requests

2 participants