AttributeError: 'NoneType' object has no attribute 'python_exit_status' #5913

Closed
2 tasks done
awsaf49 opened this issue Dec 7, 2021 · 26 comments · Fixed by #6041
Labels: bug, Stale

Comments

@awsaf49 (Contributor) commented Dec 7, 2021

Search before asking

  • I have searched the YOLOv5 issues and found no similar bug report.

YOLOv5 Component

Training

Bug

After training completes, I'm getting this error:

wandb: 
Exception ignored in: <function _MultiProcessingDataLoaderIter.__del__ at 0x7f8609b77710>
Traceback (most recent call last):
  File "/opt/conda/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 1328, in __del__
  File "/opt/conda/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 1262, in _shutdown_workers
AttributeError: 'NoneType' object has no attribute 'python_exit_status'
Exception ignored in: <function _MultiProcessingDataLoaderIter.__del__ at 0x7f8609b77710>
Traceback (most recent call last):
  File "/opt/conda/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 1328, in __del__
  File "/opt/conda/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 1262, in _shutdown_workers
AttributeError: 'NoneType' object has no attribute 'python_exit_status'
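
For context, a minimal sketch (an assumption about the trigger, on an affected torch build in the 1.7–1.12 range) of the pattern behind this message: a multi-worker DataLoader iterator that is still alive when the interpreter shuts down, so its __del__ runs during module teardown.

import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy dataset standing in for the YOLOv5 training set
dataset = TensorDataset(torch.randn(32, 3), torch.randn(32))
loader = DataLoader(dataset, batch_size=8, num_workers=2)

it = iter(loader)   # spawns worker processes
next(it)            # fetch one batch, then leave the iterator alive
# No explicit cleanup: on affected versions the iterator's __del__ may run while
# the interpreter is exiting and print "Exception ignored in ... __del__" with
# AttributeError: 'NoneType' object has no attribute 'python_exit_status'.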

Environment

Kaggle

Minimal Reproducible Example

Notebook Link here

Additional

I think I saw a similar post, but it was in Japanese, which I couldn't understand, so I'm posting it here in English.

Are you willing to submit a PR?

  • Yes I'd like to help by submitting a PR!
awsaf49 added the bug label on Dec 7, 2021
@glenn-jocher (Member)

@AyushExel seems like a wandb issue here.

@awsaf49 can you provide example code that reproduces the same error message for us please?

@awsaf49 (Contributor, Author) commented Dec 9, 2021

@glenn-jocher Here's the notebook on Kaggle to reproduce. The notebook is public, so you'll be able to simply fork and run it to reproduce the issue.

@T1M-CHEN commented Dec 9, 2021

@awsaf49 I also ran into this problem while training on Kaggle. As a temporary workaround you can pass --workers 0 to avoid it; maybe that helps.
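
As a rough illustration (a standalone sketch, not YOLOv5's actual dataloader code), --workers 0 corresponds to a DataLoader with num_workers=0: batches are loaded in the main process, so there are no worker processes left to shut down at interpreter exit, at the cost of slower loading.

import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy dataset standing in for the YOLOv5 training set
dataset = TensorDataset(torch.randn(32, 3), torch.randn(32))

# num_workers=0: single-process loading, avoids the multiprocessing shutdown path entirely
loader = DataLoader(dataset, batch_size=8, num_workers=0)
for batch in loader:
    pass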

@awsaf49 (Contributor, Author) commented Dec 9, 2021

@T1M-CHEN you're right, --workers 0 does work, but won't it hurt speed since the full resources aren't being used?

@AyushExel (Contributor)

@glenn-jocher this doesn't seem like it's related to wandb.

@T1M-CHEN commented Dec 9, 2021

@awsaf49
I think there is some incompatibility between the v6.0 code, Kaggle, and the multiprocessing module. --workers 0 may increase your training time, but I'm not sure whether this parameter affects accuracy.

If you don't need the advanced features in v6.0, you can use the v5.0 code on Kaggle with --workers 2 as a temporary way to use the full resources.

@glenn-jocher (Member)

@awsaf49 @T1M-CHEN I just updated the Kaggle notebook to the latest, so it's now aligned with the Colab notebook. I see 4 CPUs on Kaggle, so you should be able to use up to --workers 4; regardless, YOLOv5 will limit itself to 4 workers rather than the default 8 if the environment only supports 4 workers.

The error may simply be due to resource saturation, so yes, perhaps reducing --workers to 3 or 2 would help.
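
A rough sketch of that kind of cap (an illustration only, not the exact YOLOv5 implementation): the effective worker count is the smaller of what was requested and what the machine provides.

import os

def effective_workers(requested: int) -> int:
    # Never ask for more DataLoader workers than there are CPUs available
    return min(requested, os.cpu_count() or 1)

print(effective_workers(8))  # 4 on a 4-CPU Kaggle instance, 2 on a 2-CPU GPU instance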

@glenn-jocher (Member)

@awsaf49 @T1M-CHEN strangely the Kaggle notebook is not displaying any LOGGER outputs from YOLOv5, only print() statement outputs. I'm not sure what the problem is, as LOGGER statements appear in all other environments we use (PyCharm, Docker, Colab, GCP, AWS).

@bhachauk commented Dec 9, 2021

@T1M-CHEN
Tried with version v5.0:

Traceback (most recent call last):
  File "train.py", line 543, in <module>
    train(hyp, opt, device, tb_writer)
  File "train.py", line 87, in train
    ckpt = torch.load(weights, map_location=device)  # load checkpoint
  File "/opt/conda/lib/python3.7/site-packages/torch/serialization.py", line 607, in load
    return _load(opened_zipfile, map_location, pickle_module, **pickle_load_args)
  File "/opt/conda/lib/python3.7/site-packages/torch/serialization.py", line 882, in _load
    result = unpickler.load()
  File "/opt/conda/lib/python3.7/site-packages/torch/serialization.py", line 875, in find_class
    return super().find_class(mod_name, name)
AttributeError: Can't get attribute 'SPPF' on <module 'models.common' from '/kaggle/working/yolov5/models/common.py'>

And with v6.0 I still can't resolve the same issue, even after changing the --workers argument as @glenn-jocher mentioned:
AttributeError: 'NoneType' object has no attribute 'python_exit_status'

@T1M-CHEN

@glenn-jocher
Yes, I also see this problem: the v6.0 code cloned from git can't display print() statement outputs properly, but the v5.0 code on Kaggle prints normally.

@T1M-CHEN

@Bhanuchander210
From that traceback it looks like you are not using the correct weights; you should check that the weights and code versions match. You can try searching the author's name on Kaggle to find the latest code; maybe that helps.
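
A hedged illustration of why the SPPF error above appears: YOLOv5 checkpoints are pickled, so torch.load resolves layer classes by name from the local models.common module, and the SPPF class was only added in v6.0. A hypothetical pre-flight check, run from inside the cloned yolov5/ directory:

import importlib

common = importlib.import_module("models.common")
# False on v5.0 code: loading a v6.0 checkpoint will then fail with
# "Can't get attribute 'SPPF' on <module 'models.common' ...>"
print(hasattr(common, "SPPF"))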

@awsaf49 (Contributor, Author) commented Dec 10, 2021

> @awsaf49 @T1M-CHEN I just updated the Kaggle notebook to the latest, so it's now aligned with the Colab notebook. I see 4 CPUs on Kaggle, so you should be able to use up to --workers 4; regardless, YOLOv5 will limit itself to 4 workers rather than the default 8 if the environment only supports 4 workers.
>
> The error may simply be due to resource saturation, so yes, perhaps reducing --workers to 3 or 2 would help.

@glenn-jocher on Kaggle the GPU instance has 2 CPUs, so I tried --workers 1 but still got the same error.

@Tears1997 commented Dec 15, 2021

I also met this problem while training on my lab's server (Ubuntu 18.04) with the initial weight file yolov5x.pt. I found that the error does not occur if the --workers parameter is set to 0, but it does occur for any other value. The code version is v6.0, the GPUs are 4x RTX 6000, and the CPU environment is shown below.
[screenshot: CPU environment]
The error information is as follows:
[screenshots: error output]
It seems this error does not affect normal training, but it is printed every time after training finishes.

@glenn-jocher

@glenn-jocher (Member) commented Dec 15, 2021

@Tears1997 👋 hi, thanks for letting us know about this possible problem with YOLOv5 🚀. I am not able to reproduce your bug. When I run the default training in our Colab notebook everything works correctly:

[screenshot: Colab notebook training output]

We've created a few short guidelines below to help users provide what we need in order to get started investigating a possible problem.

How to create a Minimal, Reproducible Example

When asking a question, people will be better able to provide help if you provide code that they can easily understand and use to reproduce the problem. This is referred to by community members as creating a minimum reproducible example. Your code that reproduces the problem should be:

  • Minimal – Use as little code as possible to produce the problem
  • Complete – Provide all parts someone else needs to reproduce the problem
  • Reproducible – Test the code you're about to provide to make sure it reproduces the problem

For Ultralytics to provide assistance your code should also be:

  • Current – Verify that your code is up-to-date with GitHub master, and if necessary git pull or git clone a new copy to ensure your problem has not already been solved in master.
  • Unmodified – Your problem must be reproducible using official YOLOv5 code without changes. Ultralytics does not provide support for custom code ⚠️.

If you believe your problem meets all the above criteria, please close this issue and raise a new one using the 🐛 Bug Report template with a minimum reproducible example to help us better understand and diagnose your problem.

Thank you! 😃

@LightDani

I faced the same problem on Kaggle, but as @glenn-jocher said, on Colab it completely works.

glenn-jocher linked a pull request on Dec 20, 2021 that will close this issue
@glenn-jocher (Member)

@LightDani @awsaf49 @T1M-CHEN good news 😃! Your original issue may now be fixed ✅ in PR #6041. This PR resets all logging handlers before running any commands, which fixes the Kaggle missing-output bug. Note that it does not resolve the original AttributeError reported in this issue.
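
A rough sketch of that kind of handler reset (an assumption about the approach, not the exact PR #6041 code): notebook hosts such as Kaggle pre-install their own root logging handlers, which can swallow LOGGER output, so they are removed before logging is configured.

import logging

# Remove any handlers the host environment already attached to the root logger
for handler in logging.root.handlers[:]:
    logging.root.removeHandler(handler)

# Reinstall a clean stream handler so LOGGER messages reach the notebook output
logging.basicConfig(level=logging.INFO)
logging.getLogger(__name__).info("LOGGER output is visible again")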


To receive this update:

  • Git – git pull from within your yolov5/ directory or git clone https://github.com/ultralytics/yolov5 again
  • PyTorch Hub – force-reload with model = torch.hub.load('ultralytics/yolov5', 'yolov5s', force_reload=True)
  • Notebooks – view the updated Colab and Kaggle notebooks
  • Docker – sudo docker pull ultralytics/yolov5:latest to update your image

Thank you for spotting this issue and informing us of the problem. Please let us know if this update resolves the issue for you, and feel free to inform us of any other issues you discover or feature requests that come to mind. Happy trainings with YOLOv5 🚀!

@awsaf49 (Contributor, Author) commented Dec 20, 2021

@glenn-jocher Yes, you're right, the logger output is visible now :D

@github-actions bot (Contributor) commented Jan 31, 2022

👋 Hello, this issue has been automatically marked as stale because it has not had recent activity. Please note it will be closed if no further activity occurs.

Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLOv5 🚀 and Vision AI ⭐!

@MheadHero

Hi, I faced the same problem again in 2022 after training finished. What should I do? I'm a newbie.

@glenn-jocher (Member) commented Mar 25, 2022

@MheadHero 👋 hi, thanks for letting us know about this possible problem with YOLOv5 🚀. We've created a few short guidelines below to help users provide what we need in order to start investigating a possible problem.

How to create a Minimal, Reproducible Example

When asking a question, people will be better able to provide help if you provide code that they can easily understand and use to reproduce the problem. This is referred to by community members as creating a minimum reproducible example. Your code that reproduces the problem should be:

  • Minimal – Use as little code as possible to produce the problem
  • Complete – Provide all parts someone else needs to reproduce the problem
  • Reproducible – Test the code you're about to provide to make sure it reproduces the problem

For Ultralytics to provide assistance your code should also be:

  • Current – Verify that your code is up-to-date with GitHub master, and if necessary git pull or git clone a new copy to ensure your problem has not already been solved in master.
  • Unmodified – Your problem must be reproducible using official YOLOv5 code without changes. Ultralytics does not provide support for custom code ⚠️.

If you believe your problem meets all the above criteria, please close this issue and raise a new one using the 🐛 Bug Report template with a minimum reproducible example to help us better understand and diagnose your problem.

Thank you! 😃

@Suozz commented Apr 7, 2022

I just ran train.py --epochs 10 --data ./data/test.yaml --cfg models/yolov5s.yaml --weights '' --batch-size 128 --workers 1 --batch-size 10 and hit the same problem.
Note: I used the master branch code and did not change any code.

@glenn-jocher (Member) commented Apr 7, 2022

@Suozz you've passed --batch-size twice in your command. In any case your example is not a reproducible example, as no errors occur when I run it in Colab with COCO128.

We've created a few short guidelines below to help users provide what we need in order to start investigating a possible problem.

How to create a Minimal, Reproducible Example

When asking a question, people will be better able to provide help if you provide code that they can easily understand and use to reproduce the problem. This is referred to by community members as creating a minimum reproducible example. Your code that reproduces the problem should be:

  • Minimal – Use as little code as possible to produce the problem
  • Complete – Provide all parts someone else needs to reproduce the problem
  • Reproducible – Test the code you're about to provide to make sure it reproduces the problem

For Ultralytics to provide assistance your code should also be:

  • Current – Verify that your code is up-to-date with GitHub master, and if necessary git pull or git clone a new copy to ensure your problem has not already been solved in master.
  • Unmodified – Your problem must be reproducible using official YOLOv5 code without changes. Ultralytics does not provide support for custom code ⚠️.

If you believe your problem meets all the above criteria, please close this issue and raise a new one using the 🐛 Bug Report template with a minimum reproducible example to help us better understand and diagnose your problem.

Thank you! 😃

@haoshifu commented Sep 1, 2022

Hello, has the problem been solved?

@glenn-jocher (Member)

@haoshifu update your torch to the latest version.

@shinianzhihou

ERROR: (... __del__ ...)
AttributeError: 'NoneType' object has no attribute 'python_exit_status'

Actually, this is a bug in PyTorch. I have checked the source code of torch.utils.data.dataloader._shutdown_workers and found that the difference between torch 1.7–1.12 and torch 1.13 lies in:

# torch 1.13, nice baby
if _utils is None or _utils.python_exit_status is True or _utils.python_exit_status is None:
    return

# torch 1.7-1.12, bad guy
python_exit_status = _utils.python_exit_status
if python_exit_status is True or python_exit_status is None:
    return

So the simple solution is to modify the source code from the "bad guy" version to the "nice baby" version (or simply upgrade torch).
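
A small, hedged way to check which variant your installed torch carries (the exact guard text can differ between releases, so treat the string match as a heuristic rather than a definitive test):

import inspect

import torch
from torch.utils.data import dataloader

print(torch.__version__)
src = inspect.getsource(dataloader._MultiProcessingDataLoaderIter._shutdown_workers)
# Fixed versions guard against the _utils module already being torn down at interpreter exit
print("_utils is None" in src)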

@glenn-jocher (Member)

@shinianzhihou Thank you for sharing your findings! It seems like you have identified a potential solution to the issue based on the differences you observed in the PyTorch source code.

You're welcome to create a pull request with your proposed modification to the YOLOv5 repository. Your contribution would be greatly appreciated by the community. This will allow the Ultralytics team to review your changes and consider incorporating them into the YOLOv5 codebase.

Thank you for taking the initiative to investigate this issue and suggesting a potential solution! If you have any further questions or need assistance with the pull request process, feel free to ask.
