
Exclude torch==1.12.0, torchvision==0.13.0 (Fix #8395) #8497

Merged · 1 commit · Jul 6, 2022

Conversation

@mjun0812 (Contributor) commented on Jul 6, 2022

Fix #8395

PyTorch 1.12.0 has a bug in its CUDA initialization: the environment variable CUDA_VISIBLE_DEVICES has no effect if it is set after import torch.

This is a fatal problem, because YOLOv5 relies on CUDA_VISIBLE_DEVICES to allocate GPUs in select_device():

def select_device(device='', batch_size=0, newline=True):
    # device = None or 'cpu' or 0 or '0' or '0,1,2,3'
    s = f'YOLOv5 🚀 {git_describe() or file_date()} Python-{platform.python_version()} torch-{torch.__version__} '
    device = str(device).strip().lower().replace('cuda:', '').replace('none', '')  # to string, 'cuda:0' to '0'
    cpu = device == 'cpu'
    mps = device == 'mps'  # Apple Metal Performance Shaders (MPS)
    if cpu or mps:
        os.environ['CUDA_VISIBLE_DEVICES'] = '-1'  # force torch.cuda.is_available() = False
    elif device:  # non-cpu device requested
        os.environ['CUDA_VISIBLE_DEVICES'] = device  # set environment variable - must be before assert is_available()
        assert torch.cuda.is_available() and torch.cuda.device_count() >= len(device.replace(',', '')), \
            f"Invalid CUDA '--device {device}' requested, use '--device cpu' or pass valid CUDA device(s)"

    if not (cpu or mps) and torch.cuda.is_available():  # prefer GPU if available
        devices = device.split(',') if device else '0'  # range(torch.cuda.device_count())  # i.e. 0,1,6,7
        n = len(devices)  # device count
        if n > 1 and batch_size > 0:  # check batch_size is divisible by device_count
            assert batch_size % n == 0, f'batch-size {batch_size} not multiple of GPU count {n}'
        space = ' ' * (len(s) + 1)
        for i, d in enumerate(devices):
            p = torch.cuda.get_device_properties(i)
            s += f"{'' if i == 0 else space}CUDA:{d} ({p.name}, {p.total_memory / (1 << 20):.0f}MiB)\n"  # bytes to MB
        arg = 'cuda:0'
    elif mps and getattr(torch, 'has_mps', False) and torch.backends.mps.is_available():  # prefer MPS if available
        s += 'MPS\n'
        arg = 'mps'
    else:  # revert to CPU
        s += 'CPU\n'
        arg = 'cpu'

    if not newline:
        s = s.rstrip()
    LOGGER.info(s.encode().decode('ascii', 'ignore') if platform.system() == 'Windows' else s)  # emoji-safe
    return torch.device(arg)
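The ordering constraint that breaks here (CUDA_VISIBLE_DEVICES must be visible before CUDA is initialized) can be illustrated with a small, torch-free sketch. FakeCudaRuntime below is a hypothetical stand-in for a library that snapshots the variable once at initialization, which is the failure mode reported for torch 1.12.0; it is not actual PyTorch internals.

```python
import os

# Hypothetical stand-in (NOT actual PyTorch internals): a runtime that
# reads CUDA_VISIBLE_DEVICES exactly once, at initialization, mimicking
# the eager CUDA init reported in torch 1.12.0.
class FakeCudaRuntime:
    def __init__(self):
        self.visible = os.environ.get('CUDA_VISIBLE_DEVICES')  # snapshot at init

os.environ.pop('CUDA_VISIBLE_DEVICES', None)  # start from a clean environment
runtime = FakeCudaRuntime()                   # analogous to `import torch`
os.environ['CUDA_VISIBLE_DEVICES'] = '0'      # set too late, as select_device() does
print(runtime.visible)                        # None: the late assignment was never seen
```

In PyTorch 1.12.1 the lazy behavior is restored, so a value assigned after import (but before the first CUDA call) is honored again.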

As mentioned in that issue, the upcoming PyTorch 1.12.1 release will resolve the problem, so it is better to exclude the buggy 1.12.0 release, together with its companion torchvision 0.13.0, from the requirements.txt file.
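A change like this would use pip's `!=` exclusion specifier in requirements.txt. The sketch below is an assumption about the shape of the diff, not a copy of it; the exact lower bounds in the real file may differ:

```
# requirements.txt (sketch) -- skip the broken pair, allow everything else
torch>=1.7.0,!=1.12.0
torchvision>=0.8.1,!=0.13.0
```

With this constraint, pip resolves to the newest allowed pair (e.g. it will pick up 1.12.1 once released) while refusing to install 1.12.0/0.13.0.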

If you have any comments, please do not hesitate to let me know.
Thanks.

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Updated PyTorch requirements to exclude buggy versions.

📊 Key Changes

  • Excluded PyTorch version 1.12.0 due to identified issues.
  • Excluded torchvision version 0.13.0, the companion release pinned to PyTorch 1.12.0.

🎯 Purpose & Impact

  • 🎯 Purpose: To prevent installation of specific PyTorch and torchvision versions that are known to cause problems with the YOLOv5 code, ensuring stability and reliability for users.
  • 💥 Impact: Users will avoid potential bugs by not installing these problematic versions, leading to a smoother experience with the YOLOv5 project.

@github-actions bot left a comment
👋 Hello @mjun0812, thank you for submitting a YOLOv5 🚀 PR! To allow your work to be integrated as seamlessly as possible, we advise you to:

  • ✅ Verify your PR is up-to-date with upstream/master. If your PR is behind upstream/master, an automatic GitHub Actions merge may be attempted by writing /rebase in a new comment, or by running the following commands (replace 'feature' with the name of your local branch):
git remote add upstream https://github.com/ultralytics/yolov5.git
git fetch upstream
# git checkout feature  # <--- replace 'feature' with local branch name
git merge upstream/master
git push -u origin -f
  • ✅ Verify all Continuous Integration (CI) checks are passing.
  • ✅ Reduce changes to the absolute minimum required for your bug fix or feature addition. "It is not daily increase but daily decrease, hack away the unessential. The closer to the source, the less wastage there is." -Bruce Lee

@glenn-jocher glenn-jocher self-assigned this Jul 6, 2022
@glenn-jocher glenn-jocher merged commit 1ab23fc into ultralytics:master Jul 6, 2022
@glenn-jocher (Member) commented:

@mjun0812 thanks for the investigation and the PR. PR is merged!

Shivvrat pushed a commit to Shivvrat/epic-yolov5 that referenced this pull request Jul 12, 2022
@zhiqwang zhiqwang mentioned this pull request Jul 15, 2022
ctjanuhowski pushed a commit to ctjanuhowski/yolov5 that referenced this pull request Sep 8, 2022
Linked issues

Successfully merging this pull request may close these issues:

  • YOLOv5 issues with torch==1.12 on Multi-GPU systems