
Fix YoloNAS on cuda #1444

Merged
3 commits merged on Oct 12, 2023

Conversation

Louis-Dupont (Contributor)

Reproducing the bug

import torch
from super_gradients.common.object_names import Models
from super_gradients.training import models

# Note that currently only YoloX, PPYoloE and YOLO-NAS are supported.
model = models.get(Models.YOLO_NAS_L, pretrained_weights="coco")

# We want to use cuda if available to speed up inference.
model = model.to("cuda" if torch.cuda.is_available() else "cpu")

IMAGES = [
    "../../../../documentation/source/images/examples/countryside.jpg",
    "../../../../documentation/source/images/examples/street_busy.jpg",
    "https://cdn-attachments.timesofmalta.com/cc1eceadde40d2940bc5dd20692901371622153217-1301777007-4d978a6f-620x348.jpg",
]

predictions = model.predict(IMAGES)
predictions.show()
predictions.save(output_folder="")  # Save in working directory

Exception

    shift_x = torch.arange(end=w, dtype=dtype) + self.grid_cell_offset
RuntimeError: "arange_cpu" not implemented for 'Half'

pytorch == '1.12.0+cu102'
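The failure can be reproduced independently of the model. A minimal sketch (an illustration, not code from the repository): on affected PyTorch builds the CPU arange kernel has no Half implementation, so requesting float16 on the CPU raises a RuntimeError, while newer builds may succeed, so the sketch probes rather than asserts.

```python
import torch

# On affected PyTorch versions (e.g. 1.12), torch.arange has no CPU
# kernel for Half, so requesting float16 on the CPU raises RuntimeError.
# Newer builds may support it, so we probe instead of assuming.
supported = True
try:
    shift_x = torch.arange(end=4, dtype=torch.float16)
except RuntimeError as err:
    supported = False
    print(f"RuntimeError: {err}")

print("Half arange on CPU supported:", supported)
```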

Solutions

  1. Remove dtype.
  2. Check the PyTorch version and pass dtype only if the user has a version that supports it. I'm not sure of the best way to determine which versions do, other than trying them one by one.

Should we go with option 1, or do we still want to pass dtype when possible? What was the motivation for adding it? @BloodAxe
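A third option (a sketch under assumed behavior, not the merged fix, and `anchor_points_x` is a hypothetical name) would keep the target dtype in the output while sidestepping the missing CPU Half kernel: build the range in float32, which arange supports everywhere, then cast.

```python
import torch

def anchor_points_x(w: int, grid_cell_offset: float, dtype: torch.dtype) -> torch.Tensor:
    # Hypothetical helper: create the range in float32, which arange
    # supports on every backend, then cast to the requested dtype so
    # downstream consumers (e.g. the ONNX graph) still see that type.
    shift_x = torch.arange(end=w, dtype=torch.float32) + grid_cell_offset
    return shift_x.to(dtype)

print(anchor_points_x(4, 0.5, torch.float16))
```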

@Louis-Dupont Louis-Dupont marked this pull request as draft September 3, 2023 09:33
@Louis-Dupont Louis-Dupont changed the title fix Fix YoloNAS on cuda Sep 3, 2023
@BloodAxe (Collaborator) commented Sep 3, 2023

Dtype is needed because otherwise fp64 types appear in the ONNX graph.
But I'm not sure where the fp16 issue is coming from. I'll take a look; thanks for raising this issue.

@Pbatch commented Oct 4, 2023

Is it possible to add just the device to torch.arange instead?

I.e.

shift_x = torch.arange(end=w, device=device, dtype=dtype) + self.grid_cell_offset

Creating a torch.float16 tensor is fine if it's made on the GPU.
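The suggestion amounts to allocating the tensor directly on the model's device, where the float16 arange kernel exists. A hedged sketch of the idea (`make_anchor_shift` is a hypothetical name, not the repository's function), with a CPU fallback for builds that lack a CPU Half kernel:

```python
import torch

def make_anchor_shift(w: int, grid_cell_offset: float,
                      device: torch.device, dtype: torch.dtype) -> torch.Tensor:
    # Passing device= makes arange allocate on the GPU, where the
    # float16 kernel exists. On CPU, fall back to float32 and cast,
    # since older builds lack a CPU Half arange kernel.
    if device.type == "cpu" and dtype == torch.float16:
        rng = torch.arange(end=w, dtype=torch.float32).to(dtype)
    else:
        rng = torch.arange(end=w, device=device, dtype=dtype)
    return rng + grid_cell_offset

print(make_anchor_shift(4, 0.5, torch.device("cpu"), torch.float32))
```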

@BloodAxe BloodAxe marked this pull request as ready for review October 11, 2023 14:02
@Louis-Dupont (Contributor, Author) left a comment

LGTM

@BloodAxe (Collaborator) left a comment

LGTM

@BloodAxe BloodAxe merged commit ecdec5e into master Oct 12, 2023
7 checks passed
@BloodAxe BloodAxe deleted the hotfix/SG-000-fix_model_predict_on_cuda_due_to_dtype branch October 12, 2023 06:35
3 participants