
sat-12l-sm running on GPU #120

Closed · Randwow opened this issue Jul 2, 2024 · 6 comments

Randwow commented Jul 2, 2024

Hi @bminixhofer,

I'm trying to use sat-12l-sm on GPU with the following code:

MODEL_NAME = "sat-12l-sm"
sat = SaT(MODEL_NAME)
sat.to("cuda")

However, when I run nvidia-smi in the terminal, it shows no GPU usage, so the model does not seem to be using the GPU. Could you please provide any guidance or suggestions on how to ensure that the model actually runs on the GPU?

Thank you!

@markus583 (Collaborator)

Hi, this is a bit odd and should not be the case. Is your torch properly set up? Did you also check with nvidia-smi after calling sat.split("some text")?
I just tried it myself and it works as intended. The time needed is also much lower after doing sat.cuda(). I suggest you compare the time needed to segment some sentences on CPU and GPU; the latter should be an order of magnitude faster.
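For reference, a rough comparison along those lines could look like this (a minimal sketch; the sample text and timing loop are illustrative, not from this thread):

import time
from wtpsplit import SaT

sat = SaT("sat-12l-sm")
text = "This is a test. It has several short sentences. " * 100

start = time.perf_counter()
list(sat.split(text))  # list() in case split returns lazily
print(f"CPU: {time.perf_counter() - start:.3f}s")

sat.to("cuda")
list(sat.split(text))  # warm-up call so CUDA initialization is not timed
start = time.perf_counter()
list(sat.split(text))
print(f"GPU: {time.perf_counter() - start:.3f}s")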

@markus583 markus583 self-assigned this Jul 3, 2024
@Randwow (Author) commented Jul 3, 2024

Hi, yes, I will check. Thank you!

@lifeiteng

Can we set the CUDA device index?

@markus583 (Collaborator)

> Can we set the CUDA device index?

Should be possible, yes. It is no different than other PyTorch models in this regard.
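For example (a minimal sketch; device index 1 is arbitrary):

import torch
from wtpsplit import SaT

sat = SaT("sat-12l-sm")
sat.to("cuda:1")  # standard PyTorch device-string syntax

# equivalently, with an explicit torch.device:
sat.to(torch.device("cuda", 1))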

@lifeiteng

There is a bug in the PyTorchWrapper class:

sat = SaT("sat-3l")

# This is not right: `.to()` appears to be resolved through __getattr__
# below, so the assignment rebinds `sat` to the inner torch module
# returned by nn.Module.to(), not to the SaT object.
sat = sat.to(device)

# I used this instead: move the wrapped module in place and keep `sat` as-is.
_ = sat.model.model.to(device)

# Source code of the wrapper:
class PyTorchWrapper:
    def __init__(self, model):
        self.model = model
        self.config = model.config

    def __getattr__(self, name):
        # Delegates every attribute lookup (including `.to`) to the wrapped model.
        assert hasattr(self, "model")
        return getattr(self.model, name)

@markus583 (Collaborator)

Why is this a bug? I have been using sat = sat.to("cuda") and it worked just fine. Is it any different if you use, e.g., "cuda:2"?
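One way to check what actually happens (a sketch; it assumes the delegation chain shown in the PyTorchWrapper snippet above):

from wtpsplit import SaT

sat = SaT("sat-3l")
sat.to("cuda:2")  # move in place; don't rebind `sat` to the return value

# The underlying parameters should now report the chosen device.
print(next(sat.model.model.parameters()).device)  # expected: cuda:2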
