Allow users to choose devices for models #264
Conversation
This addresses a problem that pops up when you've got several GPUs in play. When you set `device_map` to `"auto"`, the model loads across multiple GPUs just fine. But then, when you call `model.to(device)`, it tries to cram everything onto a single GPU.
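A minimal, dependency-free sketch of the conflict described above (the class and attribute names below are illustrative stand-ins, not code from this PR; `hf_device_map` mirrors the attribute accelerate sets on sharded models):

```python
class ShardedModel:
    """Stand-in for a model whose layers accelerate has spread over GPUs."""

    def __init__(self):
        # e.g. half the layers on cuda:0, half on cuda:1
        self.hf_device_map = {"layer.0": "cuda:0", "layer.1": "cuda:1"}

    def to(self, device):
        # A real nn.Module.to() would try to move *every* parameter to
        # `device`, collapsing the sharding onto one GPU (and likely OOMing).
        raise RuntimeError(f"would move all shards onto {device}")


model = ShardedModel()

# The weights live on more than one device, so there is no single device
# to move the model to -- the .to() call must be skipped in this case.
is_sharded = len(set(model.hf_device_map.values())) > 1
if not is_sharded:
    model.to("cuda:0")  # only safe when everything fits on one device
```

The point of the sketch: an unconditional `model.to(device)` is incompatible with a multi-device placement, which is exactly what `device_map="auto"` produces.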
I've updated this branch with some formatting fixes and test/docstring changes from @rlouf.
```diff
@@ -114,7 +114,7 @@ def test_transformers_integration_choice():


 def test_transformers_integration_with_pad_token():
     model_name = "hf-internal-testing/tiny-random-XLMRobertaXLForCausalLM"
-    model = models.transformers(model_name, device="cpu")
+    model = models.transformers(model_name, device="meta")
```
This was necessary in order to get around some "meta device" issues related to accelerate.
Thank you @BramVanroy!
Currently, there are some issues when loading a transformers model because it explicitly calls `.to(device)`. The default is `"cpu"` (which is already the default in transformers to begin with). So I recommend removing `device` as an argument completely and letting the user decide how they want to load the model. That allows things like `device_map` and loading in k-bit. Internally, inside the `Transformers` class nothing changes, but we set `self.device` to `model.device` to ensure that everything stays compatible with the rest of the library.

closes #238