minor fixes 20231211 #943

winglian · 2023-12-12T03:58:41Z

No description provided.

NanoCode012

Could we also add nano in base docker :)

NanoCode012 · 2023-12-12T04:53:04Z

src/axolotl/utils/models.py

@@ -191,6 +191,7 @@ def load_model(

    # TODO refactor as a kwarg


there's this todo here.

I'm not even sure what this TODO is for atm

maybe it's for the .from_pretrained call where these are kwargs for that? should we add this to model_kwargs and remove those from all the from_pretrained calls?

This is just above

load_in_8bit = cfg.load_in_8bit load_in_4bit = cfg.load_in_4bit

NanoCode012 · 2023-12-12T05:01:14Z

src/axolotl/utils/models.py

@@ -535,7 +536,7 @@ def load_model(

    model, lora_config = load_adapter(model, cfg, cfg.adapter)

-    if cfg.ddp and not load_in_8bit:
+    if cfg.ddp and not load_in_8bit and not load_in_4bit:
        model.to(f"cuda:{cfg.local_rank}")


what's the side effect of this change? Does it mean, the models will all live on gpu0 for 4 bit now?

there's an edge case that I had to have this set when doing DPO qlora for Mixstral. maybe it was a deepspeed thing and I should change this?

NanoCode012 · 2023-12-12T05:36:27Z

I found that you separated your apt installs into 2 separate docker file. Maybe, these should also be moved to base

https://github.com/OpenAccess-AI-Collective/axolotl/blob/7fabc4d95e9b34e0cfaace2a497cd4fedc43db3b/docker/Dockerfile#L13C1-L13C32

misc fixes

2de1bfa

winglian changed the title ~~misc fixes~~ minor fixes 20231211 Dec 12, 2023

NanoCode012 reviewed Dec 12, 2023

View reviewed changes

nano editor in base docker

2f600e2

move nano install to axolotl image

8c78197

winglian added a commit that referenced this pull request Jan 10, 2024

misc fixes from #943

b02bf3e

winglian mentioned this pull request Jan 10, 2024

misc fixes from #943 #1086

Merged

winglian closed this Jan 10, 2024

winglian added a commit that referenced this pull request Jan 11, 2024

misc fixes from #943 (#1086) [skip ci]

23495a8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

minor fixes 20231211 #943

minor fixes 20231211 #943

winglian commented Dec 12, 2023

NanoCode012 left a comment

NanoCode012 Dec 12, 2023

winglian Dec 12, 2023

winglian Dec 12, 2023

NanoCode012 Dec 12, 2023

NanoCode012 Dec 12, 2023

winglian Dec 12, 2023

NanoCode012 commented Dec 12, 2023

		@@ -191,6 +191,7 @@ def load_model(

		# TODO refactor as a kwarg

minor fixes 20231211 #943

minor fixes 20231211 #943

Conversation

winglian commented Dec 12, 2023

NanoCode012 left a comment

Choose a reason for hiding this comment

NanoCode012 Dec 12, 2023

Choose a reason for hiding this comment

winglian Dec 12, 2023

Choose a reason for hiding this comment

winglian Dec 12, 2023

Choose a reason for hiding this comment

NanoCode012 Dec 12, 2023

Choose a reason for hiding this comment

NanoCode012 Dec 12, 2023

Choose a reason for hiding this comment

winglian Dec 12, 2023

Choose a reason for hiding this comment

NanoCode012 commented Dec 12, 2023