
ensure merged model matches the training dtype #902

Merged · 3 commits · Nov 29, 2023
Conversation

winglian (Collaborator)

It seems like all the LoRA merges are fp16. Matching the training dtype would open up bfloat16 support, which should be better.

Review comment on src/axolotl/cli/__init__.py (outdated, resolved)
NanoCode012 (Collaborator) commented Nov 29, 2023

Perhaps some individuals might want fp32? We should respect the same dtype as in training. You might want to do model.to(cfg.torch_dtype). This would load the dtype resolved here:

https://github.com/OpenAccess-AI-Collective/axolotl/blob/71b7ea3c056f15123b56fef3151b4044c80078b4/src/axolotl/utils/config.py#L73-L78
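The suggestion above can be sketched as follows. This is a minimal, hypothetical mirror of the idea, not axolotl's actual code: `resolve_torch_dtype` stands in for the config logic linked above (which maps the training config's bf16/fp16 flags to a `torch.dtype`), and a plain `nn.Linear` stands in for the merged model. The point is the final `model.to(dtype)` cast, so the saved merge matches the training dtype instead of defaulting to fp16.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for axolotl's cfg.torch_dtype resolution:
# bf16 -> torch.bfloat16, fp16 -> torch.float16, otherwise fp32.
def resolve_torch_dtype(cfg: dict) -> torch.dtype:
    if cfg.get("bf16"):
        return torch.bfloat16
    if cfg.get("fp16"):
        return torch.float16
    return torch.float32

# Stand-in for the merged LoRA model; after merging, cast the whole
# module tree to the training dtype before saving, per the review.
model = nn.Linear(4, 4)
dtype = resolve_torch_dtype({"bf16": True})
model = model.to(dtype)
print(model.weight.dtype)  # torch.bfloat16
```

With a bf16 training config, the merged weights come out as bfloat16; with no flags set, they stay fp32, which addresses the "some individuals might want fp32" concern.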

Review comment on src/axolotl/cli/__init__.py (outdated, resolved)
winglian merged commit 1d21aa6 into main on Nov 29, 2023 (4 checks passed).
winglian deleted the lora-merge-dtype branch on November 29, 2023 at 14:55.
mkeoliya pushed a commit to mkeoliya/axolotl that referenced this pull request Dec 15, 2023
* ensure merged model matches the training dtype

* Update src/axolotl/cli/__init__.py

* Update src/axolotl/cli/__init__.py