
fix: torch_dtype mistral default to fp32 #1050

Merged

Conversation

NanoCode012
Collaborator

Single-GPU Mistral FFT training would result in `torch_dtype: float32`, which in turn caused checkpoints to be saved as full-precision weights. This fixes it.

This issue does not occur with multi-GPU or LoRA training.

Ref: #904
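
For context, the failure mode is the dtype resolution falling through to `float32` when neither half-precision flag takes effect. A minimal sketch of the intended behavior (the `resolve_torch_dtype` helper and the `bf16`/`fp16` attribute names are illustrative assumptions, not the actual axolotl patch):

```python
import torch

def resolve_torch_dtype(cfg):
    """Pick the dtype used for model loading and checkpoint saving.

    `cfg.bf16` / `cfg.fp16` are assumed config flags; falling through to
    float32 is what produced the full-weight checkpoints described above.
    """
    if getattr(cfg, "bf16", False):
        return torch.bfloat16
    if getattr(cfg, "fp16", False):
        return torch.float16
    return torch.float32
```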

@winglian winglian merged commit c3e8165 into axolotl-ai-cloud:main Jan 9, 2024
6 checks passed
@NanoCode012 NanoCode012 deleted the fix/torch_dtype_mistral branch March 30, 2024 18:13
2 participants