Get qlora mistral-7b fine tuning working on a single 4090 #708

lukemarsden · 2023-10-09T16:17:03Z

Lowers micro_batch_size from 4 to 2 in the mistral/qlora.yml example. With this change, qlora fine tuning of mistral-7b fits on a single 4090, per https://twitter.com/Teknium1/status/1709750388528939473.

I tried a few different values of micro_batch_size and gradient_accumulation_steps, as suggested by the README, but I'm not an expert, so please advise if there's a better way to do this.
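For anyone weighing the same trade-off, here is a rough sketch of how the two settings interact. It assumes the usual relationship where the effective batch size is micro_batch_size × gradient_accumulation_steps × number of GPUs. The helper names are made up for illustration, and the accumulation value of 4 is a placeholder, not the value in the example config.

```python
def effective_batch_size(micro_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_gpus: int = 1) -> int:
    """Samples contributing to one optimizer step (hypothetical helper)."""
    return micro_batch_size * gradient_accumulation_steps * num_gpus


def compensating_accumulation_steps(old_micro: int,
                                    old_accum: int,
                                    new_micro: int) -> int:
    """Accumulation steps that keep the effective batch size unchanged
    after micro_batch_size changes (hypothetical helper)."""
    total = old_micro * old_accum
    if total % new_micro != 0:
        raise ValueError("new micro_batch_size does not divide the old effective batch")
    return total // new_micro


# micro_batch_size 4 -> 2 comes from this PR; the accumulation value 4 is
# an assumed placeholder used only to show the arithmetic.
old_micro, old_accum, new_micro = 4, 4, 2
new_accum = compensating_accumulation_steps(old_micro, old_accum, new_micro)

print(effective_batch_size(old_micro, old_accum))  # effective batch before the change
print(new_accum)                                   # accumulation steps that compensate
print(effective_batch_size(new_micro, new_accum))  # effective batch after compensating
```

Halving micro_batch_size mainly reduces peak activation memory, since fewer samples are held on the GPU per forward/backward pass. Raising gradient_accumulation_steps to compensate keeps the effective batch size the same at the cost of more passes per optimizer step.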

NanoCode012

Sure, this would be helpful for most home users.

Get qlora mistral-7b fine tuning working on a single 4090

7c70dc5

NanoCode012 approved these changes Oct 10, 2023

NanoCode012 merged commit 295b266 into axolotl-ai-cloud:main Oct 10, 2023

mkeoliya pushed a commit to mkeoliya/axolotl that referenced this pull request Dec 15, 2023

Get qlora mistral-7b fine tuning working on a single 4090 (axolotl-ai…

388b0e7

…-cloud#708)
