about finetune train #90

Closed
ZuyongWu opened this issue Jun 7, 2024 · 2 comments
Comments

ZuyongWu commented Jun 7, 2024

I only have 4x 4090 cards. Under this circumstance, can I finetune an MLLM?
How should I set up the training? Thanks a lot.

RussRobin (Collaborator) commented Jul 6, 2024

Thank you for your interest in Bunny.
24 GB per device is enough to pretrain and finetune Bunny. However, the actual GPU memory consumption depends on your base model, image resolution, and data.

For finetuning, setting the per-device batch size to 2 or 4 should work well for you. To use the default learning rate in finetune_lora.sh, we recommend keeping the global batch size at 128. Global batch size = number of GPUs * batch size per GPU * gradient accumulation steps. In your case, the number of GPUs is 4. All of these parameters can be set in finetune_lora.sh. Similarly, choose a batch size that fits your hardware for pretraining or full-parameter tuning.
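
For concreteness, here is a minimal sketch of the batch-size arithmetic on 4 GPUs, assuming finetune_lora.sh passes HuggingFace-Trainer-style flags (the exact flag names and values in your copy of the script may differ):

```bash
# Keep global batch size = num_gpus * per_device_batch_size * grad_accum_steps = 128
NUM_GPUS=4
PER_DEVICE_BATCH_SIZE=4                                         # try 2 if 24 GB is tight
GRAD_ACCUM_STEPS=$((128 / (NUM_GPUS * PER_DEVICE_BATCH_SIZE)))  # -> 8

echo "global batch size = $((NUM_GPUS * PER_DEVICE_BATCH_SIZE * GRAD_ACCUM_STEPS))"  # 128

# Then set --per_device_train_batch_size and --gradient_accumulation_steps
# in finetune_lora.sh to $PER_DEVICE_BATCH_SIZE and $GRAD_ACCUM_STEPS.
```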

Feel free to comment further on this issue if you encounter any problems using Bunny.

Regards
Russell

RussRobin (Collaborator) commented

I'll close this issue since no further discussion has been raised. Please reopen it if you still have concerns.
