
[BUG] I fine-tuned Qwen-VL-Chat-Int4 with Q-LoRA; when I try to merge the model, I get an error. #483

Open
2 tasks done
AuTingU opened this issue Oct 22, 2024 · 0 comments


AuTingU commented Oct 22, 2024

Is there an existing issue / discussion for this?

  • I have searched the existing issues / discussions

Is there an existing answer for this in the FAQ?

  • I have searched the FAQ

Current Behavior

This is the merge code:
from peft import AutoPeftModelForCausalLM

path_to_adapter = "/opt/Qwen-VL/Qwen-VL-master/output_qwen"

model = AutoPeftModelForCausalLM.from_pretrained(
    path_to_adapter,  # path to the output directory
    device_map="auto",
    trust_remote_code=True
).eval()

merged_model = model.merge_and_unload()

# max_shard_size and safe_serialization are not necessary.
# They control checkpoint sharding and saving to safetensors, respectively.
merged_model.save_pretrained(new_model_directory, max_shard_size="2048MB", safe_serialization=True)
[Screenshot 2024-10-22 17-34-56: error output]

Expected Behavior

How can this be resolved? Or can someone tell me how to fine-tune Qwen-VL-Chat-Int4 and then use the fine-tuned model?
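A possible workaround (a sketch, not a confirmed fix for this exact traceback): peft generally cannot merge a Q-LoRA adapter back into a GPTQ-quantized (Int4) base model, which is the usual reason merge_and_unload() fails in this setup. An alternative is to skip the merge and load the adapter on top of the Int4 base for inference. The snippet below assumes the tokenizer is taken from the original Qwen/Qwen-VL-Chat-Int4 checkpoint and uses a placeholder image path; the chat call follows the usage shown in the Qwen-VL README.

from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

# Q-LoRA output directory produced by fine-tuning (contains the adapter weights).
path_to_adapter = "/opt/Qwen-VL/Qwen-VL-master/output_qwen"

# Load the Int4 base model together with the adapter, without merging.
model = AutoPeftModelForCausalLM.from_pretrained(
    path_to_adapter,
    device_map="auto",
    trust_remote_code=True,
).eval()

# Tokenizer from the original quantized checkpoint (assumption: the adapter
# directory does not ship its own tokenizer files).
tokenizer = AutoTokenizer.from_pretrained(
    "Qwen/Qwen-VL-Chat-Int4",
    trust_remote_code=True,
)

# Build a multimodal query and chat, as in the Qwen-VL README.
# "demo.jpeg" is a placeholder image path.
query = tokenizer.from_list_format([
    {"image": "demo.jpeg"},
    {"text": "What is in this image?"},
])
response, history = model.chat(tokenizer, query=query, history=None)
print(response)

If a standalone merged checkpoint is required, one option is to fine-tune the non-quantized Qwen-VL-Chat base with plain LoRA instead, where merge_and_unload() followed by save_pretrained() is supported.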

Steps To Reproduce

No response

Environment

- OS:
- Python:
- Transformers:
- PyTorch:
- CUDA (`python -c 'import torch; print(torch.version.cuda)'`):

Anything else?

No response
