
Support Qwen type llm's #638

Closed · 5 tasks done
CyberTimon opened this issue Sep 26, 2023 · 7 comments
Labels: enhancement (New feature or request)

Comments

@CyberTimon

⚠️ Please check that this feature request hasn't been suggested before.

  • I searched previous Ideas in Discussions and didn't find any similar feature requests.
  • I searched previous Issues and didn't find any similar feature requests.

🔖 Feature description

With the new Qwen 14B model, it would be very nice to have axolotl support.

✔️ Solution

Support fine-tuning of Qwen-type LLMs.

❓ Alternatives

Normal fine-tuning, but it's unoptimized.

📝 Additional Context

No response

Acknowledgements

  • My issue title is concise, descriptive, and in title casing.
  • I have searched the existing issues to make sure this feature has not been requested yet.
  • I have provided enough information for the maintainers to understand and evaluate this request.
CyberTimon added the enhancement (New feature or request) label on Sep 26, 2023
@NanoCode012
Collaborator

Could you please try with trust_remote_code: true, using AutoModelForCausalLM and AutoTokenizer? Is there an error?
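
For reference, here is a minimal sketch in plain transformers code of what that maps to; the Qwen/Qwen-14B checkpoint name is only an example, not something prescribed by axolotl:

```python
# Sketch: load Qwen with the generic Auto* classes and remote code enabled.
# "Qwen/Qwen-14B" is only an example checkpoint; substitute the model you use.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen-14B"

# trust_remote_code=True lets transformers run the custom model/tokenizer code
# shipped with the Qwen repo, which is what the axolotl option enables.
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)
```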

@Peter-Devine

I'd like to second this. trust_remote_code: true seems to work for full fine-tuning, but I get errors for QLoRA training. Is fully tested Qwen support on the roadmap for this project? It would be really useful for CJK languages!

@CheshireAI

I get this error when trying to use QLoRA with Qwen:

ValueError: Adding unknown special tokens is not supported
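
In case it helps to narrow it down, here is a rough reproduction sketch outside axolotl; it assumes the error comes from Qwen's remote-code tokenizer rejecting special tokens it doesn't already know about (which matches the message), and the <pad> token is only an illustration:

```python
# Guess at the trigger: Qwen's custom tokenizer refuses special tokens outside
# its predefined set, so adding e.g. a pad token raises the same ValueError.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen-14B", trust_remote_code=True)
tokenizer.add_special_tokens({"pad_token": "<pad>"})
# -> ValueError: Adding unknown special tokens is not supported
```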

@kostum123

Has anyone been able to fine-tune it with QLoRA?

@NanoCode012
Collaborator

I get this error when trying to use QLoRA with Qwen: ValueError: Adding unknown special tokens is not supported

@CheshireAI, hm, this is an upstream problem. If you want this feature, you should ask in the transformers repo.

@Peter-Devine @kostum123, what's the error when running QLoRA?

@NanoCode012
Collaborator

Hello, I apologize for the long delay. This should be fixed here: #894

Please let me know how it goes.

@CyberTimon
Author

Thank you very much! Will try it out soon.
