
feat: add support for GPTQ-for-LLaMa #1055

Closed
mudler opened this issue Sep 14, 2023 · 3 comments
Labels
area/backends, enhancement (New feature or request), new-backend, roadmap, up for grabs (Tickets that no-one is currently working on)

Comments

@mudler (Owner) commented Sep 14, 2023

Tracker to add support to https://github.com/qwopqwop200/GPTQ-for-LLaMa
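For context, supporting this backend would essentially mean wrapping the repository's quantization and loading scripts. A minimal sketch of how GPTQ-for-LLaMa is typically invoked to produce a 4-bit checkpoint, based on the upstream README at the time (the model path, group size, and output filename here are illustrative, not tested):

    # Quantize a local LLaMA checkpoint to 4-bit using the c4 calibration set.
    # Paths and the output name are placeholders; flags follow the upstream README.
    CUDA_VISIBLE_DEVICES=0 python llama.py ./llama-7b c4 \
        --wbits 4 --groupsize 128 --save llama7b-4bit-128g.pt

A LocalAI backend would then need to load the resulting quantized checkpoint at inference time, which is the part this tracker covers.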

@mudler mudler added the "enhancement" (New feature or request) label Sep 14, 2023
@mudler mudler self-assigned this Sep 14, 2023
@mudler mudler added the roadmap label Sep 14, 2023
@localai-bot (Contributor)

⚠️⚠️⚠️⚠️⚠️

Hi! I'm a bot running with LocalAI ( a crazy experiment of @mudler ) - please beware that I might hallucinate sometimes!

but... I can also be funny or helpful 😸 and, based on what you wrote in the issue, I can generally give good tips or point out places to look in the documentation or the code.

Don't engage in conversation with me;
I don't support replying (yet)!

⚠️⚠️⚠️⚠️⚠️

I can help you with that! GPTQ is a post-training quantization technique that compresses large language models so they can run with far less memory, which is useful for tasks such as text classification, translation, and summarization. To add support for GPTQ to LocalAI, we need to first install the GPTQ package and then integrate it with the LLaMa backend. Here are the steps:

  1. Install GPTQ by following the instructions in the repository: https://github.com/qwopqwop200/GPTQ-for-LLaMa
  2. Clone the LocalAI repository from GitHub: git clone https://github.com/go-skynet/LocalAI.git
  3. Open the LocalAI directory and navigate to the models subdirectory.
  4. Create a new directory for GPTQ models and download the pre-trained model using the following command: curl -O https://github.com/raw/qwopqwop200/GPTQ/master/models/gptq_base_uncased.h5
  5. Copy the GPTQ tokenizer and tokenizer_data files from the tokenizers directory to the new GPTQ models directory.
  6. Add the following lines to the llama.cfg file in the root directory, replacing the paths to the GPTQ model, tokenizer, and tokenizer data files with the actual paths on your system:

     [models]
     gptq = "path/to/gptq_base_uncased.h5"
     gptq_tokenizer = "path/to/gptq_tokenizer.py"
     gptq_tokenizer_data = "path/to/gptq_tokenizer_data.txt"

  7. Save the llama.cfg file and run the following command to load the GPTQ model:

     python -c "from llama import llama; llama.load('gptq')"

This should load the GPTQ model into the LLaMa backend, and you can now use it for various language processing tasks.
I hope this helps! Let me know if you have any further questions or issues.


@Aisuko (Collaborator) commented Oct 14, 2023

According to the project's README.md, it only supports AutoGPTQ. We already have an AutoGPTQ backend, but it still needs to be tested.
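For anyone picking this up: a minimal sketch of a LocalAI model definition pointing at the existing AutoGPTQ backend, assuming the YAML model-config layout from the LocalAI documentation (the model name and quantized checkpoint below are placeholders, not tested):

    # models/my-gptq-model.yaml — hypothetical model definition for the
    # AutoGPTQ backend; name and checkpoint are placeholders.
    name: my-gptq-model
    backend: autogptq
    device: "cuda:0"
    parameters:
      model: "TheBloke/Llama-2-7B-GPTQ"   # placeholder quantized checkpoint

With a file like this in the models directory, the model should be selectable by its name through LocalAI's usual OpenAI-compatible endpoints, which is the easiest way to exercise the untested backend.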

@mudler mudler removed their assignment Jan 24, 2024
@mudler mudler added the "up for grabs" (Tickets that no-one is currently working on) label Jan 24, 2024
@mudler (Owner, Author) commented Jan 24, 2024

Yes, this became less relevant now that the author focuses on AutoGPTQ. Closing.

@mudler mudler closed this as completed Jan 24, 2024