
After fine-tuning, the model repeats the answer and does not stop #467

Open
abdoelsayed2016 opened this issue May 19, 2023 · 12 comments

Comments

@abdoelsayed2016

abdoelsayed2016 commented May 19, 2023

After fine-tuning, I have encountered an unexpected behavior where the model repeatedly generates the same answer without stopping.

I tried different LLaMA models:

decapoda-research/llama-7b-hf
yahma/llama-7b-hf

Here are some examples:

Tell me about alpacas.


1. Alpacas are from South America.
2. Alpacas are from South America.
3. Alpacas are from South America.
4. Alpacas are from South America.
5. Alpacas are from South America.
6. Alpacas are from South America.
7. Alpacas are from South America.
8. Alpacas are from South America.
9. Alpacas are from South America.
10. Alpacas are from South America.
11. Alpac
who played charlie bucket in the original charlie and the chocolate factory

\begin{itemize}
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\item Freddie Highmore
\

Also, the adapter_model.bin file is only 1 KB.

@abdoelsayed2016 abdoelsayed2016 changed the title after finetue the model repeat the answer and not stop After fine-tuning, the model repeat the answer and not stop May 19, 2023
@aryanq123

aryanq123 commented May 20, 2023

Can someone assign this task to me? I want to work on it.

Please explain it briefly.

@Qubitium

@abdoelsayed2016 You are using an old model that is no longer recommended, as it was converted to HF using the wrong tokenizer params. Check the repo README for valid 7B HF models.

@abdoelsayed2016
Author

abdoelsayed2016 commented May 20, 2023

@diegomontoya I have already read the README many times. I see that while fine-tuning, the base model is decapoda-research/llama-7b-hf.
I have two questions. First, why does the model repeat the answer? The model doesn't know when it should stop.

The second question: the adapter weight file is only 1 KB. Is that normal?

@abdoelsayed2016
Author

Also, I tried huggyllama/llama-7b.

@abdoelsayed2016
Author

I see a lot of issues talking about the size of the adapter. I tried all the solutions, but the adapter is still the same size, 1 KB.

Has anyone faced the same issue? Training is costly.
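Before paying for another training run, a cheap sanity check on the saved file can catch this early. A hypothetical helper (not part of this repo), assuming a healthy LoRA adapter for a 7B model is at least several MB:

```python
import os

def adapter_looks_empty(path, min_bytes=1_000_000):
    """Return True if the adapter file is suspiciously small.

    A ~1 KB adapter_model.bin usually means PEFT serialized an empty
    state dict, i.e. the trained LoRA weights were never saved.
    """
    return os.path.getsize(path) < min_bytes
```

If this returns True right after training finishes, the adapter is almost certainly broken and there is no point evaluating it.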

@asenmitrev

asenmitrev commented May 20, 2023

I was able to solve this issue with this comment. Uninstall peft and install it from the GitHub commit.

I was desperate about the repetitions with both models (yahma and decapoda-research), but I think the root cause is the empty adapter_model.bin.

If you want to do a quick test without training, take the last checkpoint's pytorch_model.bin and replace adapter_model.bin with it, renaming it. This allowed me to test the model, and the repetition was fixed.
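The manual copy described above can be scripted. A sketch assuming training wrote checkpoint-NNN directories under the output dir (the function name and layout here are illustrative, not from the repo):

```python
import shutil
from pathlib import Path

def promote_last_checkpoint(output_dir):
    """Copy the newest checkpoint's pytorch_model.bin over adapter_model.bin."""
    out = Path(output_dir)
    # Sort checkpoint-10, checkpoint-200, ... numerically, not lexically.
    checkpoints = sorted(
        out.glob("checkpoint-*"),
        key=lambda p: int(p.name.split("-")[-1]),
    )
    if not checkpoints:
        raise FileNotFoundError(f"no checkpoint-* directories in {out}")
    src = checkpoints[-1] / "pytorch_model.bin"
    dst = out / "adapter_model.bin"
    shutil.copyfile(src, dst)
    return dst
```

Note that the copied file is a full checkpoint, not a real LoRA adapter, which is why it can be several GB; it only serves as a quick test of the trained weights.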

@abdoelsayed2016
Author

abdoelsayed2016 commented May 20, 2023

@asenmitrev

Yesterday I tried this solution, but the adapter_model.bin size is still 1 KB:


pip uninstall peft -y
pip install git+https://github.com/huggingface/peft.git@e536616888d51b453ed354a6f1e243fecb02ea08

I tried again after uninstalling peft.

@abdoelsayed2016
Author

abdoelsayed2016 commented May 21, 2023

I also tried these solutions:
huggingface/peft#286
#37 (comment)

The adapter size is now 7 GB, and the model went back to repeating the answers.

I also renamed the checkpoint's pytorch_model.bin to adapter_model.bin: same output.

@asenmitrev

I tried decapoda-research and the answer is repeated; yahma is fine.

@abdoelsayed2016
Author

abdoelsayed2016 commented May 21, 2023

@asenmitrev Yahma with which version of peft? I tried it and had the same problem.
I'm confused. They should add a description to the README or pin the requirements.

@asenmitrev

The one quoted in my first answer.

@SimpleConjugate

SimpleConjugate commented May 24, 2023

You should try adding the repetition_penalty keyword argument to the generation config in the evaluate function; repetition_penalty > 1 should do it. I fine-tuned a model and used repetition_penalty=2 to resolve the problem for myself.
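For intuition, the penalty scheme transformers applies for repetition_penalty can be sketched in plain Python (toy logits, not the library's implementation): scores of tokens that already appeared are pushed down, so loops like the repeated "Freddie Highmore" lines become less likely.

```python
def apply_repetition_penalty(logits, generated_ids, penalty=2.0):
    """Down-weight tokens that have already been generated.

    Mirrors the usual scheme: positive scores are divided by the
    penalty, negative scores are multiplied by it.
    """
    out = list(logits)
    for tok in set(generated_ids):
        if out[tok] > 0:
            out[tok] /= penalty
        else:
            out[tok] *= penalty
    return out
```

For example, apply_repetition_penalty([2.0, -1.0, 0.5], [0, 1]) gives [1.0, -2.0, 0.5], so both previously generated tokens become less probable after softmax. This masks the symptom, though; with a correctly saved adapter the model should learn to emit the end-of-sequence token on its own.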
