
Fix prompt assembly for llama #952

Merged 9 commits into main on Dec 14, 2023
Conversation

@hamelsmu (Collaborator) commented Dec 14, 2023

UPDATE: after fixing the omission of the human message, I found many more issues with the llama templating:

  • EOS/BOS tokens were applied incorrectly in between turns
  • The system message was incorrectly wrapped in an [INST] block where it wasn't supposed to be
  • Multi-turn conversations were not assembled properly
  • etc.

I added lots of tests for the following cases:

  1. Multi-turn
  2. Single turn
  3. Multi-turn without a system message
  4. Single turn without a system message

For the tests, I included a redundant decoded version of each tokenized prompt so that you can see the assembled string, which should help with code review and understanding. The sketch just below shows the layout these tests check.
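
A minimal sketch of that layout (the function name and signature here are illustrative, not the PR's actual code), assuming BOS before every user turn, EOS after every assistant turn, and the system message folded into the first [INST] block:

def assemble_llama2_prompt(system, turns):
    """turns: list of (user_msg, assistant_msg) pairs."""
    prompt = ""
    for i, (user, assistant) in enumerate(turns):
        if i == 0 and system:
            # the system message rides inside the first user turn
            user = f"<<SYS>>\n{system}\n<</SYS>>\n\n{user}"
        # BOS opens each user turn, EOS closes each assistant turn
        prompt += f"<s> [INST] {user} [/INST] {assistant}</s>"
    return prompt

assert assemble_llama2_prompt("lorem", [("abc", "ipsum"), ("123", "sit")]) == (
    "<s> [INST] <<SYS>>\nlorem\n<</SYS>>\n\nabc [/INST] ipsum</s>"
    "<s> [INST] 123 [/INST] sit</s>"
)

The assert mirrors the decoded multi-turn string checked by the tests quoted further down this thread.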

@hamelsmu added the bug label on Dec 14, 2023
@hamelsmu (Collaborator, Author) commented:

@winglian ok this is ready for your review

@hamelsmu changed the title from "start at index 0 for fastchat prompt assembly for llama" to "Fix fastchat prompt assembly for llama" on Dec 14, 2023
@hamelsmu changed the title from "Fix fastchat prompt assembly for llama" to "Fix prompt assembly for llama" on Dec 14, 2023
@hamelsmu (Collaborator, Author) commented:

@winglian I fixed quite a few issues that I found. I added a whole bunch of tests.

# prepend the BOS token at the start of each user turn; the parity of the
# user turn index shifts by one when a system message is present
if (i % 2 == 0 and not self.system_message) or (
    i % 2 != 0 and self.system_message
):
    role = "<s> " + role
@hamelsmu (Author) commented on this diff:

The only thing I don't love is hardcoding the BOS token here but I couldn't see a better way to fix this ATM
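
A hypothetical alternative (not in the PR) would be to read the token from a tokenizer rather than hardcode it, assuming a tokenizer object is reachable from this scope, which it may well not be:

# assumes `tokenizer` is a HF tokenizer in scope; fall back to the literal
bos = getattr(tokenizer, "bos_token", None) or "<s>"
if (i % 2 == 0 and not self.system_message) or (
    i % 2 != 0 and self.system_message
):
    role = bos + " " + role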

@jph00 commented Dec 14, 2023

@hamelsmu you're a legend and a hero for doing this

@hamelsmu merged commit 5ada140 into main on Dec 14, 2023. 4 checks passed.
# ... (definition of multi_turn_conv elided in the diff)
# fmt: off
mt_ids = tokenize(multi_turn_conv)
assert decode(mt_ids) == '<s> [INST] <<SYS>>\nlorem\n<</SYS>>\n\nabc [/INST] ipsum</s><s> [INST] 123 [/INST] sit</s>'
@tokestermw (Contributor) commented Dec 14, 2023:

at least for mistral instruct, it doesn't seem to add a BOS_ID for every turn

[BOS_ID] + 
tokenize("[INST]") + tokenize(USER_MESSAGE_1) + tokenize("[/INST]") +
tokenize(BOT_MESSAGE_1) + [EOS_ID] +
…
tokenize("[INST]") + tokenize(USER_MESSAGE_N) + tokenize("[/INST]") +
tokenize(BOT_MESSAGE_N) + [EOS_ID]

https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1
and
https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2

i can try adding some tests separately
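
As a rough sketch of that layout (the function name and exact whitespace are assumptions; the point is a single BOS at the very start and an EOS after each assistant turn, with no BOS in between):

def assemble_mistral_prompt(turns):
    """turns: list of (user_msg, assistant_msg) pairs."""
    prompt = "<s>"  # single BOS, only at the start
    for user, assistant in turns:
        # EOS after each assistant reply, but no fresh BOS per turn
        prompt += f"[INST] {user} [/INST]{assistant}</s>"
    return prompt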

@hamelsmu (Author) replied Dec 14, 2023:
Mistral Instruct also doesn't have BOS tokens in between turns. It's probably best to separate the llama and mistral prompt styles?

@hamelsmu (Author):

@tokestermw sorry, what I meant to say is that the official Mistral Instruct template actually does NOT have BOS in between turns! You can see this if you read the link you sent carefully (I missed it too at first).

@tokestermw (Contributor):

"does NOT have BOS in between turns"

Yes, that's what I meant as well :) Agreed on separating llama and mistral, but maybe we can do it in the fastchat repo (or both).

@hamelsmu (Author):

Good idea

@casper-hansen (Collaborator) commented:
Oh my, this is great @hamelsmu. Most of us use ChatML for conversations by now (OpenOrca, OpenHermes, Dolphin), so it would be amazing to have more testing for other formats as well. Prompt formatting is something I have seen issues with before, so it's great to see it being handled now!
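
For reference, a sketch of the ChatML layout mentioned above (illustrative only): each message is wrapped in <|im_start|>role ... <|im_end|> markers, so no turn-parity logic is needed.

def assemble_chatml_prompt(system, turns):
    """turns: list of (user_msg, assistant_msg) pairs."""
    parts = []
    if system:
        parts.append(f"<|im_start|>system\n{system}<|im_end|>")
    for user, assistant in turns:
        parts.append(f"<|im_start|>user\n{user}<|im_end|>")
        parts.append(f"<|im_start|>assistant\n{assistant}<|im_end|>")
    return "\n".join(parts)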

@winglian winglian deleted the fix-human branch January 23, 2024 12:33