
Generalizing the chat_template prompt strategy #1660

Merged
merged 1 commit into axolotl-ai-cloud:main on May 28, 2024

Conversation

fozziethebeat (Contributor)

Description

The strategy now supports configuring several fields:

  • the data field holding message arrays
  • the role and content fields for each message
  • role mapping from source to target types

Additionally, this adds a sample llama3-8b instruct template using the chat template.
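
For illustration, here is a sketch of how these options could remap a ShareGPT-style dataset (from/value message fields, human/gpt roles). The dataset path and the exact shape of the roles mapping are assumptions for illustration, not taken from this PR:

datasets:
  - path: your/sharegpt_style_dataset    # hypothetical dataset path
    type: chat_template
    chat_template: llama3
    field_messages: conversations        # field holding the message arrays
    message_field_role: from             # per-message role field
    message_field_content: value         # per-message content field
    roles:                               # assumed shape for source-to-target role mapping
      user: ["human"]
      assistant: ["gpt"]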

Fixes #1654

Motivation and Context

#1654

How has this been tested?

Tested via

pytest --ignore tests/e2e

It was further tested by running

python -m axolotl.cli.preprocess examples/llama-3/instruct-lora-8b.yml

and manually inspecting the emitted sample tokens.
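
As a reproduction note (not part of the PR): axolotl's preprocess CLI also accepts a --debug flag that prints the tokenized samples, which is one way to perform this manual inspection, assuming the flag is available in your version:

python -m axolotl.cli.preprocess examples/llama-3/instruct-lora-8b.yml --debug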

Screenshots (if appropriate)

[Screenshot attached, dated 2024-05-27]

Types of changes

  • Generalizing input data configuration options

Social Handles (Optional)

fozziethebeat

winglian (Collaborator) left a comment


Very much needed. Thank you again!

Comment on lines +9 to +16
chat_template: llama3
datasets:
  - path: fozziethebeat/alpaca_messages_2k_test
    type: chat_template
    chat_template: llama3
    field_messages: messages
    message_field_role: role
    message_field_content: content

hammoudhasan commented on May 28, 2024

Do we really need to assign the chat_template twice, inside the dataset entry and outside? I'm testing this PR. Is there any difference between the two chat_template settings?

I feel that passing type: chat_template and the related field keys already specifies how to load the data; the value of chat_template should be the template used for tokenization during training.

winglian (Collaborator)

We already have some handling when using this with sharegpt and chatml, so I've updated that to handle it automatically for the general case here: #1664
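
Presumably, once #1664 lands, the dataset-level chat_template could be omitted and inherited from the top-level setting; a hedged sketch of that intent (the exact inheritance behavior depends on #1664):

chat_template: llama3
datasets:
  - path: fozziethebeat/alpaca_messages_2k_test
    type: chat_template
    # chat_template omitted here; assumed to be inherited from the top-level setting
    field_messages: messages
    message_field_role: role
    message_field_content: content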

winglian merged commit cc11c6b into axolotl-ai-cloud:main on May 28, 2024.
7 checks passed
fozziethebeat (Contributor, Author)

Great! Thanks for merging!

Closes: Generalize the chat_template prompt strategy with more configuration options (#1654)