Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a chat_template prompt strategy for DPO #1725

Merged
merged 6 commits into from
Jul 21, 2024

Commits on Jun 15, 2024

  1. Implementing a basic chat_template strategy for DPO datasets

    This mimics the sft chat_template strategy such that users can:
    * Specify the messages field
    * Specify the per message role and content fields
    * speicfy the chosen and rejected fields
    * Let the tokenizer construct the raw prompt
    * Ensure the chosen and rejected fields don't have any prefix tokens
    fozziethebeat committed Jun 15, 2024
    Configuration menu
    Copy the full SHA
    bed95c9 View commit details
    Browse the repository at this point in the history

Commits on Jul 2, 2024

  1. Configuration menu
    Copy the full SHA
    17f4117 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    61000d5 View commit details
    Browse the repository at this point in the history

Commits on Jul 5, 2024

  1. Rename test class

    fozziethebeat committed Jul 5, 2024
    Configuration menu
    Copy the full SHA
    6654826 View commit details
    Browse the repository at this point in the history

Commits on Jul 15, 2024

  1. Configuration menu
    Copy the full SHA
    73a66cc View commit details
    Browse the repository at this point in the history

Commits on Jul 19, 2024

  1. Configuration menu
    Copy the full SHA
    8436e80 View commit details
    Browse the repository at this point in the history