Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Increment max penalty + complement task with layout criteria #86

Merged
merged 6 commits into from
Nov 29, 2023

Conversation

p-ferreira
Copy link
Contributor

  • increment max penalty to .75
  • complements summary task creation with layout criteria (bullet points, numbered list and “raw” simple layout)
  • increase penalty landscape for all tasks to sum to .75

Note: The tasks of qg and qa already follow a defined layout, so the expansion was performed on the summarization task

else:
layout_criteria = MatchLayoutCriteria(
layout_type=LayoutMatchTypeEnum.NUMBERED_LIST,
penalty=0.5,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no text here?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it will get the default text Your response should be ordered in format of numbered list. The bullet point criteria would list like "unordered list", that's why I override it to be "bullet points"

Copy link
Contributor

@steffencruz steffencruz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks fine, but do we need extra description on the numbered list?

else:
layout_criteria = MatchLayoutCriteria(
layout_type=LayoutMatchTypeEnum.NUMBERED_LIST,
penalty=0.5,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
penalty=0.5,
penalty=0.5,
text="Your response should be ordered in format of numbered points.",

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It will get it by default Your response should be ordered in format of numbered list

@steffencruz
Copy link
Contributor

Extra comment: what are the existing layout criteria used by qg and qa?

  • qg is simply exclude any list-like completion?
  • ga is ..?

@p-ferreira
Copy link
Contributor Author

Extra comment: what are the existing layout criteria used by qg and qa?

  • qg is simply exclude any list-like completion?
  • ga is ..?

qg:

  • Question starts with prefix (e.g. "what", "when", "where", etc.)
  • Question ends with "?"
  • random criteria of either char length or word length

qa:

  • answer should not include "?"
  • text should follow "simple layout" (meaning no unordered lists or numbered lists)
  • random word len criteria

@p-ferreira p-ferreira merged commit 01a3660 into staging Nov 29, 2023
4 checks passed
@steffencruz
Copy link
Contributor

Do we specify the number of list items?

@steffencruz
Copy link
Contributor

and why not update the qa criteria to sometimes allow lists? this is actually reasonable behaviour at this point

@p-ferreira
Copy link
Contributor Author

Do we specify the number of list items?

no, it's a binary verification, it was too last minute to bring this sort of complexity to the release imho

@p-ferreira
Copy link
Contributor Author

and why not update the qa criteria to sometimes allow lists? this is actually reasonable behaviour at this point

because most of the miners already return in list layout already since they are leveraging the qg+qa / key takeaways exploits, so forcing it to be simple punishes those type of exploits

@steffencruz
Copy link
Contributor

good stuff. thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants