Penalty rewards #52

p-ferreira · 2023-10-27T13:06:59Z

This PR seeks to propose a penalty mechanism with:

- Penalty rewards: A new family of function models that will act as penalty functions accordingly to the established definition of the function. This PR also introduces a task-criteria schema that enable criteria assignment to a given task through prompt definition and code validation.

Tasks done so far:

add generic task and criteria schema that can be easily expanded
add SummaryTask, QuestionGenerationTask and QuestionAnswerTask
adds MatchLengthCriteria with support to characters, words, sentences and paragraphs
adapt current workflow to include task validation
adapt current workflow to not concatenate previous answer and questions
add penalty functions
incorporate legacy task validator reward model functionality into new KeywordMatch penalty function
add SentenceMatch penalty function

TODO (Once current design is approved):

reorganize and properly document the code generated
add unit tests for implemented functionality

Wandb run with penalty functions implemented:

…ewards

prompting/validators/reward/keyword_penalty.py

prompting/validators/tasks.py

Unkownman086 · 2023-10-31T12:05:26Z

With the new prompting style you receive much worse ratings from the RLHF model.
I have tried with same context and i am able to measure from average GPT-3.5, vicuna response much worse rating on RLHF model, when normalized ≈ 30% worse.

I whould think the RLHF model doesn't do well, when the question is at the start of the prompt

neurons/validators/validator.py

prompting/validators/tasks.py

prompting/validators/reward/keyword_penalty.py

prompting/validators/penalty/sentence_match.py

prompting/validators/penalty/penalty.py

prompting/validators/forward.py

…sor/text-prompting into features/penalty_rewards

prompting/validators/criteria.py

steffencruz

Please make penalty_scale_factor in prompting/validators/criteria.py a gaussian as i showed in the plot.

prompting/validators/criteria.py

p-ferreira and others added 9 commits October 24, 2023 18:32

update validator version

d09ecbc

adds initial scratch for task and criteria work

b3c508c

Merge branch 'staging' into features/penalty_rewards

47b5da7

Merge remote-tracking branch 'origin/staging' into features/penalty_r…

fbdf4cf

…ewards

integrates task flow with forward step

b957ad0

adds to new fields to wandb event

fa24bff

split criteria and tasks in different files

49a21be

drop task doc class (to do it later)

8160e43

adds keyword penalty model

b3d5ee6

steffencruz reviewed Oct 27, 2023

View reviewed changes

prompting/validators/reward/keyword_penalty.py Outdated Show resolved Hide resolved

steffencruz reviewed Oct 27, 2023

View reviewed changes

prompting/validators/tasks.py Show resolved Hide resolved

steffencruz reviewed Oct 27, 2023

View reviewed changes

prompting/validators/tasks.py Show resolved Hide resolved

steffencruz reviewed Oct 27, 2023

View reviewed changes

prompting/validators/tasks.py Outdated Show resolved Hide resolved

p-ferreira added 6 commits October 27, 2023 21:27

refactor including team considerations

edb801b

update penalty calculations

f3d15c6

adjusts wandb event

6c428ca

runs black on latest changes

7980025

removes redundant key from event log

e531c87

adds license texts on new files

4da916c

p-ferreira requested review from steffencruz, ifrit98, isabella618033 and Eugene-hu October 30, 2023 17:35

p-ferreira added 2 commits October 31, 2023 18:37

adjust sentence match penaly model

7aad7e0

change round of conversations to 3

886e50b

p-ferreira changed the title ~~Penalty rewards [WIP]~~ Penalty rewards Nov 1, 2023

p-ferreira marked this pull request as ready for review November 1, 2023 18:34

steffencruz reviewed Nov 1, 2023

View reviewed changes

adds text separation patterns to keyword match

e624b36

upgrades sentence match

9563614

steffencruz reviewed Nov 1, 2023

View reviewed changes

prompting/validators/forward.py Outdated Show resolved Hide resolved

p-ferreira and others added 18 commits November 1, 2023 20:09

change penalty implementation to use nclip

69f9937

updates criteria regex

e95ff01

removes redundant keyword penalty fn

5653030

updates sentence match to content match

f54f7f6

sample just one criteria

7124353

update penalty scale factor calculation

4eb34ab

change reward evaluation to use task base text

d29ca2b

updates question generation criteria

8df514e

run black in repo

b9682d1

update match len criteria with count sentences fn

2a70d8d

adds pm2 config file to gitignore

437b2a3

update git ignore with pm2 config file

a932e9b

Merge branch 'staging' into features/penalty_rewards

3595ea1

format criteria file with black

85bf11f

Merge branch 'features/penalty_rewards' of https://github.com/openten…

971d20d

…sor/text-prompting into features/penalty_rewards

Merge branch 'staging' into features/penalty_rewards

c39cdab

updating validators version to 2.1.0

58917a7

Merge branch 'features/penalty_rewards' of https://github.com/openten…

84525c1

…sor/text-prompting into features/penalty_rewards

steffencruz approved these changes Nov 1, 2023

View reviewed changes

steffencruz reviewed Nov 1, 2023

View reviewed changes

prompting/validators/criteria.py Outdated Show resolved Hide resolved

steffencruz suggested changes Nov 1, 2023

View reviewed changes

prompting/validators/criteria.py Outdated Show resolved Hide resolved

p-ferreira added 3 commits November 1, 2023 22:10

update penalty scale factor

82691ec

updates _count_sentences regex pattern

d4e95a3

fix pattern definition on criteria

aea679a

steffencruz approved these changes Nov 2, 2023

View reviewed changes

p-ferreira merged commit 3f21ca0 into staging Nov 2, 2023
4 checks passed

p-ferreira mentioned this pull request Nov 2, 2023

2.1.0 Release #59

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Penalty rewards #52

Penalty rewards #52

p-ferreira commented Oct 27, 2023 •

edited

Loading

Unkownman086 commented Oct 31, 2023

steffencruz left a comment

Penalty rewards #52

Penalty rewards #52

Conversation

p-ferreira commented Oct 27, 2023 • edited Loading

Unkownman086 commented Oct 31, 2023

steffencruz left a comment

Choose a reason for hiding this comment

p-ferreira commented Oct 27, 2023 •

edited

Loading