You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
PROBLEM
For certain queries, the model is unable to respond to a valid request due to 'RLHF-overfitting'.
SOLUTION
Implement a layer that converts valid queries that might get rejected to a form that is more likely to be accepted by the LLM. The layer could also utilize RAG etc.
This discussion was converted from issue #184 on April 04, 2024 19:59.
Heading
Bold
Italic
Quote
Code
Link
Numbered list
Unordered list
Task list
Attach files
Mention
Reference
Menu
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
PROBLEM
For certain queries, the model is unable to respond to a valid request due to 'RLHF-overfitting'.
SOLUTION
Implement a layer that converts valid queries that might get rejected to a form that is more likely to be accepted by the LLM. The layer could also utilize RAG etc.
ALTERNATIVES
OTHER INFO
Beta Was this translation helpful? Give feedback.
All reactions