Update rlhf.md (#1178) [skip ci]

axolotl-ai-cloud · Jan 23, 2024 · dc051b8
1 parent 59a31fe
Showing 1 changed file with 3 additions and 3 deletions.
diff --git a/docs/rlhf.md b/docs/rlhf.md
@@ -19,14 +19,14 @@ The various RL training methods are implemented in trl and wrapped via axolotl.
 
 #### DPO
 ```yaml
-rl: true
+rl: dpo
 datasets:
   - path: Intel/orca_dpo_pairs
     split: train
-    type: intel_apply_chatml
+    type: chatml.intel
   - path: argilla/ultrafeedback-binarized-preferences
     split: train
-    type: argilla_apply_chatml
+    type: chatml.argilla
 ```
 
 #### IPO