reinforcement-learning-from-human-feedback

Here are 15 public repositories matching this topic...

Chinmaya-Kausik / RLHF-comparison

Comparing various RLHF methods

reinforcement-learning transformers transformer ppo dpo llm llms rlhf reinforcement-learning-from-human-feedback reinforcement-learning-from-ai-feedback

Updated Sep 23, 2024
Jupyter Notebook

umenzi / diversity-rlhf

Code for Bachelor thesis, The Human Factor: Addressing Diversity in Reinforcement Learning from Human Feedback.

reinforcement-learning tudelft-cse-research-project rlhf reinforcement-learning-from-human-feedback

Updated Aug 17, 2024
Python

ymnseol / weekly-paper-reading-group

Summaries of papers related to the alignment problem in NLP

nlp natural-language-processing rlhf instruction-tuning reinforcement-learning-from-human-feedback

Updated May 29, 2023

SJ9VRF / Reinforcement-Learning-for-Human-Feedback-RLHF-

This repository contains the implementation of a Reinforcement Learning with Human Feedback (RLHF) system using custom datasets. The project utilizes the trlX library for training a preference model that integrates human feedback directly into the optimization of language models.

language-model language-mo llms rlhf reinforcement-learning-from-human-feedback

Updated Aug 17, 2024
Python

Almost-Intelligence / LMRax

LMRax is a framework built on JAX for training transformer language models with reinforcement learning, including reward model training.

reinforcement-learning transformer language-model jax reinforcement-learning-from-human-feedback

Updated Mar 3, 2023
Python

liushunyu / Ask-AC

[TSMC] Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework

reinforcement-learning reinforcement-learning-from-human-feedback action-advising

Updated Jun 28, 2024
Python

ymetz / rlhfblender

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

react python reinforcement-learning experimentation human-ai-interaction reinforcement-learning-from-human-feedback

Updated Oct 4, 2024
Python

XplainMind / LLMindCraft

Shaping Language Models with Cognitive Insights

docker transformers pretraining deepspeed large-language-models reinforcement-learning-from-human-feedback instruct-tuning

Updated Feb 29, 2024
Python

clam004 / minichatgpt

Annotated tutorial of the Hugging Face TRL repo for reinforcement learning from human feedback, connecting equations from PPO and GAE to the lines of code in the PyTorch implementation.

nlp reinforcement-learning deep-learning transformers deep-reinforcement-learning pytorch language-model fine-tuning large-language-models reinforcement-learning-from-human-feedback