This directory contains notebooks that can be used to train a reward model and then fine-tune an LLM using reinforcement learning. For a detailed overview of reward modeling and RLHF, refer to the notebooks below:
- rewardModelTraining.ipynb : This notebook takes user preference data as input and trains a model of choice to output a scalar reward (a rough sketch of this step appears after the list).
- RLHFImplementation.ipynb : This notebook takes an SFT LLM, a reward model, and data as input, then fine-tunes the LLM using PPO (a rough sketch also follows the list).
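
As a rough illustration of the reward-modeling step, the sketch below trains a scalar reward head on pairwise preference data with a Bradley-Terry style loss. The base model (`gpt2`), the toy preference pairs, and the hyperparameters are placeholders for illustration only and are not necessarily what rewardModelTraining.ipynb uses.

```python
import torch
import torch.nn.functional as F
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "gpt2"  # placeholder base model

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default

# A sequence-classification head with a single output acts as the scalar reward head.
reward_model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=1)
reward_model.config.pad_token_id = tokenizer.pad_token_id

# Toy preference pairs: each item has a "chosen" and a "rejected" response.
preference_data = [
    {"chosen": "The capital of France is Paris.",
     "rejected": "The capital of France is Rome."},
]

optimizer = torch.optim.AdamW(reward_model.parameters(), lr=1e-5)

def scalar_reward(texts):
    """Return one scalar reward per input text."""
    batch = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    return reward_model(**batch).logits.squeeze(-1)

reward_model.train()
for pair in preference_data:
    r_chosen = scalar_reward([pair["chosen"]])
    r_rejected = scalar_reward([pair["rejected"]])
    # Pairwise loss: the chosen response should score higher than the rejected one.
    loss = -F.logsigmoid(r_chosen - r_rejected).mean()
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```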
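
For the PPO step, the sketch below uses the classic `PPOTrainer` API from the TRL library (older TRL releases; newer versions have changed this interface). The model names, prompts, and hyperparameters are stand-ins, not necessarily what RLHFImplementation.ipynb uses: `MODEL_NAME` stands for the SFT LLM and `REWARD_MODEL` for the trained reward model checkpoint.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification
from trl import PPOTrainer, PPOConfig, AutoModelForCausalLMWithValueHead

MODEL_NAME = "gpt2"    # placeholder for the SFT LLM
REWARD_MODEL = "gpt2"  # placeholder for the trained reward model checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
tokenizer.pad_token = tokenizer.eos_token

# Policy (with a value head for PPO) and a frozen reference copy for the KL penalty.
policy = AutoModelForCausalLMWithValueHead.from_pretrained(MODEL_NAME)
ref_policy = AutoModelForCausalLMWithValueHead.from_pretrained(MODEL_NAME)

# Reward model producing a scalar score per (prompt, response) text.
reward_model = AutoModelForSequenceClassification.from_pretrained(REWARD_MODEL, num_labels=1)
reward_model.config.pad_token_id = tokenizer.pad_token_id

config = PPOConfig(model_name=MODEL_NAME, learning_rate=1.41e-5, batch_size=2, mini_batch_size=1)
ppo_trainer = PPOTrainer(config, policy, ref_policy, tokenizer)

prompts = ["Explain RLHF in one sentence:", "Write a friendly greeting:"]
query_tensors = [tokenizer.encode(p, return_tensors="pt").squeeze(0) for p in prompts]

# Roll out responses from the current policy.
generation_kwargs = {"do_sample": True, "max_new_tokens": 32, "pad_token_id": tokenizer.eos_token_id}
response_tensors = ppo_trainer.generate(query_tensors, return_prompt=False, **generation_kwargs)
responses = [tokenizer.decode(r, skip_special_tokens=True) for r in response_tensors]

# Score each prompt + response with the reward model.
rewards = []
for prompt, response in zip(prompts, responses):
    batch = tokenizer(prompt + response, return_tensors="pt", truncation=True)
    rewards.append(reward_model(**batch).logits.squeeze())

# One PPO optimization step against the reward signal.
stats = ppo_trainer.step(query_tensors, response_tensors, rewards)
```

In practice this loop runs over many batches of prompts, and the KL penalty against the reference policy keeps the fine-tuned LLM from drifting too far from the SFT model.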