Popular repositories Loading
-
supervised-llm-uncertainty-estimation
supervised-llm-uncertainty-estimation PublicThis repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach".
Jupyter Notebook 13
-
OrdinalRewardModeling
OrdinalRewardModeling PublicOfficial codebase for paper "Reward Modeling with Ordinal Feedback: Wisdom of the Crowd".
-
-
awesome-rlhf
awesome-rlhf PublicForked from louieworth/awesome-rlhf
An index of algorithms for reinforcement learning from human feedback (rlhf))
-
awesome-RLHF-opendilab
awesome-RLHF-opendilab PublicForked from opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.