LoveCatc

supervised-llm-uncertainty-estimation Public

This repo contains code for the paper "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach".

Jupyter Notebook 13
OrdinalRewardModeling Public

Official codebase for the paper "Reward Modeling with Ordinal Feedback: Wisdom of the Crowd".

Python 2 1
pyfriso Public

Forked from synodriver/pyfriso

Python binding for friso

Cython
awesome-rlhf Public

Forked from louieworth/awesome-rlhf

An index of algorithms for reinforcement learning from human feedback (RLHF)
awesome-RLHF-opendilab Public

Forked from opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)