RLHF

RLHF

/ˌɑːr el eɪtʃ ˈef/

Generative AI

Reinforcement Learning from Human Feedback for training AI

RLHF in a sentence

RLHF helped the model produce more helpful and harmless responses.

Origin of RLHF

Acronym combining reinforcement (Latin re- + fortis) + learning + human (Latin humanus) + feedback