Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
–Neural Information Processing Systems
Reinforcement Learning from Human Feedback (RLHF) is a powerful paradigm for aligning foundation models to human values and preferences.
Neural Information Processing Systems
Mar-21-2025, 09:08:54 GMT
- Country:
- Europe > Switzerland
- North America
- Canada (0.28)
- United States > Washington
- King County > Seattle (0.14)
- Genre:
- Research Report > Experimental Study (0.93)
- Technology: