Reinforcement Learning from Human Feedback: A Statistical Perspective

Open in new window