Reinforcement Learning from Human Feedback: A Statistical Perspective