Theoretical Analysis of KL-regularized RLHF with Multiple Reference Models

Open in new window