Aligning Language Models with Human Preferences via a Bayesian Approach Jiashuo W ANG 1, Haozhao W ANG