[R] A Bayesian Perspective on Q-Learning