Optimizing Pessimism in Dynamic Treatment Regimes: A Bayesian Learning Approach