Constrained Policy Optimization via Bayesian World Models