Do Transformer World Models Give Better Policy Gradients?