Policy Gradient for LQR with Domain Randomization