On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces