Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies