Rate-Optimal Policy Optimization for Linear Markov Decision Processes