Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL