Confident Natural Policy Gradient for Local Planning in q