Bad-Policy Density: A Measure of Reinforcement Learning Hardness