PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning