PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning

Open in new window