Hierarchical Soft Actor-Critic: Adversarial Exploration via Mutual Information Optimization

Open in new window