An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
