A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement Learning

Open in new window