Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents

Open in new window