Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment
