Reviews: Causal Confusion in Imitation Learning

Jan-25-2025, 19:08:28 GMT–Neural Information Processing Systems

Summary: This paper makes a very interesting claim: distributional shift in imitation learning is primarily caused by causal misidentification of features by the learning algorithm. An illustrative example is a self-driving car policy trained on a dataset of paired image-control datapoints collected from an expert human driver. If the images include the turn signal on the dashboard, the supervised learner achieves very good predictive power by keying on that feature, even though the signal is an effect of the expert's action rather than a cause of it. Clearly, such a policy does not generalize at test time. While this is a pathological example, such behavior is present in most settings where the state is augmented by appending past states and actions.
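The effect described in the summary can be reproduced in a toy simulation. The sketch below is not the paper's experiment or method. It is a minimal illustration under assumed settings: a binary "obstacle" signal that persists over time, an expert who brakes exactly when the obstacle is present, a dashboard indicator that shows the previous action, and a noisy view of the obstacle. The function name `simulate` and the parameters `stay` and `obs_noise` are hypothetical. For simplicity the obstacle sequence does not react to the policy's actions.

```python
import random


def simulate(steps=20000, stay=0.95, obs_noise=0.10, seed=0):
    """Compare a 'confused' and a 'causal' policy, offline and in rollout.

    All settings are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    obstacle = 0        # true cause of braking (1 = obstacle ahead)
    prev_expert = 0     # expert's previous action, shown by the indicator in the demos
    prev_confused = 0   # confused policy's own previous action, shown during rollout
    hits = {"offline_confused": 0, "offline_causal": 0,
            "rollout_confused": 0, "rollout_causal": 0}

    for _ in range(steps):
        # The obstacle state is persistent: it flips with probability 1 - stay.
        if rng.random() > stay:
            obstacle = 1 - obstacle
        expert = obstacle  # the expert brakes exactly when there is an obstacle

        # The learner only sees a noisy view of the true cause.
        noisy_view = obstacle if rng.random() > obs_noise else 1 - obstacle

        # Causal policy: acts on the noisy view and ignores the indicator,
        # so it behaves identically offline and in rollout.
        causal = noisy_view
        # Confused policy: copies the indicator.
        offline_confused = prev_expert     # demos: indicator = expert's last action
        rollout_confused = prev_confused   # rollout: indicator = its own last action

        hits["offline_confused"] += offline_confused == expert
        hits["offline_causal"] += causal == expert
        hits["rollout_confused"] += rollout_confused == expert
        hits["rollout_causal"] += causal == expert

        prev_expert = expert
        prev_confused = rollout_confused

    return {name: count / steps for name, count in hits.items()}


if __name__ == "__main__":
    for name, accuracy in simulate().items():
        print(f"{name}: {accuracy:.3f}")
```

In this toy setting the indicator-copying policy agrees with the expert more often than the causal policy on the demonstration data, because the expert's actions are persistent. In rollout the indicator reflects the policy's own previous action, so it never starts braking and its agreement with the expert collapses, while the causal policy is unaffected.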




Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Robots > Autonomous Vehicles (0.56)