Inverse Q-Learning Done Right: Offline Imitation Learning in $Q^\pi$-Realizable MDPs
