Provable Representation Learning for Imitation Learning via Bi-level Optimization