DERAIL: Diagnostic Environments for Reward And Imitation Learning