There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning