f50a6c02a3fc5a3a5d4d9391f05f3efc-Paper.pdf

Neural Information Processing Systems 

Intoyenvironments, Attainable Utility Preservation (AUP)avoided side effects by penalizing shifts in the ability to achieve randomly generated goals [22]. We scale this approach to large, randomly generated environments based onConway'sGame ofLife.