525b8410cc8612283c9ecaf9a319f8ed-Supplemental.pdf

Neural Information Processing Systems 

The gray agent consistently chooses the cyan object over the yellow object (a) . The same gray agent moves to the preferred cyan object (b).