Appendices Contents Appendices 18

Neural Information Processing Systems 

To investigate further, we ran several instances of FP and SFP from random starting points (i.e. initial policy generated by normalizing uniformly drawn random numbers); results are