PALMER: Perception-Action Loop with Memory for Long-Horizon Planning

Neural Information Processing Systems 

This creates a tight feedback loop between representation learning, memory, reinforcement learning, and sampling-based planning.