Goto

Collaborating Authors

 Reinforcement Learning



TheSensoryNeuronasaTransformer: Permutation-InvariantNeuralNetworksfor ReinforcementLearning

Neural Information Processing Systems

In complex systems, we often observe complex global behavior emerge from a collection of agents interacting with each other in their environment, with each individual agent acting only on locally available information, without knowing thefullpicture.




e9bcd1b063077573285ae1a41025f5dc-Paper.pdf

Neural Information Processing Systems

P2SROisabletoparallelize PSROwith convergence guarantees bymaintaining ahierarchical pipeline ofreinforcement learning workers, each training against the policies generated by lower levels in the hierarchy.