Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning

Neural Information Processing Systems 

We further extend Kaleidoscope to critic ensembles in the context of actor-critic algorithms, which could help improve value estimation.
