






58238e9ae2dd305d79c2ebc8c1883422-Paper.pdf

Neural Information Processing Systems

Additionally, bGPFA uses automatic relevance determination to infer the dimensionality of neural activity directly from the training data during optimization.
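As a rough illustration of how automatic relevance determination can yield a dimensionality estimate, the sketch below assumes each latent dimension carries a learned scale parameter and counts the dimensions whose scale has not shrunk toward zero. The function name `effective_dimensionality`, the relative threshold `rel_tol`, and the example scales are hypothetical and are not taken from bGPFA itself.

```python
from typing import Sequence


def effective_dimensionality(scales: Sequence[float], rel_tol: float = 1e-2) -> int:
    """Count latent dimensions whose learned scale is non-negligible.

    Assumption: training has shrunk the scales of unneeded dimensions
    toward zero, so a dimension counts as "retained" when its scale is
    at least `rel_tol` times the largest scale.
    """
    magnitudes = [abs(s) for s in scales]
    largest = max(magnitudes, default=0.0)
    if largest == 0.0:
        return 0  # every dimension was pruned (or no dimensions given)
    return sum(1 for m in magnitudes if m >= rel_tol * largest)


# Hypothetical scales after optimization: two dimensions survive.
learned_scales = [1.3, 0.9, 0.004, 0.0001]
retained = effective_dimensionality(learned_scales)
```

The relative threshold is a design choice for this sketch only; in practice the cut-off between retained and pruned dimensions is usually read off from a clear gap in the learned scales.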




Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning

Neural Information Processing Systems

This paper proposes an implicit model-based multi-agent reinforcement learning method built on value decomposition. Agents interact with the learned virtual environment and evaluate the current state value according to imagined future states in the latent space, which gives them foresight. The approach can be applied to any multi-agent value decomposition method.
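The idea of scoring the current state from imagined latent futures can be sketched as follows. This is a minimal illustration, not the authors' algorithm: it assumes a VDN-style additive mixer as the value decomposition, and `dynamics`, `reward`, and the per-agent value functions are hypothetical stand-ins for learned models.

```python
from typing import Callable, List, Sequence

Latent = List[float]


def vdn_mix(agent_values: Sequence[float]) -> float:
    """Additive (VDN-style) mixing: joint value is the sum of agent values."""
    return sum(agent_values)


def imagined_value(
    z: Latent,
    dynamics: Callable[[Latent], Latent],
    reward: Callable[[Latent], float],
    agent_value_fns: Sequence[Callable[[Latent], float]],
    horizon: int,
    gamma: float,
) -> float:
    """Estimate the value of latent state `z` from an imagined rollout.

    Rolls the (hypothetical) latent dynamics forward `horizon` steps,
    accumulates discounted imagined rewards, then bootstraps with the
    mixed per-agent values at the final imagined state.
    """
    total = 0.0
    discount = 1.0
    for _ in range(horizon):
        z = dynamics(z)                # imagine the next latent state
        total += discount * reward(z)  # imagined reward at that state
        discount *= gamma
    return total + discount * vdn_mix([v(z) for v in agent_value_fns])


# Toy stand-ins for learned components (assumptions for illustration).
toy_dynamics = lambda z: [0.5 * x for x in z]
toy_reward = lambda z: sum(z)
toy_agents = [lambda z: z[0], lambda z: z[1]]

estimate = imagined_value([2.0, 4.0], toy_dynamics, toy_reward, toy_agents,
                          horizon=2, gamma=0.5)
```

With `horizon=0` the function reduces to the plain decomposed value of the current state, which makes the contribution of the imagined steps easy to isolate.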