Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing
–Neural Information Processing Systems
Learning in real-world multiagent tasks is challenging due to the usual partial observability of each agent. Previous efforts alleviate partial observability by using historical hidden states with Recurrent Neural Networks; however, they do not consider the multiagent characteristics that either the multiagent observation consists of a number of object entities or the action space shows clear entity interactions.
Feb-12-2026, 10:36:30 GMT