SIDE: I Infer the State I Want to Learn
Xu, Zhiwei, Bai, Yunpeng, Li, Dapeng, Zhang, Bin, Fan, Guoliang
–arXiv.org Artificial Intelligence
On the As one of the solutions to the Dec-POMDP problem, the value other hand, in order to extract helpful information from the state of decomposition method has achieved good results recently. However, the complex environment, some work[12, 19] promotes the neural most value decomposition methods require the global state network to learn useful state information by adding auxiliary tasks during training, but this is not feasible in some scenarios where mainly to predict the state of the next moment. Intuitively, the the global state cannot be obtained. Therefore, we propose a novel problem with these studies is in that they cannot be implemented value decomposition framework, named State Inference for value for tasks that cannot obtain real state information. DEcomposition (SIDE), which eliminates the need to know the true As a notorious problem in MAS, Dec-POMDP[25] describes some state by simultaneously seeking solutions to the two problems of collaboration problems.
arXiv.org Artificial Intelligence
May-13-2021
- Country:
- Europe (0.69)
- North America
- Genre:
- Research Report (0.82)
- Industry:
- Leisure & Entertainment > Games > Computer Games (0.47)
- Technology: