Non-Cooperative Inverse Reinforcement Learning
Zhang, Xiangyuan, Zhang, Kaiqing, Miehling, Erik, Basar, Tamer
–Neural Information Processing Systems
Making decisions in the presence of a strategic opponent requires one to take into account the opponent's ability to actively mask its intended objective. To describe such strategic situations, we introduce the non-cooperative inverse reinforcement learning (N-CIRL) formalism. The N-CIRL formalism consists of two agents with completely misaligned objectives, where only one of the agents knows the true objective function. As a result of the one-sided incomplete information, the multi-stage game can be decomposed into a sequence of single- stage games expressed by a recursive formula. Solving this recursive formula yields the value of the N-CIRL game and the more informed player's equilibrium strategy.
Neural Information Processing Systems
Mar-19-2020, 00:30:42 GMT
- Technology: