Non-Cooperative Inverse Reinforcement Learning

Zhang, Xiangyuan, Zhang, Kaiqing, Miehling, Erik, Basar, Tamer

Mar-19-2020, 00:30:42 GMT–Neural Information Processing Systems

Making decisions in the presence of a strategic opponent requires one to take into account the opponent's ability to actively mask its intended objective. To describe such strategic situations, we introduce the non-cooperative inverse reinforcement learning (N-CIRL) formalism. The N-CIRL formalism consists of two agents with completely misaligned objectives, where only one of the agents knows the true objective function. As a result of the one-sided incomplete information, the multi-stage game can be decomposed into a sequence of single- stage games expressed by a recursive formula. Solving this recursive formula yields the value of the N-CIRL game and the more informed player's equilibrium strategy.

formalism, non-cooperative inverse reinforcement learning, recursive formula, (6 more...)

Neural Information Processing Systems

Mar-19-2020, 00:30:42 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)