Non-Cooperative Inverse Reinforcement Learning
–Neural Information Processing Systems
Making decisions in the presence of a strategic opponent requires one to take into account the opponent's ability to actively mask its intended objective. To describe such strategic situations, we introduce the non-cooperative inverse reinforcement learning (N-CIRL) formalism. The N-CIRL formalism consists of two agents with completely misaligned objectives, where only one of the agents knows the true objective function.
Neural Information Processing Systems
Dec-25-2025, 09:53:11 GMT
- Technology: