Distributed Inverse Constrained Reinforcement Learning for Multi-agent Systems
Neural Information Processing Systems
We formally guarantee that the distributed learners asymptotically reach a consensus, and that this consensus belongs to the set of stationary points of the bi-level optimization problem.
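To make the notion of asymptotic consensus concrete, the following is a minimal sketch of a generic consensus-plus-gradient iteration. It is not the paper's algorithm: as a stated assumption, the bi-level problem is replaced by a simple single-level stand-in in which each learner i holds a private quadratic objective f_i(x) = 0.5 * (x - targets[i])^2. The function name `consensus_gradient_descent`, the mixing matrix `weights`, and the step-size schedule 1/k are all hypothetical choices made for illustration.

```python
def consensus_gradient_descent(targets, weights, steps=2000):
    """Illustrative decentralized gradient descent (not the paper's method).

    Assumption: learner i minimizes f_i(x) = 0.5 * (x - targets[i])**2 and
    only exchanges its current estimate with neighbours, as encoded by the
    doubly stochastic mixing matrix `weights`.
    """
    n = len(targets)
    x = [0.0] * n  # one local estimate per learner
    for k in range(1, steps + 1):
        alpha = 1.0 / k  # diminishing step size
        # Consensus step: each learner averages its neighbours' estimates.
        mixed = [sum(weights[i][j] * x[j] for j in range(n)) for i in range(n)]
        # Local step: each learner follows the gradient of its own objective.
        x = [mixed[i] - alpha * (mixed[i] - targets[i]) for i in range(n)]
    return x


if __name__ == "__main__":
    # Four learners on a ring; each row and column of the matrix sums to 1.
    ring = [
        [0.50, 0.25, 0.00, 0.25],
        [0.25, 0.50, 0.25, 0.00],
        [0.00, 0.25, 0.50, 0.25],
        [0.25, 0.00, 0.25, 0.50],
    ]
    estimates = consensus_gradient_descent([1.0, 2.0, 3.0, 6.0], ring)
    print("local estimates:", estimates)
    print("disagreement:", max(estimates) - min(estimates))
```

In this toy setting the local estimates draw together as the step size shrinks, and the common value approaches the minimizer of the summed objectives (the mean of `targets`). For a nonconvex bi-level problem, the analogous guarantee is only that the consensus point is stationary, which is the claim made above.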
Feb-12-2026, 05:20:06 GMT