Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

Jan-15-2025, 11:41:57 GMT–Neural Information Processing Systems

We study a Markov matching market involving a planner and a set of strategic agents on the two sides of the market.At each step, the agents are presented with a dynamical context, where the contexts determine the utilities. The planner controls the transition of the contexts to maximize the cumulative social welfare, while the agents aim to find a myopic stable matching at each step. The proposed algorithm addresses the coupled challenges of sequential exploration, matching stability, and function approximation. We prove that the algorithm achieves sublinear regret.

agent, markov matching market, reinforcement learning, (1 more...)

Neural Information Processing Systems

Jan-15-2025, 11:41:57 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)