Reviews: Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making

Oct-8-2024, 21:06:08 GMT–Neural Information Processing Systems

Summary: This paper reasons about a Pareto optimal social choice function in which the principals seek to agree on how to use a system that acts in a sequential decision-making problem, where the principals may not share the same prior beliefs. The results suggest that, to obtain such a function, the mechanism must over time make choices that favor the principal whose beliefs appear to be more correct.

Quality: The work appears to be correct as far as I have been able to discern. However, I do not like that the proof of the main theorem (Theorem 4) is left out of the main paper, even for the sake of brevity. In my opinion, if the theorem is that important, its proof should be next to it.
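As a rough illustration of the result described in the summary, the sketch below shows one way a mechanism could shift priority toward the principal whose beliefs predict the observations better. It is not taken from the paper: the function name `update_weights`, the multiplicative update rule, and the probabilities 0.8 and 0.4 are all assumptions chosen for illustration.

```python
def update_weights(weights, likelihoods):
    """Reweight principals by how well each one predicted the latest observation.

    weights:     current priority given to each principal (sums to 1).
    likelihoods: probability each principal's beliefs assigned to the
                 observation that actually occurred (hypothetical inputs).
    """
    unnormalized = [w * p for w, p in zip(weights, likelihoods)]
    total = sum(unnormalized)
    if total == 0:
        raise ValueError("every principal assigned zero probability to the observation")
    return [u / total for u in unnormalized]


# Two principals start with equal priority.
weights = [0.5, 0.5]

# Assumed scenario: at each of three steps, principal 0 had assigned
# probability 0.8 to the observed outcome and principal 1 only 0.4.
for _ in range(3):
    weights = update_weights(weights, [0.8, 0.4])

print(weights)
```

Under these assumed numbers, the ratio of the two weights doubles at every step, so the principal with the better predictions ends up with most of the priority.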

negotiable reinforcement learning, pareto optimal sequential decision-making, sequential decision problem, (7 more...)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (1.00)