Goto

Collaborating Authors

 Reinforcement Learning




Off-Policy Selection for Initiating Human-Centric Experimental Design Ge Gao Xi Y ang

Neural Information Processing Systems

Human-centric systems (HCSs), e.g. , used in healthcare facilities [ Given the long testing horizon ( e.g. , several years, or semesters, in healthcare, and IE, respectively) and the high cost of recruiting participants, online testing is considered exceedingly The work was done at North Carolina State University. In this section, we introduce the FPS method, which determines the policy to be deployed to new participants that join an existing cohort, conditioned only on their initial states.


Scalable Constrained Policy Optimization for Safe Multi-agent Reinforcement Learning

Neural Information Processing Systems

A challenging problem in seeking to bring multi-agent reinforcement learning (MARL) techniques into real-world applications, such as autonomous driving and drone swarms, is how to control multiple agents safely and cooperatively to accomplish tasks.