Reinforcement Learning
Off-Policy Selection for Initiating Human-Centric Experimental Design Ge Gao Xi Y ang
Human-centric systems (HCSs), e.g. , used in healthcare facilities [ Given the long testing horizon ( e.g. , several years, or semesters, in healthcare, and IE, respectively) and the high cost of recruiting participants, online testing is considered exceedingly The work was done at North Carolina State University. In this section, we introduce the FPS method, which determines the policy to be deployed to new participants that join an existing cohort, conditioned only on their initial states.