 Planning & Scheduling







Collapsing Bandits and Their Application to Public Health Interventions

Neural Information Processing Systems

Neither (i) nor (ii) are known for general RMABs. Therefore, to capture the scheduling problems addressed in this work, we introduce a new subclass of RMABs, Collapsing Bandits, distinguished by the following feature: when an arm is played, the agent fully observes its state, "collapsing" any uncertainty, but when an arm is passive, no observation is made and uncertainty evolves.
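
The collapsing behaviour described in this excerpt can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a single arm with two states (0 and 1) and a belief equal to the probability that the arm is in state 1. The function names and the transition probabilities `p01` and `p11` are hypothetical and chosen only for illustration.

```python
# Minimal sketch of belief tracking for one two-state arm.
# Assumptions (not from the source): states are 0 and 1, the belief is
# P(state == 1), and p01 / p11 are hypothetical passive transition
# probabilities P(next = 1 | current = 0) and P(next = 1 | current = 1).


def passive_update(belief, p01, p11):
    """Arm is passive: no observation, so the belief evolves one Markov step."""
    return belief * p11 + (1.0 - belief) * p01


def active_update(observed_state):
    """Arm is played: the state is observed, so the belief collapses to 0 or 1."""
    return 1.0 if observed_state == 1 else 0.0


def belief_after_passive_steps(observed_state, steps, p01, p11):
    """Belief after observing the state once and then staying passive."""
    belief = active_update(observed_state)
    for _ in range(steps):
        belief = passive_update(belief, p01, p11)
    return belief


if __name__ == "__main__":
    # Uncertainty grows back after the collapse as the arm stays passive.
    for t in range(4):
        print(t, belief_after_passive_steps(1, t, p01=0.2, p11=0.9))
```

In this sketch the belief is certain immediately after the arm is played and drifts toward the chain's long-run behaviour the longer the arm is left passive.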




6af779991368999ab3da0d366c208fba-Paper-Conference.pdf

Neural Information Processing Systems

Planning enables autonomous agents to solve complex decision-making problems by evaluating predictions of the future. However, classical planning algorithms often become infeasible in real-world settings where state spaces are high-dimensional and transition dynamics unknown.