Goto

Collaborating Authors

 Reinforcement Learning


XDO: ADoubleOracleAlgorithmfor Extensive-FormGames

Neural Information Processing Systems

Policy Space Response Oracles (PSRO) is a reinforcement learning (RL) algorithm for two-player zero-sum games that has been empirically shown to find approximate Nash equilibria in large games.








TowardsTrustworthyAutomaticDiagnosisSystemsby EmulatingDoctors'ReasoningwithDeep ReinforcementLearning

Neural Information Processing Systems

Moreover,doctors explicitly explore severepathologies before potentially ruling them out from the differential, especially in acute care settings. Finally, for doctors to trust a system's recommendations, they need to understand how the gathered evidences led to the predicted diseases.