Goto

Collaborating Authors

 Reinforcement Learning





Time-Constrained Robust MDPs

Neural Information Processing Systems

Traditional robust reinforcement learning often depends on rectangularity assumptions, where adverse probability measures of outcome states are assumed to be independent across different states and actions.


DynamicInverseReinforcementLearningfor CharacterizingAnimalBehavior

Neural Information Processing Systems

While many models have been developed for characterizing behavior in binary decision-making and bandit tasks, comparatively little work has focused onanimal decision-making inmorecomplextasks,suchasnavigationthrough a maze.