Goto

Collaborating Authors

 Reinforcement Learning



Distributional Policy Evaluation: a Maximum Entropy approach to Representation Learning

Neural Information Processing Systems

In Distributional Reinforcement Learning (D-RL) [Bellemare et al., 2023], an agent aims to estimate Sutton and Barto, 2018], where the objective is to predict the expected return only. In Section 3, we answer this methodological question, showing that it is possible to reformulate Policy Evaluation in a distributional setting so that its performance index is explicitly intertwined with the representation of the (state or action) spaces.









A Partially Supervised Reinforcement Learning Framework for Visual Active Search

Neural Information Processing Systems

Moreover, query results (e.g., detected search and rescue activity in a particular region) are highly informative about the locations of target objects in other regions, for example, due to spatial