Goto

Collaborating Authors

 partial identifiability


On the Partial Identifiability in Reward Learning: Choosing the Best Reward

arXiv.org Machine Learning

When the feedback is not informative enough, the target However, in practice, ReL has been successfully applied reward is only partially identifiable, i.e., there only to IL (Ho & Ermon, 2016) and reward design (Christiano exists a set of rewards (the feasible set) that are et al., 2017). The most significant issue that prevents equally-compatible with the feedback. In this paper, the use of ReL algorithms to other applications is partial we show that there exists a choice of reward, identifiability (Cao et al., 2021; Kim et al., 2021; Skalse non-necessarily contained in the feasible set that, et al., 2023b). Indeed, the target reward may not be uniquely depending on the ReL application, improves the determined from the given feedback, but there is a set of reward performance w.r.t.


Partial Identifiability in Inverse Reinforcement Learning For Agents With Non-Exponential Discounting

arXiv.org Artificial Intelligence

The aim of inverse reinforcement learning (IRL) is to infer an agent's preferences from observing their behaviour. Usually, preferences are modelled as a reward function, $R$, and behaviour is modelled as a policy, $\pi$. One of the central difficulties in IRL is that multiple preferences may lead to the same observed behaviour. That is, $R$ is typically underdetermined by $\pi$, which means that $R$ is only partially identifiable. Recent work has characterised the extent of this partial identifiability for different types of agents, including optimal and Boltzmann-rational agents. However, work so far has only considered agents that discount future reward exponentially: this is a serious limitation, especially given that extensive work in the behavioural sciences suggests that humans are better modelled as discounting hyperbolically. In this work, we newly characterise partial identifiability in IRL for agents with non-exponential discounting: our results are in particular relevant for hyperbolical discounting, but they also more generally apply to agents that use other types of (non-exponential) discounting. We significantly show that generally IRL is unable to infer enough information about $R$ to identify the correct optimal policy, which entails that IRL alone can be insufficient to adequately characterise the preferences of such agents.


Partial Identifiability for Domain Adaptation

arXiv.org Artificial Intelligence

Unsupervised domain adaptation is critical to many real-world applications where label information is unavailable in the target domain. In general, without further assumptions, the joint distribution of the features and the label is not identifiable in the target domain. To address this issue, we rely on the property of minimal changes of causal mechanisms across domains to minimize unnecessary influences of distribution shifts. To encode this property, we first formulate the data-generating process using a latent variable model with two partitioned latent subspaces: invariant components whose distributions stay the same across domains and sparse changing components that vary across domains. We further constrain the domain shift to have a restrictive influence on the changing components. Under mild conditions, we show that the latent variables are partially identifiable, from which it follows that the joint distribution of data and labels in the target domain is also identifiable. Given the theoretical insights, we propose a practical domain adaptation framework called iMSDA. Extensive experimental results reveal that iMSDA outperforms state-of-the-art domain adaptation algorithms on benchmark datasets, demonstrating the effectiveness of our framework.