Exploiting Inferential Structure in Neural Processes
Tailor, Dharmesh, Khan, Mohammad Emtiyaz, Nalisnick, Eric
arXiv.org Artificial Intelligence
Neural Processes (NPs) are appealing due to their ability to perform fast adaptation based on a context set. This set is encoded by a latent variable, which is often assumed to follow a simple distribution. However, in real-world settings, the context set may be drawn from richer distributions, for example ones with multiple modes or heavy tails. In this work, we provide a framework that allows the NP's latent variable to be given a rich prior defined by a graphical model. These distributional assumptions directly translate into an appropriate aggregation strategy for the context set. Moreover, we describe a message-passing procedure that still allows for end-to-end optimization with stochastic gradients. We demonstrate the generality of our framework by using mixture and Student-t assumptions that yield improvements in function modelling and test-time robustness.
Jun-26-2023
- Country:
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Asia > Middle East > Jordan (0.04)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Genre:
- Research Report (0.50)