Semi-Parametric Inducing Point Networks and Neural Processes

Rastogi, Richa, Schiff, Yair, Hacohen, Alon, Li, Zhaozhi, Lee, Ian, Deng, Yuntian, Sabuncu, Mert R., Kuleshov, Volodymyr

Mar-30-2023–arXiv.org Artificial Intelligence

We introduce semi-parametric inducing point networks (SPIN), a general-purpose architecture that can query the training set at inference time in a compute-efficient manner. Semi-parametric architectures are typically more compact than parametric models, but their computational complexity is often quadratic. In contrast, SPIN attains linear complexity via a cross-attention mechanism between datapoints inspired by inducing point methods. Querying large training sets can be particularly useful in meta-learning, as it unlocks additional training signal, but often exceeds the scaling limits of existing models. We use SPIN as the basis of the Inducing Point Neural Process, a probabilistic model which supports large contexts in meta-learning and achieves high accuracy where existing models fail. In our experiments, SPIN reduces memory requirements, improves accuracy across a range of meta-learning tasks, and improves state-of-the-art performance on an important practical problem, genotype imputation.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Mar-30-2023

arXiv.org PDF

Add feedback

Country:
- Europe (0.67)
- North America > United States
  - Minnesota (0.28)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (0.67)
  - Performance Analysis > Accuracy (0.46)
  - Statistical Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found