Faster Kernel Interpolation for Gaussian Processes

Mohit Yadav, Daniel Sheldon, Cameron Musco

Jan-27-2021–arXiv.org Artificial Intelligence

A key challenge in scaling Gaussian process (GP) regression to massive datasets is that exact inference requires computation with a dense n × n kernel matrix, where n is the number of data points. Significant work focuses on approximating the kernel matrix via interpolation using a smaller set of m inducing points. Structured kernel interpolation (SKI) is among the most scalable of these methods: by placing inducing points on a dense grid and using structured matrix algebra, SKI achieves a per-iteration time of O(n + m log m) for approximate inference. This linear scaling in n enables inference on very large datasets; however, the O(n) cost is incurred at every iteration, which remains a limitation for extremely large n. We show that the SKI per-iteration time can be reduced to O(m log m) after a single O(n)-time precomputation step by reframing SKI as solving a natural Bayesian linear regression problem with a fixed set of m compact basis functions. Because the per-iteration complexity is then independent of the dataset size n for a fixed grid, our method scales to truly massive datasets. We demonstrate speedups in practice for a wide range of m and n, and apply the method to GP inference on a three-dimensional weather radar dataset with over 100 million points.
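To make the reframing concrete, the following is a minimal one-dimensional sketch, not the authors' implementation. It assumes the usual SKI approximation K ≈ W K_G Wᵀ, where W is an n × m interpolation matrix and K_G is the kernel on the grid. It then treats the model as Bayesian linear regression y = Wθ + ε with prior θ ~ N(0, K_G). The names `rbf_kernel` and `interp_weights`, the linear interpolation scheme, the RBF kernel, and all numeric settings are illustrative assumptions. Dense solves are used for clarity; the O(m log m) per-iteration cost described in the abstract would additionally require an iterative solver and structured (FFT-based) products with K_G, which this sketch omits.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=0.2):
    # Squared-exponential kernel between two 1-D point sets (illustrative choice).
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def interp_weights(x, grid):
    # Dense n x m linear-interpolation matrix onto a uniform grid.
    # Each row has at most two nonzeros; a sparse format would be used in practice.
    m = grid.size
    h = grid[1] - grid[0]
    pos = (x - grid[0]) / h
    left = np.clip(np.floor(pos).astype(int), 0, m - 2)
    frac = pos - left
    W = np.zeros((x.size, m))
    rows = np.arange(x.size)
    W[rows, left] = 1.0 - frac
    W[rows, left + 1] = frac
    return W

rng = np.random.default_rng(0)
n, m, noise = 1000, 30, 0.1          # hypothetical sizes and noise level
x = rng.uniform(0.0, 1.0, size=n)
y = np.sin(6.0 * x) + noise * rng.standard_normal(n)

grid = np.linspace(0.0, 1.0, m)
W = interp_weights(x, grid)
K_G = rbf_kernel(grid, grid)

# One-time pass over the data: m x m and length-m sufficient statistics.
WtW = W.T @ W
Wty = W.T @ y

# Everything below touches only m-dimensional quantities.
# Posterior mean of theta: K_G (W^T W K_G + sigma^2 I)^{-1} W^T y.
theta_mean = K_G @ np.linalg.solve(WtW @ K_G + noise**2 * np.eye(m), Wty)

# Reference: SKI posterior mean at the training inputs via the n x n system.
K_ski = W @ K_G @ W.T
alpha = np.linalg.solve(K_ski + noise**2 * np.eye(n), y)
mean_ski = K_ski @ alpha

# Same mean, recovered from the m-dimensional solution.
mean_blr = W @ theta_mean
max_diff = np.max(np.abs(mean_ski - mean_blr))
```

The two means agree up to numerical error because Wᵀ(W K_G Wᵀ + σ²I)⁻¹ = (WᵀW K_G + σ²I)⁻¹Wᵀ, so once WᵀW and Wᵀy are stored, the data of size n is no longer needed to solve for the posterior mean on the grid.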

gaussian process, inference, multiplication

Country:
- North America
  - United States > Massachusetts
    - Hampshire County > Amherst (0.04)
  - Canada > Ontario
    - Toronto (0.14)

Genre:
- Research Report (0.64)

Technology:
- Information Technology
  - Data Science (0.93)
  - Artificial Intelligence
    - Representation & Reasoning > Uncertainty (0.88)
    - Machine Learning > Statistical Learning
      - Regression (0.48)
