Stochastic Variational Deep Kernel Learning

Wilson, Andrew G., Hu, Zhiting, Salakhutdinov, Russ R., Xing, Eric P.

Feb-14-2020, 11:56:40 GMT–Neural Information Processing Systems

Deep kernel learning combines the non-parametric flexibility of kernel methods with the inductive biases of deep learning architectures. We propose a novel deep kernel learning model and stochastic variational inference procedure that generalizes deep kernel learning approaches to enable classification, multi-task learning, additive covariance structures, and stochastic gradient training. Specifically, we apply additive base kernels to subsets of output features from deep neural architectures, and jointly learn the parameters of the base kernels and deep network through a Gaussian process marginal likelihood objective. Within this framework, we derive an efficient form of stochastic variational inference which leverages local kernel interpolation, inducing points, and structure-exploiting algebra. We show improved performance over standalone deep networks, SVMs, and state-of-the-art scalable Gaussian processes on several classification benchmarks, including an airline delay dataset containing 6 million training points, CIFAR, and ImageNet.
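The additive structure described in the abstract can be illustrated with a minimal sketch. It assumes a tiny tanh MLP with random weights as a stand-in for the deep architecture and RBF base kernels with unit signal variance; the names `mlp_features`, `rbf_kernel`, and `additive_deep_kernel` are hypothetical and not taken from the paper. The sketch only builds the covariance matrix; the joint training through the marginal likelihood and the stochastic variational inference procedure are not shown.

```python
import numpy as np

def mlp_features(x, weights):
    """Hypothetical stand-in for a deep network: tanh hidden layers, linear output."""
    h = x
    for w, b in weights[:-1]:
        h = np.tanh(h @ w + b)
    w, b = weights[-1]
    return h @ w + b

def rbf_kernel(a, b, lengthscale=1.0):
    """RBF base kernel with unit signal variance (an assumption of this sketch)."""
    sq = (np.sum(a**2, axis=1)[:, None]
          + np.sum(b**2, axis=1)[None, :]
          - 2.0 * a @ b.T)
    # Clamp tiny negative values caused by floating-point round-off.
    return np.exp(-0.5 * np.maximum(sq, 0.0) / lengthscale**2)

def additive_deep_kernel(x1, x2, weights, n_groups, lengthscales):
    """Sum of base kernels, each applied to one subset of the network's output features."""
    f1 = mlp_features(x1, weights)
    f2 = mlp_features(x2, weights)
    groups = np.array_split(np.arange(f1.shape[1]), n_groups)
    k = np.zeros((x1.shape[0], x2.shape[0]))
    for idx, ell in zip(groups, lengthscales):
        k += rbf_kernel(f1[:, idx], f2[:, idx], ell)
    return k

# Toy data and random network weights (illustrative values only).
rng = np.random.default_rng(0)
sizes = [5, 16, 6]  # input dim, hidden width, number of output features
weights = [(rng.normal(size=(m, n)) / np.sqrt(m), np.zeros(n))
           for m, n in zip(sizes[:-1], sizes[1:])]
x = rng.normal(size=(8, 5))

# Three base kernels, each acting on two of the six output features.
K = additive_deep_kernel(x, x, weights, n_groups=3, lengthscales=[1.0, 1.0, 1.0])
```

In a full model, the network weights and the lengthscales would be treated as kernel hyperparameters and learned together, which is the joint training the abstract refers to.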

deep network, gaussian process, stochastic variational deep kernel learning



Technology:
- Information Technology > Artificial Intelligence
  - Systems & Languages > Problem-Independent Architectures (0.66)
  - Machine Learning
    - Statistical Learning (0.66)
    - Neural Networks > Deep Learning (0.30)