Scaling Gaussian Process Regression with Derivatives

David Eriksson, Kun Dong, Eric Lee, David Bindel, Andrew G. Wilson

Neural Information Processing Systems 

Computing the model fit term, as well as the predictive moments of the GP, requires solving linear systems with the kernel matrix, while the complexity term, or Occam's factor [18], is the log determinant of the kernel matrix.
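To make the two terms concrete, the following is a minimal sketch of how the log marginal likelihood of a zero-mean GP splits into a model fit term (a linear solve) and a complexity term (a log determinant). It uses a dense Cholesky factorization as a simple baseline, which is not necessarily the approach developed in this paper. The RBF kernel, the noise variance `noise`, and the function names `rbf_kernel` and `log_marginal_likelihood_terms` are illustrative assumptions rather than anything defined in the text.

```python
import numpy as np


def rbf_kernel(X, lengthscale=1.0, variance=1.0):
    """Squared-exponential kernel matrix (an assumed, illustrative kernel)."""
    sq_norms = np.sum(X**2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T
    # Clip tiny negative values caused by floating-point round-off.
    sq_dists = np.maximum(sq_dists, 0.0)
    return variance * np.exp(-0.5 * sq_dists / lengthscale**2)


def log_marginal_likelihood_terms(K, y, noise=1e-2):
    """Return (model_fit, complexity, constant) for a zero-mean GP.

    Their sum is log p(y | X) with K_noisy = K + noise * I, where
    `noise` is an assumed observation-noise variance.
    """
    n = y.shape[0]
    K_noisy = K + noise * np.eye(n)

    # Dense Cholesky factorization K_noisy = L L^T, costing O(n^3).
    L = np.linalg.cholesky(K_noisy)

    # Model fit term: requires solving a linear system with the kernel matrix.
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    model_fit = -0.5 * y @ alpha

    # Complexity term: log det(K_noisy) = 2 * sum(log(diag(L))).
    complexity = -np.sum(np.log(np.diag(L)))

    constant = -0.5 * n * np.log(2.0 * np.pi)
    return model_fit, complexity, constant


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(50, 2))
    y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=50)
    fit, comp, const = log_marginal_likelihood_terms(rbf_kernel(X), y)
    print("model fit:", fit, "complexity:", comp, "total:", fit + comp + const)
```

Both terms come from the same factorization here, so the cubic cost of the Cholesky step dominates as the kernel matrix grows.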