Supervised Learning with Tensor Networks

Stoudenmire, Edwin, Schwab, David J.

Neural Information Processing Systems 

Tensor networks are approximations of high-order tensors which are efficient to work with and have been very successful for physics and mathematics applications. We demonstrate how algorithms for optimizing tensor networks can be adapted to supervised learning tasks by using matrix product states (tensor trains) to parameterize non-linear kernel learning models. For the MNIST data set we obtain less than 1% test set classification error. We discuss an interpretation of the additional structure imparted by the tensor network to the learned model. Papers published at the Neural Information Processing Systems Conference.