Incremental Spectral Sparsification for Large-Scale Graph-Based Semi-Supervised Learning

Calandriello, Daniele, Lazaric, Alessandro, Valko, Michal, Koutis, Ioannis

Jan-21-2016, arXiv.org Machine Learning

While the harmonic function solution performs well in many semi-supervised learning (SSL) tasks, it is known to scale poorly with the number of samples. Recent successful and scalable methods, such as the eigenfunction method, focus on efficiently approximating the whole spectrum of the graph Laplacian constructed from the data. This is in contrast to various subsampling and quantization methods proposed in the past, which may fail to preserve the graph spectrum. However, the impact of the spectral approximation on the final generalization error is either unknown or requires strong assumptions on the data. In this paper, we introduce Sparse-HFS, an efficient edge-sparsification algorithm for SSL. By constructing an edge-sparse and spectrally similar graph, we are able to leverage the approximation guarantees of spectral sparsification methods to bound the generalization error of Sparse-HFS. As a result, we obtain a theoretically grounded approximation scheme for graph-based SSL that also empirically matches the performance of known large-scale methods.
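For orientation, the following is a minimal sketch of the classic harmonic function solution that the abstract takes as its starting point. It is not Sparse-HFS and is not taken from the paper; the function name `harmonic_solution`, its arguments, and the toy path graph are assumptions made for illustration. The sketch builds the unnormalized Laplacian L = D - W and solves L_uu f_u = -L_ul f_l for the unlabeled nodes.

```python
import numpy as np


def harmonic_solution(W, labeled_idx, y_labeled):
    """Harmonic function solution on a weighted graph (illustrative sketch).

    W           : (n, n) symmetric, non-negative weight matrix
    labeled_idx : indices of the labeled nodes
    y_labeled   : labels of those nodes, in the same order
    Returns the length-n vector of scores f.
    """
    W = np.asarray(W, dtype=float)
    y_labeled = np.asarray(y_labeled, dtype=float)
    labeled_idx = np.asarray(labeled_idx)
    n = W.shape[0]
    unlabeled_idx = np.setdiff1d(np.arange(n), labeled_idx)

    # Unnormalized graph Laplacian L = D - W.
    L = np.diag(W.sum(axis=1)) - W

    # Blocks of L restricted to unlabeled/unlabeled and unlabeled/labeled nodes.
    L_uu = L[np.ix_(unlabeled_idx, unlabeled_idx)]
    L_ul = L[np.ix_(unlabeled_idx, labeled_idx)]

    f = np.zeros(n)
    f[labeled_idx] = y_labeled
    # Solve L_uu f_u = -L_ul f_l; this dense solve is the step that scales poorly.
    f[unlabeled_idx] = np.linalg.solve(L_uu, -L_ul @ y_labeled)
    return f


# Toy example (assumed data): a path graph on 5 nodes with labeled endpoints.
W = np.zeros((5, 5))
for i in range(4):
    W[i, i + 1] = W[i + 1, i] = 1.0

f = harmonic_solution(W, labeled_idx=[0, 4], y_labeled=[1.0, -1.0])
print(f)
```

On this path graph the scores interpolate linearly between the two labeled endpoints, since each unlabeled node takes the average of its neighbors. The linear system has one unknown per unlabeled node, which suggests why a sparser graph with a similar spectrum is attractive at scale.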

artificial intelligence, inductive learning, sparse-hf

Country:
- North America > United States
  - Maryland (0.14)
  - Wisconsin (0.14)

Genre:
- Research Report (0.64)

Industry:
- Education (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Inductive Learning (0.62)
  - Statistical Learning (0.68)
  - Unsupervised or Indirectly Supervised Learning (0.71)
