Learning Manifolds with K-Means and K-Flats
Canas, Guillermo D., Poggio, Tomaso, Rosasco, Lorenzo
Our study is broadly motivated by questions in high-dimensional learning. As is well known, learning in high dimensions is feasible only if the data distribution satisfies suitable prior assumptions. One such assumption is that the data distribution lies on, or is close to, a low-dimensional set embedded in a high dimensional space, for instance a low dimensional manifold. This latter assumption has proved to be useful in practice, as well as amenable to theoretical analysis, and it has led to a significant amount of recent work. Starting from [29, 40, 7], this set of ideas, broadly referred to as manifold learning, has been applied to a variety of problems from supervised [42] and semi-supervised learning [8], to clustering [45] and dimensionality reduction [7], to name a few. Interestingly, the problem of learning the manifold itself has received less attention: given samples from a d-manifold M embedded in some ambient space X, the problem is to learn a set that approximates M in a suitable sense. This problem has been considered in computational geometry, but in a setting in which typically the manifold is a hyper-surface in a low-dimensional space (e.g.
Feb-19-2013
- Country:
- North America > United States > Massachusetts (0.28)
- Genre:
- Research Report (0.50)
- Industry:
- Education (0.34)
- Technology: