Provable Convex Co-clustering of Tensors
Chi, Eric C., Gaines, Brian R., Sun, Will Wei, Zhou, Hua, Yang, Jian
Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, there is a gap between statistical guarantees and computational efficiency for existing tensor clustering solutions due to the nature of their non-convex formulations. In this work, we bridge this gap by developing a provable convex formulation of tensor co-clustering. Our convex co-clustering (CoCo) estimator enjoys stability guarantees and is both computationally and storage efficient. We further establish a non-asymptotic error bound for the CoCo estimator, which reveals a surprising "blessing of dimensionality" phenomenon that does not exist in vector or matrix-variate cluster analysis. Our theoretical findings are supported by extensive simulated studies. Finally, we apply the CoCo estimator to the cluster analysis of advertisement click tensor data from a major online company. Our clustering results provide meaningful business insights to improve advertising effectiveness.
Mar-17-2018
- Country:
- North America > United States > California > Los Angeles County > Los Angeles (0.27)
- Genre:
- Research Report (1.00)
- Industry:
- Marketing (0.69)
- Information Technology (0.66)
- Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
- Government > Regional Government
- North America Government (0.46)
- Technology: