Plotting

 Chao Pan



Query K-means Clustering and the Double Dixie Cup Problem

Neural Information Processing Systems

We consider the problem of approximate K-means clustering with outliers and side information provided by same-cluster queries and possibly noisy answers.


Query K-means Clustering and the Double Dixie Cup Problem

Neural Information Processing Systems

We consider the problem of approximate K-means clustering with outliers and side information provided by same-cluster queries and possibly noisy answers.


Group Additive Structure Identification for Kernel Nonparametric Regression

Neural Information Processing Systems

The additive model is one of the most popularly used models for high dimensional nonparametric regression analysis. However, its main drawback is that it neglects possible interactions between predictor variables. In this paper, we reexamine the group additive model proposed in the literature, and rigorously define the intrinsic group additive structure for the relationship between the response variable Y and the predictor vector X, and further develop an effective structure-penalized kernel method for simultaneous identification of the intrinsic group additive structure and nonparametric function estimation. The method utilizes a novel complexity measure we derive for group additive structures. We show that the proposed method is consistent in identifying the intrinsic group additive structure. Simulation study and real data applications demonstrate the effectiveness of the proposed method as a general tool for high dimensional nonparametric regression.