Handling Missing Data with Variational Bayesian Learning of ICA
Chan, Kwokleung, Lee, Te-Won, Sejnowski, Terrence J.
–Neural Information Processing Systems
Missing data is common in real-world datasets and is a problem for many estimation techniques. We have developed a variational Bayesian method to perform Independent Component Analysis (ICA) on high-dimensional data containing missing entries. Missing data are handled naturally in the Bayesian framework by integrating the generative density model. Modeling the distributions of the independent sources with mixture of Gaussians allows sources to be estimated with different kurtosis and skewness. The variational Bayesian method automatically determines the dimensionality of the data and yields an accurate density model for the observed data without overfitting problems. This allows direct probability estimation of missing values in the high dimensional space and avoids dimension reduction preprocessing which is not feasible with missing data.
Neural Information Processing Systems
Dec-31-2003
- Country:
- North America > United States > California > San Diego County (0.14)
- Industry:
- Health & Medicine (0.47)