Flexible Clustering with a Sparse Mixture of Generalized Hyperbolic Distributions
Gallaugher, Michael P. B., Tang, Yang, McNicholas, Paul D.
Robust clustering of high-dimensional data is an important topic because, in many practical situations, real data sets are heavy-tailed and/or asymmetric. Moreover, traditional model-based clustering often fails for high dimensional data due to the number of free covariance parameters. A parametrization of the component scale matrices for the mixture of generalized hyperbolic distributions is proposed by including a penalty term in the likelihood constraining the parameters resulting in a flexible model for high dimensional data and a meaningful interpretation. An analytically feasible EM algorithm is developed by placing a gamma-Lasso penalty constraining the concentration matrix. The proposed methodology is investigated through simulation studies and two real data sets.
Mar-12-2019
- Country:
- North America
- Canada > Ontario
- Hamilton (0.14)
- United States > California (0.28)
- Canada > Ontario
- North America
- Genre:
- Research Report (0.64)
- Industry:
- Health & Medicine (1.00)
- Technology: