Multivariate Beta Mixture Model: Probabilistic Clustering With Flexible Cluster Shapes
Hsu, Yung-Peng, Chen, Hung-Hsuan
–arXiv.org Artificial Intelligence
Data clustering groups data points into components so that similar points are within the same component. Data clustering is commonly used for data exploration and is sometimes used as a preprocessing step for later analysis [1]. In this paper, the multivariate beta mixture model (MBMM), a new probabilistic model for soft clustering, is proposed. As the MBMM is a mixture model, it shares many properties with the Gaussian mixture model (GMM), including its soft cluster assignment and parametric modeling. In addition, the MBMM allows the generation of new (synthetic) instances based on a generative process. Because the beta distribution is highly flexible (e.g., unimodal, bimodal, straight line, or exponentially increasing or decreasing), MBMM can fit data with versatile shapes.
arXiv.org Artificial Intelligence
Jan-29-2024