Generalized Identifiability Bounds for Mixture Models with Grouped Samples

Vandermeulen, Robert A., Saitenmacher, René

Jul-22-2022–arXiv.org Artificial Intelligence

Recent work has shown that finite mixture models with $m$ components are identifiable, while making no assumptions on the mixture components, so long as one has access to groups of samples of size $2m-1$ which are known to come from the same mixture component. In this work we generalize that result and show that, if every subset of $k$ mixture components of a mixture model are linearly independent, then that mixture model is identifiable with only $(2m-1)/(k-1)$ samples per group. We further show that this value cannot be improved. We prove an analogous result for a stronger form of identifiability known as "determinedness" along with a corresponding lower bound. This independence assumption almost surely holds if mixture components are chosen randomly from a $k$-dimensional space. We describe some implications of our results for multinomial mixture models and topic modeling.

artificial intelligence, linearly independent, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Jul-22-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania (0.04)
  - North Carolina (0.04)
  - New York > New York County
    - New York City (0.04)
  - California > Los Angeles County
    - Long Beach (0.04)
  - Arizona > Maricopa County
    - Scottsdale (0.04)
- Europe
  - Germany > Berlin (0.04)
  - Iceland > Capital Region
    - Reykjavik (0.04)

Genre:
- Research Report > New Finding (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found