Probabilistic Abstraction Hierarchies

Segal, Eran, Koller, Daphne, Ormoneit, Dirk

Dec-31-2002–Neural Information Processing Systems

Many domains are naturally organized in an abstraction hierarchy or taxonomy, where the instances in "nearby" classes in the taxonomy are similar. In this paper, we provide a general probabilistic framework for clustering data into a set of classes organized as a taxonomy, where each class is associated with a probabilistic model from which the data was generated. The clustering algorithm simultaneously optimizes three things: the assignment of data instances to clusters, the models associated with the clusters, and the structure of the abstraction hierarchy. A unique feature of our approach is that it utilizes global optimization algorithms for both of the last two steps, reducing the sensitivity to noise and the propensity to local maxima that are characteristic of algorithms such as hierarchical agglomerative clustering that only take local steps. We provide a theoretical analysis for our algorithm, showing that it converges to a local maximum of the joint likelihood of model and data.
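The three-way alternation described in the abstract can be illustrated with a toy sketch. This is not the paper's algorithm: it assumes unit-variance Gaussian class models, hard assignments, a tree drawn directly over the classes, and a hypothetical penalty weight `lam` on the squared distance between tree-adjacent class means. All function names are invented for illustration. Under these assumptions the model update is a linear solve and the structure update is a minimum spanning tree, so each of the last two steps is solved to its global optimum given the other quantities.

```python
import numpy as np

def mst_edges(means):
    """Prim's algorithm over squared distances between class means."""
    k = len(means)
    d = ((means[:, None, :] - means[None, :, :]) ** 2).sum(-1)
    in_tree, edges = [0], []
    while len(in_tree) < k:
        best = None
        for a in in_tree:
            for b in range(k):
                if b not in in_tree and (best is None or d[a, b] < d[best]):
                    best = (a, b)
        edges.append(best)
        in_tree.append(best[1])
    return edges

def objective(x, z, means, edges, lam):
    """Data misfit plus a penalty tying tree-adjacent class means together."""
    fit_term = ((x - means[z]) ** 2).sum()
    tie_term = sum(((means[a] - means[b]) ** 2).sum() for a, b in edges)
    return fit_term + lam * tie_term

def update_means(x, z, edges, k, lam):
    """Exact minimizer over the means: solve (N + lam * L) M = S,
    where N holds class counts, L is the tree Laplacian, S the class sums."""
    counts = np.bincount(z, minlength=k).astype(float)
    sums = np.zeros((k, x.shape[1]))
    np.add.at(sums, z, x)
    lap = np.zeros((k, k))
    for a, b in edges:
        lap[a, a] += 1.0
        lap[b, b] += 1.0
        lap[a, b] -= 1.0
        lap[b, a] -= 1.0
    return np.linalg.solve(np.diag(counts) + lam * lap, sums)

def fit_hierarchy(x, k=3, lam=0.5, iters=10, seed=0):
    """Alternate: assign instances, refit class models, rebuild the tree."""
    rng = np.random.default_rng(seed)
    means = x[rng.choice(len(x), size=k, replace=False)].copy()
    edges = mst_edges(means)
    history = []
    for _ in range(iters):
        # Step 1: assign each instance to its closest class model.
        z = ((x[:, None, :] - means[None, :, :]) ** 2).sum(-1).argmin(1)
        # Step 2: refit all class models jointly, given assignments and tree.
        means = update_means(x, z, edges, k, lam)
        # Step 3: rebuild the tree, given the class models.
        edges = mst_edges(means)
        history.append(objective(x, z, means, edges, lam))
    return z, means, edges, history
```

Because no step can increase the penalized cost in this sketch, the values recorded in `history` are non-increasing. That mirrors, in a much simpler setting, the kind of convergence statement made in the abstract; it says nothing about the quality of the local optimum reached.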

artificial intelligence, hierarchy, machine learning

Genre:
- Research Report > New Finding (0.68)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.95)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Optimization (1.00)
    - Uncertainty > Bayesian Inference (0.47)
  - Machine Learning
    - Statistical Learning > Clustering (0.66)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.47)