On learning the structure of Bayesian Networks and submodular function maximization

Caravagna, Giulio, Ramazzotti, Daniele, Sanguinetti, Guido

Jun-7-2017–arXiv.org Machine Learning

Learning the structure of dependencies among multiple random variables is a problem of considerable theoretical and practical interest. Score optimisation with multiple restarts is a surprisingly successful solution in practice, yet the conditions under which it is a well-founded strategy are poorly understood. In this paper, we prove that the problem of identifying the structure of a Bayesian Network via regularised score optimisation can be recast, in expectation, as a submodular optimisation problem, thus guaranteeing optimality with high probability. This result both explains the practical success of optimisation heuristics and suggests a way to improve on such algorithms by artificially simulating multiple data sets via a bootstrap procedure. We show on several synthetic data sets that the resulting algorithm yields better recovery performance than the state of the art, and illustrate in a real cancer genomics study how such an approach can lead to valuable practical insights.
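For readers unfamiliar with submodular maximisation, the following is a minimal sketch of the classical greedy procedure on a toy set-coverage function. It is not the algorithm of the paper: the names `greedy_maximize`, `coverage` and `SETS` are hypothetical, and the coverage objective is only a stand-in for a regularised network score. It illustrates why the submodular framing matters: for a monotone submodular function under a cardinality constraint, this simple greedy loop is known to reach at least a (1 - 1/e) fraction of the optimum.

```python
from typing import Callable, FrozenSet, Iterable, Set


def greedy_maximize(
    ground_set: Iterable[str],
    f: Callable[[FrozenSet[str]], float],
    k: int,
) -> Set[str]:
    """Greedily pick up to k items, each time adding the largest marginal gain."""
    selected: Set[str] = set()
    for _ in range(k):
        current = f(frozenset(selected))
        best_item, best_gain = None, 0.0
        # Sort the candidates so that ties are broken deterministically.
        for item in sorted(set(ground_set) - selected):
            gain = f(frozenset(selected | {item})) - current
            if gain > best_gain:
                best_item, best_gain = item, gain
        if best_item is None:  # no candidate improves the objective
            break
        selected.add(best_item)
    return selected


# Toy objective (an assumption for illustration): number of elements covered.
# Coverage functions are monotone and submodular.
SETS = {"a": {1, 2, 3}, "b": {3, 4}, "c": {4, 5, 6, 7}, "d": {1, 7}}


def coverage(chosen: FrozenSet[str]) -> float:
    covered: Set[int] = set()
    for name in chosen:
        covered |= SETS[name]
    return float(len(covered))


picked = greedy_maximize(SETS.keys(), coverage, 2)
print(sorted(picked), coverage(frozenset(picked)))
```

Diminishing returns are visible in the toy data: adding `b` to the empty set gains 2 covered elements, whereas adding `b` after `a` gains only 1.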

artificial intelligence, hill climbing, machine learning


Country:
- North America > United States (0.28)

Genre:
- Research Report
  - New Finding (0.68)
  - Experimental Study (0.47)

Industry:
- Health & Medicine
  - Therapeutic Area > Oncology (1.00)
  - Pharmaceuticals & Biotechnology (0.88)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Uncertainty > Bayesian Inference (1.00)
    - Search (1.00)
    - Optimization (1.00)
  - Machine Learning
    - Performance Analysis > Accuracy (1.00)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (1.00)
