Using Tarjan's Red Rule for Fast Dependency Tree Construction

Dec-31-2003–Neural Information Processing Systems

We focus on the problem of efficient learning of dependency trees. It is well-known that given the pairwise mutual information coefficients, a minimum-weight spanning tree algorithm solves this problem exactly and in polynomial time. However, for large data-sets it is the construction ofthe correlation matrix that dominates the running time. We have developed a new spanning-tree algorithm which is capable of exploiting partial knowledge about edge weights. The partial knowledge we maintain isa probabilistic confidence interval on the coefficients, which we derive by examining just a small sample of the data. The algorithm is able to flag the need to shrink an interval, which translates to inspection ofmore data for the particular attribute pair. Experimental results show running time that is near-constant in the number of records, without significantloss in accuracy of the generated trees. Interestingly, our spanning-tree algorithm is based solely on Tarjan's red-edge rule, which is generally considered a guaranteed recipe for bad performance.

algorithm, artificial intelligence, data mining, (16 more...)

Neural Information Processing Systems

Dec-31-2003

Conferences PDF

Add feedback

Country:
- North America > United States
  - California > San Francisco County
    - San Francisco (0.14)
  - Colorado (0.14)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.14)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Learning Graphical Models
      - Directed Networks > Bayesian Learning (1.00)
    - Representation & Reasoning > Uncertainty (0.69)
  - Data Science > Data Mining (0.68)

Duplicate Docs Excel Report

Title
Using Tarjan's Red Rule for Fast Dependency Tree Construction
Using Tarjan's Red Rule for Fast Dependency Tree Construction

Similar Docs Excel Report more

Title	Similarity	Source
None found