Jensen, David
Learning to Uncover Deep Musical Structure
Kirlin, Phillip (Rhodes College) | Jensen, David (University of Massachusetts Amherst)
The overarching goal of music theory is to explain the inner workings of a musical composition by examining the structure of the composition. Schenkerian music theory supposes that Western tonal compositions can be viewed as hierarchies of musical objects. The process of Schenkerian analysis reveals this hierarchy by identifying connections between notes or chords of a composition that illustrate both the small- and large-scale construction of the music. We present a new probabilistic model of this variety of music analysis, details of how the parameters of the model can be learned from a corpus, an algorithm for deriving the most probable analysis for a given piece of music, and both quantitative and human-based evaluations of the algorithm's performance. This represents the first large-scale data-driven computational approach to hierarchical music analysis.
Strategy Mining
Xu, Xiaoxi (University of Massachusetts, Amherst) | Jensen, David (University of Massachusetts, Amherst) | Rissland, Edwina L. (University of Massachusetts, Amherst)
Strategy mining is a new area of research about discovering strategies for decision-making. It is motivated by how similarity is assessed in retrospect in law. In the legal domain, when both case facts and court decisions are present, it is often useful to assess similarity by accounting for both case facts and case outcomes. In this paper, we formulate the strategy mining problem as a clustering problem with the goal of finding clusters that represent disparate conditional dependency of decision labels on other features. Existing clustering algorithms are inappropriate to cluster dependency because they either assume feature independence, such as K-means, or only consider the co-occurrence of features without explicitly modeling the special dependency of the decision label on other features, such as Latent Dirichlet Allocation (LDA). We propose an Expectation Maximization (EM) style unsupervised learning algorithm for dependency clustering. Like EM, our algorithm is grounded in statistical learning theory. It minimizes the empirical risk of decision tree learning. Unlike other clustering algorithms, our algorithm is irrelevant-feature resistant, and its learned clusters modeled by decision trees are strongly interpretable and predictive. We systematically evaluate both the convergence property and solution quality of our algorithm using a common law dataset comprised of actual cases. Experimental results show that our algorithm significantly outperforms K-means and LDA on clustering dependency
Reasoning about Independence in Probabilistic Models of Relational Data
Maier, Marc, Marazopoulou, Katerina, Jensen, David
We extend the theory of d-separation to cases in which data instances are not independent and identically distributed. We show that applying the rules of d-separation directly to the structure of probabilistic models of relational data inaccurately infers conditional independence. We introduce relational d-separation, a theory for deriving conditional independence facts from relational models. We provide a new representation, the abstract ground graph, that enables a sound, complete, and computationally efficient method for answering d-separation queries about relational models, and we present empirical results that demonstrate effectiveness.
Invited Talks
Doyle, Richard J. (NASA Jet Propulsion Laboratory) | Dumontier, Michel (Stanford University) | Hirsh, Haym (Cornell University) | Jensen, David (University of Massachusetts at Amherst) | Karp, Peter (SRI International) | Monteleoni, Claire (George Washington University) | Obradovic, Zoen (Temple University) | Re, Christopher (Stanford University) | Rzhetsky, Andrey (University of Chicago) | Wagstaff, Kiri L. (NASA Jet Propulsion Laboratory)
Abstracts of the invited talks presented at the AAAI Fall Symposium on Discovery Informatics: AI Takes a Science-Centered View on Big Data. Talks includeย A Data Lifecycle Approach to Discovery Informatics,ย Generating Biomedical Hypotheses Using Semantic Web Technologies,ย Socially Intelligent Science, Representing and Reasoning with Experimental and Quasi-Experimental Designs, Bioinformatics Computation of Metabolic Models from Sequenced Genomes, Climate Informatics: Recent Advances and Challenge Problems for Machine Learning in Climate Science,ย Predictive Modeling of Patient State and Therapy Optimization, Case Studies in Data-Driven Systems: Building Carbon Maps to Finding Neutrinos,ย Computational Analysis of Complex Human Disorders, and Look at This Gem: Automated Data Prioritization for Scientific Discovery of Exoplanets, Mineral Deposits, and More.
Identifying Independence in Relational Models
Maier, Marc, Jensen, David
The rules of d-separation provide a framework for deriving conditional independence facts from model structure. However, this theory only applies to simple directed graphical models. We introduce relational d-separation, a theory for deriving conditional independence in relational models. We provide a sound, complete, and computationally efficient method for relational d-separation, and we present empirical results that demonstrate effectiveness.
Relational Blocking for Causal Discovery
Rattigan, Matthew (University of Massachusetts Amherst) | Maier, Marc (University of Massachusetts Amherst) | Jensen, David (University of Massachusetts Amherst)
Blocking is a technique commonly used in manual statistical analysis to account for confounding variables. However, blocking is not currently used in automated learning algorithms. These algorithms rely solely on statistical conditioning as an operator to identify conditional independence. In this work, we present relational blocking as a new operator that can be used for learning the structure of causal models. We describe how blocking is enabled by relational data sets, where blocks are determined by the links in the network. By blocking on entities rather than conditioning on variables, relational blocking can account for both measured and unobserved variables. We explain the mechanism of these methods using graphical models and the semantics of d-separation. Finally, we demonstrate the effectiveness of relational blocking for use in causal discovery by showing how blocking can be used in the causal analysis of two real-world social media systems.
Learning Causal Models of Relational Domains
Maier, Marc (University of Massachusetts Amherst) | Taylor, Brian (University of Massachusetts Amherst) | Oktay, Huseyin (University of Massachusetts Amherst) | Jensen, David (University of Massachusetts Amherst)
Methods for discovering causal knowledge from observational data have been a persistent topic of AI research for several decades. Essentially all of this work focuses on knowledge representations for propositional domains. In this paper, we present several key algorithmic and theoretical innovations that extend causal discovery to relational domains. We provide strong evidence that effective learning of causal models is enhanced by relational representations. We present an algorithm, relational PC, that learns causal dependencies in a state-of-the-art relational representation, and we identify the key representational and algorithmic innovations that make the algorithm possible. Finally, we prove the algorithm's theoretical correctness and demonstrate its effectiveness on synthetic and real data sets.
Reports on the AAAI Fall Symposia (November 1999 and November 1998)
Daud, Fawzi, Mateas, Michael, Sengers, Phoebe, Brennan, Susan, Giboin, Alain, Traum, David, Chaudri, Vinay, Fikes, Richard E., Scott, Donia, Power, Richard, Jensen, David
The 1999 Association for the Advancement of Artificial Intelligence Fall Symposium Series was held Friday through Sunday, 5-7 November 1999, at the Sea Crest Oceanfront Resort and Conference Center. The titles of the five symposia were (1) Modal and Temporal Logics-Based Planning for Open Networked Multimedia Systems; (2) Narrative Intelligence; (3) Psychological Models of Communication in Collaborative Systems; (4) Question-Answering Systems; and (5) Using Layout for the Generation, Understanding, or Retrieval of Documents.
Reports on the AAAI Fall Symposia (November 1999 and November 1998)
Daud, Fawzi, Mateas, Michael, Sengers, Phoebe, Brennan, Susan, Giboin, Alain, Traum, David, Chaudri, Vinay, Fikes, Richard E., Scott, Donia, Power, Richard, Jensen, David
We order its events and find meaning in them by assimilating them to more or less familiar narratives. Temporal A wide variety of systems were presented: 1999, at the Sea Crest Oceanfront and modal logics have been used to story generation, interactive Resort and Conference Center. The reason about time, action, and adaptive fiction (including the first public titles of the five symposia were change and to program and verify demonstration from Joseph Bates's networked systems. How can we create characters from specifications of service quality in which interactive narrative emerges? The symposium focused mainly on a single, comprehensive theoretical framework, Clark's grounding model.