Government
Automatic Discovery and Transfer of Task Hierarchies in Reinforcement Learning
Mehta, Neville (Oregon State University) | Ray, Soumya (Case Western Reserve University) | Tadepalli, Prasad (Oregon State University) | Dietterich, Thomas (Oregon State University)
Sequential decision tasks present many opportunities for the study of transfer learning. A principal one among them is the existence of multiple domains that share the same underlying causal structure for actions. We describe an approach that exploits this shared causal structure to discover a hierarchical task structure in a source domain, which in turn speeds up learning of task execution knowledge in a new target domain. Our approach is theoretically justified and compares favorably to manually designed task hierarchies in learning efficiency in the target domain. We demonstrate that causally motivated task hierarchies transfer more robustly than other kinds of detailed knowledge that depend on the idiosyncrasies of the source domain and are hence less transferable.
Reports of the AAAI 2010 Fall Symposia
Azevedo, Roger (McGill University) | Biswas, Gautam (Vanderbilt University) | Bohus, Dan (Microsoft Research) | Carmichael, Ted (University of North Carolina at Charlotte) | Finlayson, Mark (Massachusetts Institute of Technology) | Hadzikadic, Mirsad (University of North Carolina at Charlotte) | Havasi, Catherine (Massachusetts Institute of Technology) | Horvitz, Eric (Microsoft Research) | Kanda, Takayuki (ATR Intelligent Robotics and Communications Laboratories) | Koyejo, Oluwasanmi (University of Texas at Austin) | Lawless, William (Paine College) | Lenat, Doug (Cycorp) | Meneguzzi, Felipe (Carnegie Mellon University) | Mutlu, Bilge (University of Wisconsin, Madison) | Oh, Jean (Carnegie Mellon University) | Pirrone, Roberto (University of Palermo) | Raux, Antoine (Honda Research Institute USA) | Sofge, Donald (Naval Research Laboratory) | Sukthankar, Gita (University of Central Florida) | Durme, Benjamin Van (Johns Hopkins University)
The Association for the Advancement of Artificial Intelligence was pleased to present the 2010 Fall Symposium Series, held Thursday through Saturday, November 11-13, at the Westin Arlington Gateway in Arlington, Virginia. The titles of the eight symposia are as follows: (1) Cognitive and Metacognitive Educational Systems; (2) Commonsense Knowledge; (3) Complex Adaptive Systems: Resilience, Robustness, and Evolvability; (4) Computational Models of Narrative; (5) Dialog with Robots; (6) Manifold Learning and Its Applications; (7) Proactive Assistant Agents ; and (8) Quantum Informatics for Cognitive, Social, and Semantic Processes. The highlights of each symposium are presented in this report.
Transfer Learning through Analogy in Games
Hinrichs, Thomas (Northwestern University) | Forbus, Kenneth D. (Northwestern University)
We have explored the use of analogy as a general approach to near and far transfer learning in domains ranging from physics problem solving to strategy games (Klenk and Forbus 2007; Hinrichs and Forbus 2007). Using the same basic analogical mechanism, we have found that the main differences between near and far transfer involve the amount of generalization that must be performed prior to transfer and the way that the matching process treats nonidentical predicates. We present here two extensions of our analogical matcher, minimal ascension and metamapping, that enable far transfer between representations with different relational vocabulary. Evidence for the effectiveness of these techniques is provided by a large-scale external evaluation, involving a substantial number of novel distant analogs.
Deep Transfer: A Markov Logic Approach
Davis, Jesse (Katholieke Universiteit Leuven) | Domingos, Pedro (University of Washington)
This article argues that currently the largest gap between human and machine learning is learning algorithms' inability to perform deep transfer, that is, generalize from one domain to another domain containing different objects, classes, properties and relations. We argue that second-order Markov logic is ideally suited for this purpose, and propose an approach based on it. Our algorithm discovers structural regularities in the source domain in the form of Markov logic formulas with predicate variables, and instantiates these formulas with predicates from the target domain. Our approach has successfully transferred learned knowledge among molecular biology, Web and social network domains.
The Special Issue of AI Magazine on Structured Knowledge Transfer
Shapiro, Daniel G. (Institute for the Study of Learning and Expertise) | Munoz-Avila, Hector (Lehigh University) | Stracuzzi, David (Sandia National Laboratories)
This issue summarizes the state of the art in structured knowledge transfer, which is an emerging approach to the general problem of knowledge acquisition and reuse. Its goal is to capture, in a general form, the internal structure of the objects, relations, strategies, and processes used to solve tasks drawn from a source domain, and exploit that knowledge to improve performance in a target domain.
Robust Nonparametric Regression via Sparsity Control with Application to Load Curve Data Cleansing
Mateos, Gonzalo, Giannakis, Georgios B.
Nonparametric methods are widely applicable to statistical inference problems, since they rely on a few modeling assumptions. In this context, the fresh look advocated here permeates benefits from variable selection and compressive sampling, to robustify nonparametric regression against outliers - that is, data markedly deviating from the postulated models. A variational counterpart to least-trimmed squares regression is shown closely related to an L0-(pseudo)norm-regularized estimator, that encourages sparsity in a vector explicitly modeling the outliers. This connection suggests efficient solvers based on convex relaxation, which lead naturally to a variational M-type estimator equivalent to the least-absolute shrinkage and selection operator (Lasso). Outliers are identified by judiciously tuning regularization parameters, which amounts to controlling the sparsity of the outlier vector along the whole robustification path of Lasso solutions. Reduced bias and enhanced generalization capability are attractive features of an improved estimator obtained after replacing the L0-(pseudo)norm with a nonconvex surrogate. The novel robust spline-based smoother is adopted to cleanse load curve data, a key task aiding operational decisions in the envisioned smart grid system. Computer simulations and tests on real load curve data corroborate the effectiveness of the novel sparsity-controlling robust estimators.
Representing First-Order Causal Theories by Logic Programs
Ferraris, Paolo, Lee, Joohyung, Lierler, Yuliya, Lifschitz, Vladimir, Yang, Fangkai
Nonmonotonic causal logic, introduced by Norman McCain and Hudson Turner, became a basis for the semantics of several expressive action languages. McCain's embedding of definite propositional causal theories into logic programming paved the way to the use of answer set solvers for answering queries about actions described in such languages. In this paper we extend this embedding to nondefinite theories and to first-order causal logic.
"Meaning" as a sociological concept: A review of the modeling, mapping, and simulation of the communication of knowledge and meaning
The development of discursive knowledge presumes the communication of meaning as analytically different from the communication of information. Knowledge can then be considered as a meaning which makes a difference. Whereas the communication of information is studied in the information sciences and scientometrics, the communication of meaning has been central to Luhmann's attempts to make the theory of autopoiesis relevant for sociology. Analytical techniques such as semantic maps and the simulation of anticipatory systems enable us to operationalize the distinctions which Luhmann proposed as relevant to the elaboration of Husserl's "horizons of meaning" in empirical research: interactions among communications, the organization of meaning in instantiations, and the self-organization of interhuman communication in terms of symbolically generalized media such as truth, love, and power. Horizons of meaning, however, remain uncertain orders of expectations, and one should caution against reification from the meta-biological perspective of systems theory.
Voting Theory, Data Fusion, and Explanations of Social Behavior
Urken, Arnold B. (University of Arizona)
The challenge of using communications infrastructure to stabilize other infrastructures is related to research on the collective communications systems in social animals, robots, and human-non-human interaction. In these systems, voting models can explicate patterns of observed behavior or predict collective outcomes. Developing more theoretical deductive explanatory power can increase our knowledge about the interplay of voters and communication that produces collective inferences. This paper suggests that many analyses of voting patterns have not integrated what is known about the predictive properties of voting processes into their analyses. Taking a more deductive approach enables us to think about the strengths and weaknesses of existing explanations and imagine new types of analysis that have implications for engineering communications systems to stabilize other infrastructures.