Education
Reports of the AAAI 2011 Conference Workshops
Agmon, Noa (University of Texas at Austin) | Agrawal, Vikas (Infosys Labs) | Aha, David W. (Naval Research Laboratory) | Aloimonos, Yiannis (University of Maryland, College Park) | Buckley, Donagh (EMC) | Doshi, Prashant (University of Georgia) | Geib, Christopher (University of Edinburgh) | Grasso, Floriana (University of Liverpool) | Green, Nancy (University of North Carolina Greensboro) | Johnston, Benjamin (University of Technology, Sydney) | Kaliski, Burt (VeriSign, Inc.) | Kiekintveld, Christopher (University of Texas at El Paso) | Law, Edith (Carnegie Mellon University) | Lieberman, Henry (Massachusetts Institute of Technology) | Mengshoel, Ole J. (Carnegie Mellon University) | Metzler, Ted (Oklahoma City University) | Modayil, Joseph (University of Alberta) | Oard, Douglas W. (University of Maryland, College Park) | Onder, Nilufer (Michigan Technological University) | O' (University College Cork) | Sullivan, Barry (Cognitive Systems Research Insitute) | Pastra, Katerina (McGill University) | Precup, Doina (Stottler Henke Associates, Inc.) | Ramachandran, Sowmya (University of Dundee) | Reed, Chris (Istanbul Technical University) | Sariel-Talay, Sanem (Carnegie Mellon University) | Selker, Ted (Infosys Technologies Ltd.) | Shastri, Lokendra (Carnegie Mellon University) | Smith, Stephen F. (University of Michigan at Ann Arbor) | Singh, Satinder (University of Wisconsin, Madison) | Srivastava, Siddharth (University of Central Florida) | Sukthankar, Gita (Naval Research Laboratory) | Uthus, David C. (University of Technology, Sydney) | Williams, Mary-Anne
The AAAI-11 workshop program was held Sunday and Monday, August 7โ18, 2011, at the Hyatt Regency San Francisco in San Francisco, California USA. The AAAI-11 workshop program included 15 workshops covering a wide range of topics in artificial intelligence. The titles of the workshops were Activity Context Representation: Techniques and Languages; Analyzing Microtext; Applied Adversarial Reasoning and Risk Modeling; Artificial Intelligence and Smarter Living: The Conquest of Complexity; AI for Data Center Management and Cloud Computing; Automated Action Planning for Autonomous Mobile Robots; Computational Models of Natural Argument; Generalized Planning; Human Computation; Human-Robot Interaction in Elder Care; Interactive Decision Theory and Game Theory; Language-Action Tools for Cognitive Artificial Agents: Integrating Vision, Action and Language; Lifelong Learning; Plan, Activity, and Intent Recognition; and Scalable Integration of Analytics and Visualization. This article presents short summaries of those events.
Reports of the AAAI 2011 Fall Symposia
Blisard, Sam (Naval Research Laboratory) | Carmichael, Ted (University of North Carolina at Charlotte) | Ding, Li (University of Maryland, Baltimore County) | Finin, Tim (University of Maryland, Baltimore County) | Frost, Wende (Naval Research Laboratory) | Graesser, Arthur (University of Memphis) | Hadzikadic, Mirsad (University of North Carolina at Charlotte) | Kagal, Lalana (Massachusetts Institute of Technology) | Kruijff, Geert-Jan M. (German Research Center for Artificial Intelligence) | Langley, Pat (Arizona State University) | Lester, James (North Carolina State University) | McGuinness, Deborah L. (Rensselaer Polytechnic Institute) | Mostow, Jack (Carnegie Mellon University) | Papadakis, Panagiotis (University of Sapienza, Rome) | Pirri, Fiora (Sapienza University of Rome) | Prasad, Rashmi (University of Wisconsin-Milwaukee) | Stoyanchev, Svetlana (Columbia University) | Varakantham, Pradeep (Singapore Management University)
The Association for the Advancement of Artificial Intelligence was pleased to present the 2011 Fall Symposium Series, held Friday through Sunday, November 4โ6, at the Westin Arlington Gateway in Arlington, Virginia. The titles of the seven symposia are as follows: (1) Advances in Cognitive Systems; (2) Building Representations of Common Ground with Intelligent Agents; (3) Complex Adaptive Systems: Energy, Information and Intelligence; (4) Multiagent Coordination under Uncertainty; (5) Open Government Knowledge: AI Opportunities and Challenges; (6) Question Generation; and (7) Robot-Human Teamwork in Dynamic Adverse Environment. The highlights of each symposium are presented in this report.
The AAAI 2011 Robot Exhibition
Chernova, Sonia (Worcester Polytechnic Institut) | Dodds, Zachary (Harvey Mudd College) | Stilman, Mike (Georgia Institute of Technology) | Touretzky, Dave (Carnegie Mellon University) | Thomaz, Andrea L. (Georgia Institute of Technology)
On the day before the exhibition the participants convened a workshop of 18 short talks. Each track's exhibitors presented a summary of their exhibit. In addition, four guest speakers provided a broader context for all of the exhibitors' efforts. The first guest speaker was the National Science Foundation's Sven Koenig, who highlighted several federal programs that support projects in embodied intelligence. Koenig also provided insights into some of these program's specific priorities, such as international collaborations and educational engagement.
Mapping the Landscape of Human-Level Artificial General Intelligence
Adams, Sam (IBM) | Arel, Itmar (University of Tennessee) | Bach, Joscha (Humboldt University of Berlin) | Coop, Robert (University of Tennessee) | Furlan, Rod (Quaternix Research, Inc.) | Goertzel, Ben (Independent Researcher and Author) | Hall, J. Storrs (George Mason University) | Samsonovich, Alexei (Tufts University) | Scheutz, Matthias (Southern Illinois University, Carbondale) | Schlesinger, Matthew (University of Buffalo, State University of New York) | Shapiro, Stuart C. (VivoMind Research, LLC) | Sowa, John
Of course, this is far from the first attempt to plot a course toward human-level AGI: arguably this was the goal of the founders of the field of artificial intelligence in the 1950s, and has been pursued by a steady stream of AI researchers since, even as the majority of the AI field has focused its attention on more narrow, specific subgoals. The ideas presented here build on the ideas of others in innumerable ways, but to review the history of AI and situate the current effort in the context of its predecessors would require a much longer article than this one. Thus we have chosen to focus on the results of our AGI roadmap discussions, acknowledging in a broad way the many debts owed to many prior researchers. References to the prior literature on evaluation of advanced AI systems are given by Laird (Laird et al. 2009) and Geortzel and Bugaj (2009), which may in a limited sense be considered prequels to this article. We begin by discussing AGI in general and adopt a pragmatic goal for measuring progress toward its attainment. An initial capability landscape for AGI The heterogeneity of general intelligence in will be presented, drawing on major themes from humans makes it practically impossible to develop developmental psychology and illuminated by a comprehensive, fine-grained measurement system mathematical, physiological, and informationprocessing for AGI. While we encourage research in defining perspectives. The challenge of identifying such high-fidelity metrics for specific capabilities, appropriate tasks and environments for measuring we feel that at this stage of AGI development AGI will be taken up. Several scenarios will a pragmatic, high-level goal is the best we can be presented as milestones outlining a roadmap agree upon. I advocate beginning with a system that has minimal, although extensive, built-in capabilities. Many variant approaches have been proposed A classic example of the narrow AI approach was for achieving such a goal, and both the AI and AGI IBM's Deep Blue system (Campbell, Hoane, and communities have been working for decades on Hsu 2002), which successfully defeated world chess the myriad subgoals that would have to be champion Gary Kasparov but could not readily achieved and integrated to deliver a comprehensive apply that skill to any other problem domain without AGI system.
Challenges and Opportunities in Applied Machine Learning
Brodley, Carla E. (Tufts University) | Rebbapragada, Umaa (Jet Propulsion Laboratory) | Small, Kevin (Tufts Medical Center) | Wallace, Byron (Tufts University)
Machine learning research is often conducted in vitro, divorced from motivating practical applications. A researcher might develop a new method for the general task of classification, then assess its utility by comparing its performance (such as accuracy or AUC) to that of existing classification models on publicly available datasets. In terms of advancing machine learning as an academic discipline, this approach has thus far proven quite fruitful. However, it is our view that the most interesting open problems in machine learning are those that arise during its application to real-world problems. We illustrate this point by reviewing two of our interdisciplinary collaborations, both of which have posed unique machine learning problems, providing fertile ground for novel research.
Model-based Utility Functions
Orseau and Ring, as well as Dewey, have recently described problems, including self-delusion, with the behavior of agents using various definitions of utility functions. An agent's utility function is defined in terms of the agent's history of interactions with its environment. This paper argues, via two examples, that the behavior problems can be avoided by formulating the utility function in two steps: 1) inferring a model of the environment from interactions, and 2) computing utility as a function of the environment model. Basing a utility function on a model that the agent must learn implies that the utility function must initially be expressed in terms of specifications to be matched to structures in the learned model. These specifications constitute prior assumptions about the environment so this approach will not work with arbitrary environments. But the approach should work for agents designed by humans to act in the physical world. The paper also addresses the issue of self-modifying agents and shows that if provided with the possibility to modify their utility functions agents will not choose to do so, under some usual assumptions.
Multi-Task Feature Learning Via Efficient l2,1-Norm Minimization
Liu, Jun, Ji, Shuiwang, Ye, Jieping
The problem of joint feature selection across a group of related tasks has applications in many areas including biomedical informatics and computer vision. We consider the l2,1-norm regularized regression model for joint feature selection from multiple tasks, which can be derived in the probabilistic framework by assuming a suitable prior from the exponential family. One appealing feature of the l2,1-norm regularization is that it encourages multiple predictors to share similar sparsity patterns. However, the resulting optimization problem is challenging to solve due to the non-smoothness of the l2,1-norm regularization. In this paper, we propose to accelerate the computation by reformulating it as two equivalent smooth convex optimization problems which are then solved via the Nesterov's method-an optimal first-order black-box method for smooth convex optimization. A key building block in solving the reformulations is the Euclidean projection. We show that the Euclidean projection for the first reformulation can be analytically computed, while the Euclidean projection for the second one can be computed in linear time. Empirical evaluations on several data sets verify the efficiency of the proposed algorithms.
Virtual Vector Machine for Bayesian Online Classification
Minka, Thomas P., Xiang, Rongjing, Yuan, null, Qi, null
In a typical online learning scenario, a learner is required to process a large data stream using a small memory buffer. Such a requirement is usually in conflict with a learner's primary pursuit of prediction accuracy. To address this dilemma, we introduce a novel Bayesian online classification algorithm, called the Virtual Vector Machine. The virtual vector machine allows you to smoothly tradeoff prediction accuracy with memory size. The virtual vector machine summarizes the information contained in the preceding data stream by a Gaussian distribution over the classification weights plus a constant number of virtual data points. The virtual data points are designed to add extra non-Gaussian information about the classification weights. To maintain the constant number of virtual points, the virtual vector machine adds the current real data point into the virtual point set, merges two most similar virtual points into a new virtual point or deletes a virtual point that is far from the decision boundary. The information lost in this process is absorbed into the Gaussian distribution. The extra information provided by the virtual points leads to improved predictive accuracy over previous online classification algorithms.
Effects of Treatment on the Treated: Identification and Generalization
Many applications of causal analysis call for assessing, retrospectively, the effect of withholding an action that has in fact been implemented. This counterfactual quantity, sometimes called "effect of treatment on the treated," (ETT) have been used to to evaluate educational programs, critic public policies, and justify individual decision making. In this paper we explore the conditions under which ETT can be estimated from (i.e., identified in) experimental and/or observational studies. We show that, when the action invokes a singleton variable, the conditions for ETT identification have simple characterizations in terms of causal diagrams. We further give a graphical characterization of the conditions under which the effects of multiple treatments on the treated can be identified, as well as ways in which the ETT estimand can be constructed from both interventional and observational distributions.
The Discrete Infinite Logistic Normal Distribution
Paisley, John, Wang, Chong, Blei, David
We present the discrete infinite logistic normal distribution (DILN), a Bayesian nonparametric prior for mixed membership models. DILN is a generalization of the hierarchical Dirichlet process (HDP) that models correlation structure between the weights of the atoms at the group level. We derive a representation of DILN as a normalized collection of gamma-distributed random variables, and study its statistical properties. We consider applications to topic modeling and derive a variational inference algorithm for approximate posterior inference. We study the empirical performance of the DILN topic model on four corpora, comparing performance with the HDP and the correlated topic model (CTM). To deal with large-scale data sets, we also develop an online inference algorithm for DILN and compare with online HDP and online LDA on the Nature magazine, which contains approximately 350,000 articles.