Education
VModel: A Visual Qualitative Modeling Environment for Middle-school Students
Forbus, Kenneth D., Carney, Karen, Sherin, Bruce L., II, Leo C. Ureel
Learning how to create, test, and revise models is a central skill in scientific reasoning. We argue that qualitative modeling provides an appropriate level of representation for helping middle-school students learn to become modelers. We describe Vmodel, a system we have created that uses visual representations and that enables middle-school students to create qualitative models. Software coaches use simple analyses of model structure plus qualitative simulation to provide feedback and explanations. This system has been used in several studies in Chicago public school classrooms, using curricula developed in collaboration with teachers. We discuss the design of the visual representation language, how Vmodel works, and evidence from school studies that indicate it is successful in helping students.
Making Better Recommendations with Online Profiling Agents
In recent years, we have witnessed the success of autonomous agents applying machine-learning techniques across a wide range of applications. However, agents applying the same machine-learning techniques in online applications have not been so successful. Even agent-based hybrid recommender systems that combine information filtering techniques with collaborative filtering techniques have been applied with considerable success only to simple consumer goods such as movies, books, clothing, and food. Yet complex, adaptive autonomous agent systems that can handle complex goods such as real estate, vacation plans, insurance, mutual funds, and mortgages have emerged. To a large extent, the reinforcement learning methods developed to aid agents in learning have been more successfully deployed in offline applications. The inherent limitations in these methods have rendered them somewhat ineffective in online applications. In this article, we postulate that a small amount of prior knowledge and human-provided input can dramatically speed up online learning. We demonstrate that our agent HumanE -- with its prior knowledge or "experiences" about the real estate domain -- can effectively assist users in identifying requirements, especially unstated ones, quickly and unobtrusively.
Special Issue on Innovative Applications of AI: Guest Editor's Introduction
Randall W. Hill, Jr., Jacobstein, Neil
We are pleased to publish this special selection of articles from the Sixteenth Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-04), which occurred July 27-29, 2004 in San Jose, California. IAAI is the premier venue for learning about AI's impact through deployed applications and emerging AI technologies. Case studies of deployed applications with measurable benefits arising from the use of AI technology provide clear evidence of the impact and value of AI technology to today's world. The emerging applications track features technologies that are rapidly maturing to the point of application. The seven articles selected for this special issue are extended versions of the papers that appeared at the conference. Four of the articles describe deployed applications that are already in use in the field. The other three articles, which are from the emerging technology track, were selected because they are particularly innovative and show great potential for deployment.
Efficiency versus Convergence of Boolean Kernels for On-Line Learning Algorithms
Khardon, R., Roth, D., Servedio, R. A.
The paper studies machine learning problems where each example is described using a set of Boolean features and where hypotheses are represented by linear threshold elements. One method of increasing the expressiveness of learned hypotheses in this context is to expand the feature set to include conjunctions of basic features. This can be done explicitly or where possible by using a kernel function. Focusing on the well known Perceptron and Winnow algorithms, the paper demonstrates a tradeoff between the computational efficiency with which the algorithm can be run over the expanded feature space and the generalization ability of the corresponding learning algorithm. We first describe several kernel functions which capture either limited forms of conjunctions or all conjunctions. We show that these kernels can be used to efficiently run the Perceptron algorithm over a feature space of exponentially many conjunctions; however we also show that using such kernels, the Perceptron algorithm can provably make an exponential number of mistakes even when learning simple functions. We then consider the question of whether kernel functions can analogously be used to run the multiplicative-update Winnow algorithm over an expanded feature space of exponentially many conjunctions. Known upper bounds imply that the Winnow algorithm can learn Disjunctive Normal Form (DNF) formulae with a polynomial mistake bound in this setting. However, we prove that it is computationally hard to simulate Winnow's behavior for learning DNF over such a feature set. This implies that the kernel functions which correspond to running Winnow for this problem are not efficiently computable, and that there is no general construction that can run Winnow with kernels.
Learning Content Selection Rules for Generating Object Descriptions in Dialogue
A fundamental requirement of any task-oriented dialogue system is the ability to generate object descriptions that refer to objects in the task domain. The subproblem of content selection for object descriptions in task-oriented dialogue has been the focus of much previous work and a large number of models have been proposed. In this paper, we use the annotated COCONUT corpus of task-oriented design dialogues to develop feature sets based on Dale and Reiter's (1995) incremental model, Brennan and Clark's (1996) conceptual pact model, and Jordan's (2000b) intentional influences model, and use these feature sets in a machine learning experiment to automatically learn a model of content selection for object descriptions. Since Dale and Reiter's model requires a representation of discourse structure, the corpus annotations are used to derive a representation based on Grosz and Sidner's (1986) theory of the intentional structure of discourse, as well as two very simple representations of discourse structure based purely on recency. We then apply the rule-induction program RIPPER to train and test the content selection component of an object description generator on a set of 393 object descriptions from the corpus. To our knowledge, this is the first reported experiment of a trainable content selection component for object description generation in dialogue. Three separate content selection models that are based on the three theoretical models, all independently achieve accuracies significantly above the majority class baseline (17%) on unseen test data, with the intentional influences model (42.4%) performing significantly better than either the incremental model (30.4%) or the conceptual pact model (28.9%). But the best performing models combine all the feature sets, achieving accuracies near 60%. Surprisingly, a simple recency-based representation of discourse structure does as well as one based on intentional structure. To our knowledge, this is also the first empirical comparison of a representation of Grosz and Sidner's model of discourse structure with a simpler model for any generation task.
The 2004 Mobile Robot Competition and Exhibition
Smart, William D., Tejada, Sheila, Maxwell, Bruce, Stroupe, Ashley, Casper, Jennifer, Jacoff, Adam, Yanco, Holly, Bugajska, Magda
Running services in many small processes improves fault tolerance since any number of services can fail due to programming faults without affecting the rest of the system. While it is clearly important to be able to handle a wide range of failures, application authors should not be required to implement routines to test and react in every known mode of failure for every application, even if the failures are abstracted to a common interface. Thus, the framework also provides transparent fault-tolerance to users of system services. Errors in software and hardware are detected, and corrective action is taken. Services can be restarted or removed from the system, and clients are reconnected to the same service or to another service implementing the same interface without intervention from the application programmer. The Washington University team successfully demonstrated its failure-tolerant framework on its robot, Lewis (figure 6).
The Workshop Program at the Nineteenth National Conference on Artificial Intelligence
Muslea, Ion, Dignum, Virginia, Corkill, Daniel, Jonker, Catholijn, Dignum, Frank, Coradeschi, Silvia, Saffiotti, Alessandro, Fu, Dan, Orkin, Jeff, Cheetham, William E., Goebel, Kai, Bonissone, Piero, Soh, Leen-Kiat, Jones, Randolph M., Wray, Robert E., Scheutz, Matthias, Farias, Daniela Pucci de, Mannor, Shie, Theocharou, Georgios, Precup, Doina, Mobasher, Bamshad, Anand, Sarabjot Singh, Berendt, Bettina, Hotho, Andreas, Guesgen, Hans, Rosenstein, Michael T., Ghavamzadeh, Mohammad
AAAI presented the AAAI-04 workshop program on July 25-26, 2004 in San Jose, California. This program included twelve workshops covering a wide range of topics in artificial intelligence. The titles of the workshops were as follows: (1) Adaptive Text Extraction and Mining; (2) Agent Organizations: Theory and Practice; (3) Anchoring Symbols to Sensor Data; (4) Challenges in Game AI; (5) Fielding Applications of Artificial Intelligence; (6) Forming and Maintaining Coalitions in Adaptive Multiagent Systems; (7) Intelligent Agent Architectures: Combining the Strengths of Software Engineering and Cognitive Systems; (8) Learning and Planning in Markov Processes -- Advances and Challenges; (9) Semantic Web Personalization; (10) Sensor Networks; (11) Spatial and Temporal Reasoning; and (12) Supervisory Control of Learning and Adaptive Systems.
The Sixth International Conference on Enterprise Information Systems (ICEIS 2004
The Sixth International Conference on Enterprise Information Systems (ICEIS) was held in Porto, Portugal; previous venues were in Spain, France, and the United Kingdom. Since its inception in 1999, ICEIS has grown steadily, and is now one of the largest international conferences in the area of information systems. In 2004, more than 600 papers were submitted to the conference and its ten satellite workshops. One of the interesting features of this conference is the high number of invited speakers. In 2004, eighteen keynote speakers were featured at ICEIS and its workshops.
Data Integration: A Logic-Based Perspective
Calvanese, Diego, Giacomo, Giuseppe De
Data integration is the problem of combining data residing at different autonomous, heterogeneous sources and providing the client with a unified, reconciled global view of the data. We discuss dataintegration systems, taking the abstract viewpoint that the global view is an ontology expressed in a class-based formalism. We resort to an expressive description logic, ALCQI, that fully captures classbased representation formalisms, and we show that query answering in data integration, as well as all other relevant reasoning tasks, is decidable. However, when we have to deal with large amounts of data, the high computational complexity in the size of the data makes the use of a fullfledged expressive description logic infeasible in practice. This leads us to consider DL-Lite, a specifically tailored restriction of ALCQI that ensures tractability of query answering in data integration while keeping enough expressive power to capture the most relevant features of class-based formalisms.
Automatically Utilizing Secondary Sources to Align Information Across Sources
Michalowski, Martin, Thakkar, Snehal, Knoblock, Craig A.
XML, web services, and the semantic web have opened the door for new and exciting informationintegration applications. Information sources on the web are controlled by different organizations or people, utilize different text formats, and have varying inconsistencies. Therefore, any system that integrates information from different data sources must identify common entities from these sources. Data from many data sources on the web does not contain enough information to link the records accurately using state-of-the-art record-linkage systems. However, it is possible to exploit secondary data sources on the web to improve the recordlinkage process. We present an approach to accurately and automatically match entities from various data sources by utilizing a state-of-the-art record-linkage system in conjunction with a data-integration system. The data-integration system is able to automatically determine which secondary sources need to be queried when linking records from various data sources. In turn, the record-linkage system is then able to utilize this additional information to improve the accuracy of the linkage between datasets.