Goto

Collaborating Authors

 Personal


Feature-Weighted Linear Stacking

arXiv.org Artificial Intelligence

Ensemble methods, such as stacking, are designed to boost predictive accuracy by blending the predictions of multiple machine learning models. Recent work has shown that the use of meta-features, additional inputs describing each example in a dataset, can boost the performance of ensemble methods, but the greatest reported gains have come from nonlinear procedures requiring significant tuning and training time. Here, we present a linear technique, Feature-Weighted Linear Stacking (FWLS), that incorporates meta-features for improved accuracy while retaining the well-known virtues of linear regression regarding speed, stability, and interpretability. FWLS combines model predictions linearly using coefficients that are themselves linear functions of meta-features. This technique was a key facet of the solution of the second place team in the recently concluded Netflix Prize competition. Significant increases in accuracy over standard linear stacking are demonstrated on the Netflix Prize collaborative filtering dataset.


Next-Generation Automated Health Behavior Coaches

AAAI Conferences

Automated health behavior coaches (HBCs) potentially can provide a widely accessible, cost-effective means of promoting health behavior. Coaches are intelligent agents that “converse” with users, offering tailored feedback, advice, and empathy. Research subjects like coaches and comply with target behaviors, but interest and adherence wane over time. More research is needed on next-generation HBCs to improve coaching techniques, enhance user engagement, and extend adherence. However, the necessary technical tools and expertise reside in only a few research labs. In an effort to expand and accelerate research, we are developing an HBC Kit that will extend and specialize our more general Imp™ Kit. We propose 7 innovations for next-generation HBCs, demonstrate them in a lifestyle coach, and characterize authoring with the Imp Kit. We discuss planned extensions for the HBC Kit to enable a larger and more diverse community to create and evaluate a broader range of coaches.


Using Virtual Patients to Train Clinical Interviewing Skills

AAAI Conferences

Virtual patients are viewed as a cost-effective alternative to standardized patients for role-play training of clinical interviewing skills. However, training studies produce mixed results. Students give high ratings to practice with virtual patients and feel more self-confident, but they show little improvement in objective skills. This confidence-competence gap matches a common cognitive illusion, in which students overestimate the effectiveness of training that is too easy. We hypothesize that cost-effective training requires virtual patients that emphasize functional and psychological fidelity over physical fidelity. We discuss 12 design decisions aimed at cost-effective training and their application in virtual patients for practicing brief intervention in alcohol abuse. Our STAR Workshop includes 3 such patients and a virtual coach. A controlled experiment evaluated STAR and compared it to an easier E-Book and no-training Control. E-Book subjects displayed the illusion, giving high ratings to their training and self-confidence, but performing no better than Control subjects on skills. STAR subjects gave high ratings to their training and self-confidence and scored better higher than E-Book or Control subjects on skills. We invite other researchers to use the underlying Imp technology to build virtual patients for their own work.


GnuTutor: An Open Source Intelligent Tutoring System Based on AutoTutor

AAAI Conferences

This paper presents GnuTutor, an open source intelligent tutoring system (ITS) inspired by the AutoTutor ITS. The goal of GnuTutor is to create a freely available, open source ITS platform that can be used by schools and researchers alike. To achieve this goal, significant departures from AutoTutor's current design were made so that GnuTutor would use a smaller, non-proprietary code base but have the major functionality of AutoTutor, including mixed-initiative dialogue, an animated agent, speech act classification, and natural language understanding using latent semantic analysis. This paper describes the GnuTutor system, its components, and the major differences between GnuTutor and AutoTutor.


Essay in the Style of Douglas Hofstadter

AI Magazine

It was written not by a human being, but by my computer program EWI (an acronym for "experiments in writing intelligence"). EWI was fed the texts of two of Hofstadter's books--namely, Gödel, Escher, Bach (winner of the Pulitzer Prize for General Nonfiction in 1980) and Metamagical Themas--and then, following its code, EWI carefully analyzed these two books for their uniquely Hofstadterian stylistic elements and features, after which it recombined these stylistic elements in new fashions. EWI thereby came up with some 25 new and highly diverse "Hofstadter articles," one of which is given below, and the article is followed by a brief commentary about EWI and its output by Hofstadter himself. Actually, I should state up front that the wonderful sparkling dialogues of GEB, which are a substantial part of that book, were not used by EWI in generating any of the articles, because EWI is unfortunately not yet able to work with inputs that belong to different genres, such as chapters and dialogues. To combine stylistic aspects of two or more different genres of writing represents a very thorny challenge indeed. Endowing EWI with that extra level of flexibility is one of my next major goals.


AAAI News

AI Magazine

AAAI-10 will also include several special tracks, including the Nectar and Senior Member tracks, as well as specific research areas. Call for Papers for the main technical track and other tracks are available at www.aaai.org/aaai10.


Enhancing QA Systems with Complex Temporal Question Processing Capabilities

Journal of Artificial Intelligence Research

This paper presents a multilayered architecture that enhances the capabilities of current QA systems and allows different types of complex questions or queries to be processed. The answers to these questions need to be gathered from factual information scattered throughout different documents. Specifically, we designed a specialized layer to process the different types of temporal questions. Complex temporal questions are first decomposed into simple questions, according to the temporal relations expressed in the original question. In the same way, the answers to the resulting simple questions are recomposed, fulfilling the temporal restrictions of the original complex question. A novel aspect of this approach resides in the decomposition which uses a minimal quantity of resources, with the final aim of obtaining a portable platform that is easily extensible to other languages. In this paper we also present a methodology for evaluation of the decomposition of the questions as well as the ability of the implemented temporal layer to perform at a multilingual level. The temporal layer was first performed for English, then evaluated and compared with: a) a general purpose QA system (F-measure 65.47% for QA plus English temporal layer vs. 38.01% for the general QA system), and b) a well-known QA system. Much better results were obtained for temporal questions with the multilayered system. This system was therefore extended to Spanish and very good results were again obtained in the evaluation (F-measure 40.36% for QA plus Spanish temporal layer vs. 22.94% for the general QA system).



Beyond Audio and Video: Using Claytronics to Enable Pario

AI Magazine

In this article, we describe the hardware and software challenges involved in realizing Claytronics, a form of programmable matter made out of very large numbers-potentially millions-of submillimeter sized spherical robots. The goal of the claytronics project is to create ensembles of cooperating submillimeter  robots, which work together to form dynamic 3D physical objects. For example, claytronics might be used in telepresense to mimic, with high-fidelity and in 3-dimensional solid form, the look, feel, and motion of the person at the other end of the telephone call. To achieve this long-range vision we are investigating hardware mechanisms for constructing submillimeter robots, which can be manufactured en masse using photolithography. We also propose the creation of a new media type, which we call pario. The idea behind pario is to render arbitrary moving, physical 3-dimensional objects that you can see, touch, and even hold in your hands. In parallel with our hardware effort, we are developing novel distributed programming languages and algorithms to control the ensembles, LDP and Meld. Pario may fundamentally change how we communicate with others and interact with the world around us. Our research results to date suggest that there is a viable path to implementing both the hardware and software necessary for claytronics, which is a form of programmable matter that can be used to implement pario. While we have made significant progress, there is still much research ahead in order to turn this vision into reality.


Markov Logic: An Interface Layer for Artificial Intelligence

Morgan & Claypool Publishers

Most subfields of computer science have an interface layer via which applications communicate with the infrastructure, and this is key to their success (e.g., the Internet in networking, the relational model in databases, etc.). So far this interface layer has been missing in AI. First-order logic and probabilistic graphical models each have some of the necessary features, but a viable interface layer requires combining both. Markov logic is a powerful new language that accomplishes this by attaching weights to first-order formulas and treating them as templates for features of Markov random fields. Most statistical models in wide use are special cases of Markov logic, and first-order logic is its infinite-weight limit.