Human Spatial Relational Reasoning: Processing Demands, Representations, and Cognitive Model

AAAI Conferences

Empirical findings indicate that humans draw inferences about spatial arrangements by constructing and manipulating mental models, which are internal representations of objects and relations in spatial working memory. Central to the Mental Model Theory (MMT) is the assumption that the human reasoning process can be divided into three phases: (i) mental model construction, (ii) model inspection, and (iii) model validation. The MMT can be formalized as a computational model that connects the reasoning process to operations on mental model representations. To this end, a computational model has been implemented in the cognitive architecture ACT-R that explains human reasoning difficulty by the number of model operations. The presented ACT-R model can simulate psychological findings on spatial reasoning problems from a previous study that investigated conventional behavioral data, such as response times and error rates, in the context of certain mental model construction principles.
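To make the three phases concrete, here is a minimal, hypothetical Python sketch of model construction, inspection, and validation for one-dimensional "left-of"/"right-of" premises; it is not the ACT-R model described above, and the premise encoding and helper names are invented for illustration.

```python
# Toy illustration of the three MMT phases on one-dimensional spatial premises.
# Not the ACT-R model from the paper; premise format and function names are
# illustrative only.

def construct(premises):
    """Phase 1: build a spatial array (mental model) from the premises."""
    model = []
    for a, rel, b in premises:
        if a not in model and b not in model:
            model.extend([a, b] if rel == "left-of" else [b, a])
        elif a in model and b not in model:
            i = model.index(a)
            model.insert(i + 1 if rel == "left-of" else i, b)
        elif b in model and a not in model:
            i = model.index(b)
            model.insert(i if rel == "left-of" else i + 1, a)
    return model

def inspect(model, query):
    """Phase 2: read a putative conclusion off the model."""
    a, rel, b = query
    return (model.index(a) < model.index(b)) == (rel == "left-of")

def validate(premises, model):
    """Phase 3: check that the model is consistent with every premise."""
    return all(inspect(model, p) for p in premises)

premises = [("A", "left-of", "B"), ("B", "left-of", "C")]
model = construct(premises)                       # ['A', 'B', 'C']
print(model, inspect(model, ("A", "left-of", "C")), validate(premises, model))
```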


Discovering Latent Strategies

AAAI Conferences

Strategy mining is a new area of research concerned with discovering the strategies that underlie decision-making. In this paper, we formulate the strategy-mining problem as a clustering problem, called the latent-strategy problem. In a latent-strategy problem, a corpus of data instances is given, each of which is represented by a set of features and a decision label. The inherent dependency of the decision label on the features is governed by a latent strategy. The objective is to find clusters, each of which contains data instances governed by the same strategy. Existing clustering algorithms are ill-suited to clustering such dependencies because they either assume feature independence (e.g., K-means) or only consider the co-occurrence of features without explicitly modeling the dependency of the decision label on the other features (e.g., Latent Dirichlet Allocation (LDA)). In this paper, we present a baseline unsupervised learning algorithm for dependency clustering. Our model-based clustering algorithm iterates between an assignment step and a minimization step to learn a mixture of decision tree models that represent latent strategies. Like the Expectation Maximization algorithm, our algorithm is grounded in statistical learning theory. Unlike other clustering algorithms, ours is resistant to irrelevant features, and its learned clusters (modeled by decision trees) are highly interpretable and predictive. We systematically evaluate our algorithm on a common-law dataset of actual cases. Experimental results show that our algorithm significantly outperforms K-means and LDA at clustering dependencies.
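As a rough illustration of the assignment/minimization loop described above, the following Python sketch learns a mixture of decision trees with scikit-learn; the random initialization, fixed cluster count, depth limit, and stopping rule are simplifications rather than the authors' algorithm. X is assumed to be a NumPy feature matrix and y the corresponding decision labels.

```python
# Sketch of dependency clustering with a mixture of decision trees: alternate
# between assigning each instance to the tree that best predicts its decision
# label (assignment step) and refitting one tree per cluster (minimization
# step). A simplification for illustration, not the authors' exact algorithm.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def label_prob(tree, X, y):
    """Probability a tree assigns to each instance's own label (0 if unseen)."""
    proba = tree.predict_proba(X)
    col = {c: j for j, c in enumerate(tree.classes_)}
    return np.array([proba[i, col[l]] if l in col else 0.0
                     for i, l in enumerate(y)])

def cluster_strategies(X, y, k=2, iters=10, seed=0):
    rng = np.random.default_rng(seed)
    assign = rng.integers(0, k, size=len(X))              # random initialization
    for _ in range(iters):
        trees = []
        for c in range(k):                                # minimization step
            idx = np.where(assign == c)[0]
            if len(idx) == 0:                             # re-seed an empty cluster
                idx = rng.choice(len(X), size=min(5, len(X)), replace=False)
            trees.append(DecisionTreeClassifier(max_depth=3).fit(X[idx], y[idx]))
        scores = np.stack([label_prob(t, X, y) for t in trees])
        assign = scores.argmax(axis=0)                    # assignment step
    return assign, trees
```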


Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs

AAAI Conferences

In many multi-agent applications, such as distributed sensor networks, a network of agents acts collaboratively under uncertainty and local interactions. The Networked Distributed POMDP (ND-POMDP) provides a framework for modeling such cooperative multi-agent decision making. Existing work on ND-POMDPs has focused on offline techniques that require accurate models, which are usually costly to obtain in practice. This paper presents a model-free, scalable learning approach that synthesizes multi-agent reinforcement learning (MARL) and distributed constraint optimization (DCOP). By exploiting the structured interaction in ND-POMDPs, our approach distributes the learning of the joint policy and employs DCOP techniques to coordinate the distributed learning so as to ensure global learning performance. Our approach can learn a globally optimal policy for ND-POMDPs that satisfy a property called groupwise observability. Experimental results show that, with communication during learning and execution, our approach significantly outperforms the nearly optimal non-communication policies computed offline.
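The coordination idea can be caricatured as follows: each edge of the interaction graph holds a local Q-function, and the greedy joint action maximizes their sum, computed here exactly by dynamic programming on a chain (the role DCOP algorithms play more generally). This toy Python sketch omits the local reinforcement-learning updates and is not the paper's method; the chain topology, action set, and edge values are invented.

```python
# Toy coordination step: pick the joint action that maximizes the SUM of local
# edge Q-values on a chain interaction graph 0-1-...-(n-1), by dynamic
# programming over the chain. Illustrative only.
ACTIONS = (0, 1)

def greedy_joint_action(edge_q, n_agents):
    """edge_q[i][(a_i, a_j)] scores the joint action of agents i and i+1."""
    value = {a: 0.0 for a in ACTIONS}   # best sum to the right of the last agent
    policy = []                          # best response of agent i+1 to agent i
    for i in range(n_agents - 2, -1, -1):
        new_value, step = {}, {}
        for a in ACTIONS:
            best_b = max(ACTIONS, key=lambda b: edge_q[i][(a, b)] + value[b])
            new_value[a] = edge_q[i][(a, best_b)] + value[best_b]
            step[a] = best_b
        value, policy = new_value, [step] + policy
    joint = [max(ACTIONS, key=lambda a: value[a])]   # agent 0's best action
    for step in policy:
        joint.append(step[joint[-1]])                # propagate best responses
    return joint

# Example: three agents, two edges; edge values favor agreeing on action 1.
edge_q = {0: {(0, 0): 1, (0, 1): 0, (1, 0): 0, (1, 1): 2},
          1: {(0, 0): 1, (0, 1): 0, (1, 0): 0, (1, 1): 2}}
print(greedy_joint_action(edge_q, 3))    # -> [1, 1, 1]
```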


Can Quadrotors Succeed as an Educational Platform?

AAAI Conferences

The flexibility and controllability of quadrotor helicopters have made them a recent focus of interest among robotics and AI research groups. At the same time, their popularity has led to a wide range of commercially available platforms, some at prices accessible for undergraduate educational use. This project evaluates the ARDrone quadrotor helicopter as a basis for use in undergraduate classes such as robotics, computer vision, or embodied AI; that drone and its basic capabilities are summarized in Figure 1. We have encountered both successes and frustrations in using the ARDrone to date. Looking forward, the quadrotor's capabilities do seem a promising basis for future curricular offerings.


A Game-Theoretic Approach to Influence in Networks

AAAI Conferences

We propose influence games, a new class of graphical games, as a model of the behavior of large but finite networked populations. Grounded in non-cooperative game theory, we introduce a new approach to the study of influence in networks that captures the strategic aspects of complex interactions in the network. We study computational problems on influence games, including the identification of the most influential nodes. We characterize the computational complexity of various problems in influence games, propose several heuristics for the hard cases, and design approximation algorithms, with provable guarantees, for the most influential nodes problem.
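As a rough, hypothetical illustration of the strategic flavor of influence games, the sketch below runs best-response dynamics in a linear-threshold influence model on a small graph; the weights, thresholds, and update schedule are invented and this is not the paper's formal model or algorithms.

```python
# Best-response dynamics in a toy linear-threshold influence model: each node
# plays +1 (adopt) or -1 (reject); its best response is +1 iff the weighted sum
# of its neighbors' actions meets its threshold. A fixed point is a pure-strategy
# equilibrium. Weights and thresholds are invented for illustration.
import numpy as np

def best_response_dynamics(W, b, x0, max_rounds=100):
    """W: influence weights (zero diagonal), b: thresholds, x0: initial +/-1 actions."""
    x = np.array(x0, dtype=float)
    for _ in range(max_rounds):
        changed = False
        for i in range(len(x)):
            br = 1.0 if W[i] @ x >= b[i] else -1.0   # node i's best response
            if br != x[i]:
                x[i], changed = br, True
        if not changed:            # no player wants to deviate: equilibrium reached
            return x
    return x

W = np.array([[0, 1, 1],
              [1, 0, 1],
              [1, 1, 0]], dtype=float)
b = np.array([-0.5, -0.5, -0.5])                    # lenient adoption thresholds
print(best_response_dynamics(W, b, [-1, -1, 1]))    # node 2's adoption cascades: [1. 1. 1.]
```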


Automatic Group Sparse Coding

AAAI Conferences

Sparse Coding (SC), which models data vectors as sparse linear combinations over basis vectors (i.e., a dictionary), has been widely applied in machine learning, signal processing, and neuroscience. Recently, one specific SC technique, Group Sparse Coding (GSC), has been proposed to learn a common dictionary over multiple different groups of data, where the data groups are assumed to be pre-defined. In practice, this may not always be the case. In this paper, we propose Automatic Group Sparse Coding (AutoGSC), which can (1) discover the hidden data groups; (2) learn a common dictionary over different data groups; and (3) learn an individual dictionary for each data group. Finally, we conduct experiments on both synthetic and real-world data sets to demonstrate the effectiveness of AutoGSC, and compare it with traditional sparse coding and Nonnegative Matrix Factorization (NMF) methods.
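For readers unfamiliar with the terminology, the following sketch shows only the basic sparse coding primitive (data approximated as sparse codes times a dictionary) using scikit-learn's DictionaryLearning; it is not GSC or AutoGSC, and the data and hyperparameters are arbitrary.

```python
# Plain sparse coding building block: approximate each data vector as a sparse
# linear combination of dictionary atoms. This is only the SC primitive that
# GSC/AutoGSC build on, not the AutoGSC algorithm itself.
import numpy as np
from sklearn.decomposition import DictionaryLearning

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 30))                 # 200 data vectors, 30 dimensions

sc = DictionaryLearning(n_components=10, alpha=1.0, max_iter=50,
                        transform_algorithm="lasso_lars", random_state=0)
codes = sc.fit_transform(X)                    # sparse codes, shape (200, 10)
dictionary = sc.components_                    # learned dictionary, shape (10, 30)

reconstruction = codes @ dictionary            # X is approximated by codes @ dictionary
print(np.mean(codes == 0))                     # fraction of zero coefficients (sparsity)
```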


Lossy Conservative Update (LCU) Sketch: Succinct Approximate Count Storage

AAAI Conferences

In this paper, we propose a variant of the conservative-update Count-Min sketch to further reduce the overestimation error it incurs. Inspired by ideas from lossy counting, we divide a stream of items into multiple windows and decrement certain counts in the sketch at window boundaries. We refer to this approach as lossy conservative update (LCU). The reduction in overestimation error comes at the cost of introducing underestimation error in the counts. However, in our intrinsic evaluations, we show that the reduction in overestimation is much greater than the underestimation error introduced by LCU. We apply our LCU framework to scale distributional similarity computations to web-scale corpora. We show that this technique is more memory- and time-efficient, and more robust, than the conservative-update Count-Min (CU) sketch on this task.
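A minimal Python sketch of the idea, assuming a conservative-update Count-Min sketch with a simple window-boundary decrement rule (here, counters no larger than the current window index are decremented by one); the hash construction, window size, and decrement rule are illustrative stand-ins, not the exact LCU variants evaluated in the paper.

```python
# Conservative-update Count-Min sketch with a lossy decrement at window
# boundaries. Illustrative stand-in for the LCU variants in the paper.
import hashlib

class LCUSketch:
    def __init__(self, width=1024, depth=4, window=1000):
        self.w, self.d, self.window = width, depth, window
        self.table = [[0] * width for _ in range(depth)]
        self.n = 0                                   # items seen so far

    def _cells(self, item):
        for row in range(self.d):
            h = hashlib.blake2b(f"{row}:{item}".encode(), digest_size=8)
            yield row, int.from_bytes(h.digest(), "big") % self.w

    def add(self, item):
        cells = list(self._cells(item))
        est = min(self.table[r][c] for r, c in cells)
        for r, c in cells:                           # conservative update: only raise
            self.table[r][c] = max(self.table[r][c], est + 1)   # counters up to est+1
        self.n += 1
        if self.n % self.window == 0:                # window boundary: lossy decrement
            t = self.n // self.window                # of small counters
            for row in self.table:
                for c in range(self.w):
                    if 0 < row[c] <= t:
                        row[c] -= 1

    def estimate(self, item):
        return min(self.table[r][c] for r, c in self._cells(item))
```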


Abductive Markov Logic for Plan Recognition

AAAI Conferences

Plan recognition is a form of abductive reasoning that involves inferring plans that best explain sets of observed actions. Most existing approaches to plan recognition and other abductive tasks employ either purely logical methods that do not handle uncertainty, or purely probabilistic methods that do not handle structured representations. To overcome these limitations, this paper introduces an approach to abductive reasoning using a first-order probabilistic logic, specifically Markov Logic Networks (MLNs). It introduces several novel techniques for making MLNs efficient and effective for abduction. Experiments on three plan recognition datasets show the benefit of our approach over existing methods.
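As a toy caricature of abductive scoring (not MLN inference, which requires grounding first-order formulas and probabilistic inference), the sketch below picks the plan whose weighted rules best explain the observations; the rules, weights, and observations are invented.

```python
# Toy weighted abduction for plan recognition: score each candidate plan by the
# total weight of soft rules whose observations it explains, and return the
# highest-scoring plan. A caricature only; real MLN abduction is far richer.
OBSERVATIONS = {"goto(kitchen)", "pickup(kettle)"}

# (rule weight, observations the rule requires, plan the rule abduces)
RULES = [
    (1.5, {"goto(kitchen)", "pickup(kettle)"}, "make_tea"),
    (0.8, {"goto(kitchen)"},                   "cook_dinner"),
    (0.3, {"pickup(kettle)"},                  "clean_kitchen"),
]

def best_plan(observations, rules):
    scores = {}
    for weight, required, plan in rules:
        if required <= observations:             # rule fires: its plan gains weight
            scores[plan] = scores.get(plan, 0.0) + weight
    return max(scores, key=scores.get) if scores else None

print(best_plan(OBSERVATIONS, RULES))            # -> make_tea
```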


A Simple and Effective Unsupervised Word Segmentation Approach

AAAI Conferences

In this paper, we propose a new unsupervised approach to word segmentation. The core idea of our approach is a novel word induction criterion called WordRank, which estimates the goodness of word hypotheses (character or phoneme sequences). We devise a method to derive exterior word boundary information from the link structures of adjacent word hypotheses and incorporate interior word boundary information to complete the model. With WordRank, word segmentation can be modeled as an optimization problem, and a Viterbi-style algorithm is developed to search for the optimal segmentation. Extensive experiments conducted on phonetic transcripts as well as standard Chinese and Japanese data sets demonstrate the effectiveness of our approach. On the standard Brent version of the Bernstein-Ratner corpus, our approach outperforms the state-of-the-art Bayesian models by more than 3%. Moreover, our approach is simpler and more efficient than the Bayesian methods, making it more suitable for real-world applications.
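The Viterbi-style search can be sketched as a short dynamic program; the goodness function below is a toy frequency-based stand-in for WordRank, and the lexicon, penalty, and maximum word length are invented for illustration.

```python
# Viterbi-style search for the best segmentation of an unsegmented string:
# dynamic programming over end positions, scoring each candidate word with a
# goodness function. The goodness used here is a toy stand-in for WordRank.
import math

def segment(text, goodness, max_word_len=6):
    n = len(text)
    best = [float("-inf")] * (n + 1)   # best[i]: best score of text[:i]
    back = [0] * (n + 1)               # back[i]: start of the last word in that split
    best[0] = 0.0
    for i in range(1, n + 1):
        for j in range(max(0, i - max_word_len), i):
            score = best[j] + goodness(text[j:i])
            if score > best[i]:
                best[i], back[i] = score, j
    words, i = [], n
    while i > 0:                       # backtrack through the best split points
        words.append(text[back[i]:i])
        i = back[i]
    return list(reversed(words))

# Toy goodness: log-count in a tiny lexicon, with a penalty for unknown strings.
LEXICON = {"the": 50, "dog": 20, "ran": 10, "do": 5, "gran": 1}
def goodness(w):
    return math.log(LEXICON.get(w, 0) + 0.1) - 1.0

print(segment("thedogran", goodness))   # -> ['the', 'dog', 'ran']
```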


When to Stop? That Is the Question

AAAI Conferences

When to make a decision is a key question in decision-making problems characterized by uncertainty. In this paper we deal with decision making in environments where information arrives dynamically, and we address the tradeoff between waiting and stopping strategies. On the one hand, waiting to obtain more information reduces uncertainty, but it comes at a cost. On the other hand, stopping and making a decision based on the current expected utility avoids further waiting costs, but the decision is made under greater uncertainty. We prove that computing the optimal time to make a decision that guarantees the optimal utility is NP-hard. We propose a pessimistic approximation that guarantees an optimal decision whenever its recommendation is to wait. We empirically evaluate our algorithm and show that it produces near-optimal decisions much faster than the optimal algorithm.
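The wait-versus-stop tradeoff can be illustrated with a one-step lookahead on a binary hidden state: stop if the best immediate decision beats the expected value of paying a waiting cost for one more noisy observation. This is only a hedged illustration of the tradeoff, not the paper's pessimistic approximation; the observation accuracy and cost are invented.

```python
# One-step lookahead on a binary hidden state: compare deciding now with
# paying a cost to observe one more noisy signal and then deciding.
def decide_now_value(p):
    """Expected utility of the best immediate decision (utility 1 if correct)."""
    return max(p, 1 - p)

def wait_value(p, accuracy=0.8, cost=0.05):
    """Expected utility of paying `cost` for one more observation, then deciding."""
    value = -cost
    for o in (0, 1):
        p_obs = accuracy * p + (1 - accuracy) * (1 - p) if o == 1 \
                else (1 - accuracy) * p + accuracy * (1 - p)
        posterior = (accuracy if o == 1 else 1 - accuracy) * p / p_obs
        value += p_obs * decide_now_value(posterior)   # Bayes update, then decide
    return value

for belief in (0.5, 0.7, 0.95):
    action = "wait" if wait_value(belief) > decide_now_value(belief) else "stop"
    print(belief, action)   # uncertain beliefs favor waiting; confident ones favor stopping
```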