AITopics

We have explored the use of analogy as a general approach to near and far transfer learning in domains ranging from physics problem solving to strategy games (Klenk and Forbus 2007; Hinrichs and Forbus 2007). Using the same basic analogical mechanism, we have found that the main differences between near and far transfer involve the amount of generalization that must be performed prior to transfer and the way that the matching process treats nonidentical predicates. We present here two extensions of our analogical matcher, minimal ascension and metamapping, that enable far transfer between representations with different relational vocabulary. Evidence for the effectiveness of these techniques is provided by a large-scale external evaluation, involving a substantial number of novel distant analogs.

artificial intelligence, machine learning, predicate, (17 more...)

Country: North America > United States > California (0.28)

Industry:

Leisure & Entertainment > Games (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Government > Military (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Analogical Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.93)

Klenk, Matthew (Navy Center for Applied Research in Artificial Intelligence) | Aha, David W. (Navy Center for Applied Research in Artificial Intelligence) | Molineaux, Matt (Knexus Research Corporation)

The Case for Case-Based Transfer Learning

Case-based reasoning (CBR) is a problem-solving process in which a new problem is solved by retrieving a similar situation and reusing its solution. Transfer learning occurs when, after gaining experience from learning how to solve source problems, the same learner exploits this experience to improve performance and/or learning on target problems. In transfer learning, the differences between the source and target problems characterize the transfer distance. CBR can support transfer learning methods in multiple ways. We illustrate how CBR and transfer learning interact and characterize three approaches for using CBR in transfer learning: (1) as a transfer learning method, (2) for problem learning, and (3) to transfer knowledge between sets of problems. We describe examples of these approaches from our own and related work and discuss applicable transfer distances for each. We close with conclusions and directions for future research applying CBR to transfer learning.

artificial intelligence, machine learning, transfer distance, (16 more...)

Country:

North America > United States (1.00)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)

Genre: Overview (0.49)

Industry:

Leisure & Entertainment > Sports > Football (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning (1.00)

Davis, Jesse (Katholieke Universiteit Leuven) | Domingos, Pedro (University of Washington)

Deep Transfer: A Markov Logic Approach

This article argues that currently the largest gap between human and machine learning is learning algorithms' inability to perform deep transfer, that is, generalize from one domain to another domain containing different objects, classes, properties and relations. We argue that second-order Markov logic is ideally suited for this purpose, and propose an approach based on it. Our algorithm discovers structural regularities in the source domain in the form of Markov logic formulas with predicate variables, and instantiates these formulas with predicates from the target domain. Our approach has successfully transferred learned knowledge among molecular biology, Web and social network domains.

artificial intelligence, formula, machine learning, (14 more...)

Country: North America > United States > Wisconsin (0.15)

Genre: Personal > Honors (0.48)

Industry:

Information Technology (0.50)
Government (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.84)

An Introduction to Intertask Transfer for Reinforcement Learning

Taylor, Matthew E. (Lafayette College) | Stone, Peter (University of Texas at Austin)

Transfer learning has recently gained popularity due to the development of algorithms that can successfully generalize information across multiple tasks. This article focuses on transfer in the context of reinforcement learning domains, a general learning framework where an agent acts in an environment to maximize a reward signal. The goals of this article are to (1) familiarize readers with the transfer learning problem in reinforcement learning domains, (2) explain why the problem is both interesting and difﬁcult, (3) present a selection of existing techniques that demonstrate different solutions, and (4) provide representative open problems in the hope of encouraging additional research in this exciting area.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Country:

North America > Canada (0.67)
North America > United States > California (0.46)

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Education (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Shapiro, Daniel G. (Institute for the Study of Learning and Expertise) | Munoz-Avila, Hector (Lehigh University) | Stracuzzi, David (Sandia National Laboratories)

The Special Issue of AI Magazine on Structured Knowledge Transfer

This issue summarizes the state of the art in structured knowledge transfer, which is an emerging approach to the general problem of knowledge acquisition and reuse. Its goal is to capture, in a general form, the internal structure of the objects, relations, strategies, and processes used to solve tasks drawn from a source domain, and exploit that knowledge to improve performance in a target domain.

artificial intelligence, knowledge management, machine learning, (15 more...)

Country: North America > United States (0.48)

Genre: Collection > Journal (0.32)

Industry: Government (0.49)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning (0.50)

Marriott, Chris, Gershenson, Carlos

Polyethism in a colony of artificial ants

arXiv.org Artificial IntelligenceApr-15-2011

We explore self-organizing strategies for role assignment in a foraging task carried out by a colony of artificial agents. Our strategies are inspired by various mechanisms of division of labor (polyethism) observed in eusocial insects like ants, termites, or bees. Specifically we instantiate models of caste polyethism and age or temporal polyethism to evaluated the benefits to foraging in a dynamic environment. Our experiment is directly related to the exploration/exploitation trade of in machine learning.

artificial intelligence, machine learning, polyethism, (19 more...)

arXiv.org Artificial Intelligence

1104.3152

Country: North America > Mexico > Mexico City (0.14)

Genre: Research Report (0.82)

Industry: Energy > Oil & Gas (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Sutton, Charles, Jordan, Michael I.

Bayesian inference for queueing networks and modeling of internet services

arXiv.org Machine LearningApr-15-2011

Modern Internet services, such as those at Google, Yahoo!, and Amazon, handle billions of requests per day on clusters of thousands of computers. Because these services operate under strict performance requirements, a statistical understanding of their performance is of great practical interest. Such services are modeled by networks of queues, where each queue models one of the computers in the system. A key challenge is that the data are incomplete, because recording detailed information about every request to a heavily used system can require unacceptable overhead. In this paper we develop a Bayesian perspective on queueing models in which the arrival and departure times that are not observed are treated as latent variables. Underlying this viewpoint is the observation that a queueing model defines a deterministic transformation between the data and a set of independent variables called the service times. With this viewpoint in hand, we sample from the posterior distribution over missing data and model parameters using Markov chain Monte Carlo. We evaluate our framework on data from a benchmark Web application. We also present a simple technique for selection among nested queueing models. We are unaware of any previous work that considers inference in networks of queues in the presence of missing data.

artificial intelligence, bayesian inference, machine learning, (16 more...)

arXiv.org Machine Learning

doi: 10.1214/10-AOAS392

1001.3355

Country: North America > United States > California (0.46)

Genre: Research Report (1.00)

Industry:

Consumer Products & Services > Travel (0.39)
Information Technology > Services (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Communications > Networks (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

arXiv.org Machine LearningApr-14-2011

Slicing: Nonsingular Estimation of High Dimensional Covariance Matrices Using Multiway Kronecker Delta Covariance Structures

Akdemir, Deniz

Nonsingular estimation of high dimensional covariance matrices is an important step in many statistical procedures like classification, clustering, variable selection an future extraction. After a review of the essential background material, this paper introduces a technique we call slicing for obtaining a nonsingular covariance matrix of high dimensional data. Slicing is essentially assuming that the data has Kronecker delta covariance structure. Finally, we discuss the implications of the results in this paper and provide an example of classification for high dimensional gene expression data.

artificial intelligence, machine learning, matrix, (16 more...)

arXiv.org Machine Learning

1104.1767

Country: North America > United States (0.46)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

arXiv.org Artificial IntelligenceApr-13-2011

Foundations for Understanding and Building Conscious Systems using Stable Parallel Looped Dynamics

Ravuri, Muralidhar

The problem of consciousness faced several challenges for a few reasons: (a) a lack of necessary and sufficient conditions, without which we would not know how close we are to the solution, (b) a lack of a synthesis framework to build conscious systems and (c) a lack of mechanisms explaining the transition between the lower-level chemical dynamics and the higher-level abstractions. In this paper, I address these issues using a new framework. The central result is that a person is 'minimally' conscious if and only if he knows at least one truth. This lets us move away from the vagueness surrounding consciousness and instead focus equivalently on: (i) what truths are and how our brain represents/relates them to each other and (ii) how we attain a feeling of knowing for a truth. For the former problem, since truths are things that do not change, I replace the abstract notion with a dynamical one called fixed sets. These sets are guaranteed to exist for our brain and other stable parallel looped systems. The relationships between everyday events are now built using relationships between fixed sets, until our brain creates a unique dynamical state called the self-sustaining threshold 'membrane' of fixed sets. For the latter problem, I present necessary and sufficient conditions for attaining a feeling of knowing using a definition of continuity applied to abstractions. Combining these results, I now say that a person is minimally conscious if and only if his brain has a self-sustaining dynamical membrane with abstract continuous paths. A synthetic system built to satisfy this equivalent self-sustaining membrane condition appears indistinguishable from human consciousness.

artificial intelligence, consciousness, machine learning, (17 more...)

arXiv.org Artificial Intelligence

1102.368

Country: North America > United States (0.45)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Philosophy (0.86)

Shimizu, Shohei, Washio, Takashi, Hyvarinen, Aapo, Imoto, Seiya

Finding Exogenous Variables in Data with Many More Variables than Observations

arXiv.org Machine LearningApr-7-2011

Many statistical methods have been proposed to estimate causal models in classical situations with fewer variables than observations (p>n). In this paper, we propose a method to find exogenous variables in a linear non-Gaussian causal model, which requires much smaller sample sizes than conventional methods and works even when p>>n. The key idea is to identify which variables are exogenous based on non-Gaussianity instead of estimating the entire structure of the model. Exogenous variables work as triggers that activate a causal chain in the model, and their identification leads to more efficient experimental designs and better understanding of the causal mechanism. We present experiments with artificial data and real-world gene expression data to evaluate the method.

artificial intelligence, bioinformatics, machine learning, (18 more...)

arXiv.org Machine Learning

doi: 10.1007/978-3-642-15819-3_10

0904.0838

Country: Asia > Japan > Honshū (0.28)

Genre: Research Report (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Biomedical Informatics (0.95)