AITopics

doi: 10.1613/jair.199

10150

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.60)

Heckerman, D., Shachter, R.

Decision-Theoretic Foundations for Causal Reasoning

Journal of Artificial Intelligence ResearchDec-1-1995

We present a definition of cause and effect in terms of decision-theoretic primitives and thereby provide a principled foundation for causal reasoning. Our definition departs from the traditional view of causation in that causal assertions may vary with the set of decisions available. We argue that this approach provides added clarity to the notion of cause. Also in this paper, we examine the encoding of causal relationships in directed acyclic graphs. We describe a special class of influence diagrams, those in canonical form, and show its relationship to Pearl's representation of cause and effect. Finally, we show how canonical form facilitates counterfactual reasoning.

artificial intelligence, causal reasoning, decision-theoretic foundation

doi: 10.1613/jair.202

10151

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.60)

Journal of Artificial Intelligence ResearchDec-1-1995

Statistical Feature Combination for the Evaluation of Game Positions

Buro, M.

This article describes an application of three well-known statistical methods in the field of game-tree search: using a large number of classified Othello positions, feature weights for evaluation functions with a game-phase-independent meaning are estimated by means of logistic regression, Fisher's linear discriminant, and the quadratic discriminant function for normally distributed features. Thereafter, the playing strengths are compared by means of tournaments between the resulting versions of a world-class Othello program. In this application, logistic regression - which is used here for the first time in the context of game playing - leads to better results than the other approaches.

artificial intelligence, ion, machine learning, (13 more...)

doi: 10.1613/jair.179

10146

Country: North America > United States (0.14)

Industry: Leisure & Entertainment > Games (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Journal of Artificial Intelligence ResearchDec-1-1995

Generalization of Clauses under Implication

Idestam-Almquist, P.

In the area of inductive learning, generalization is a main operation, and the usual definition of induction is based on logical implication. Recently there has been a rising interest in clausal representation of knowledge in machine learning. Almost all inductive learning systems that perform generalization of clauses use the relation theta-subsumption instead of implication. The main reason is that there is a well-known and simple technique to compute least general generalizations under theta-subsumption, but not under implication. However generalization under theta-subsumption is inappropriate for learning recursive clauses, which is a crucial problem since recursion is the basic program structure of logic programs. We note that implication between clauses is undecidable, and we therefore introduce a stronger form of implication, called T-implication, which is decidable between clauses. We show that for every finite set of clauses there exists a least general generalization under T-implication. We describe a technique to reduce generalizations under implication of a clause to generalizations under theta-subsumption of what we call an expansion of the original clause. Moreover we show that for every non-tautological clause there exists a T-complete expansion, which means that every generalization under T-implication of the clause is reduced to a generalization under theta-subsumption of the expansion.

artificial intelligence, io iac ep9gqr 569, machine learning, (13 more...)

doi: 10.1613/jair.194

10149

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.44)

Huffman, S. B., Laird, J. E.

Flexibly Instructable Agents

Journal of Artificial Intelligence ResearchNov-1-1995

This paper presents an approach to learning from situated, interactive tutorial instruction within an ongoing agent. Tutorial instruction is a flexible (and thus powerful) paradigm for teaching tasks because it allows an instructor to communicate whatever types of knowledge an agent might need in whatever situations might arise. To support this flexibility, however, the agent must be able to learn multiple kinds of knowledge from a broad range of instructional interactions. Our approach, called situated explanation, achieves such learning through a combination of analytic and inductive techniques. It combines a form of explanation-based learning that is situated for each instruction with a full suite of contextually guided responses to incomplete explanations. The approach is implemented in an agent called Instructo-Soar that learns hierarchies of new tasks and other domain knowledge from interactive natural language instructions. Instructo-Soar meets three key requirements of flexible instructability that distinguish it from previous systems: (1) it can take known or unknown commands at any instruction point; (2) it can handle instructions that apply to either its current situation or to a hypothetical situation specified in language (as in, for instance, conditional instructions); and (3) it can learn, from instructions, each class of knowledge it uses to perform tasks.

artificial intelligence, educational setting, instructable agent, (10 more...)

doi: 10.1613/jair.150

10145

Genre: Instructional Material > Course Syllabus & Notes (0.53)

Industry: Education > Educational Setting > Online (0.53)

Technology: Information Technology > Artificial Intelligence (0.91)

Journal of Artificial Intelligence ResearchOct-1-1995

Learning Membership Functions in a Function-Based Object Recognition System

Woods, K., Cook, D., Hall, L., Bowyer, K., Stark, L.

Functionality-based recognition systems recognize objects at the category level by reasoning about how well the objects support the expected function. Such systems naturally associate a ``measure of goodness'' or ``membership value'' with a recognized object. This measure of goodness is the result of combining individual measures, or membership values, from potentially many primitive evaluations of different properties of the object's shape. A membership function is used to compute the membership value when evaluating a primitive of a particular physical property of an object. In previous versions of a recognition system known as Gruff, the membership function for each of the primitive evaluations was hand-crafted by the system designer. In this paper, we provide a learning component for the Gruff system, called Omlet, that automatically learns membership functions given a set of example objects labeled with their desired category measure. The learning algorithm is generally applicable to any problem in which low-level membership values are combined through an and-or tree structure to give a final overall membership value.

artificial intelligence, desired, machine learning, (12 more...)

doi: 10.1613/jair.236

10144

Technology:

Information Technology > Artificial Intelligence > Vision (0.40)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Journal of Artificial Intelligence ResearchOct-1-1995

Improving Connectionist Energy Minimization

Pinkas, G., Dechter, R.

Symmetric networks designed for energy minimization such as Boltzman machines and Hopfield nets are frequently investigated for use in optimization, constraint satisfaction and approximation of NP-hard problems. Nevertheless, finding a global solution (i.e., a global minimum for the energy function) is not guaranteed and even a local solution may take an exponential number of steps. We propose an improvement to the standard local activation function used for such networks. The improved algorithm guarantees that a global minimum is found in linear time for tree-like subnetworks. The algorithm, called activate, is uniform and does not assume that the network is tree-like. It can identify tree-like subnetworks even in cyclic topologies (arbitrary networks) and avoid local minima along these trees. For acyclic networks, the algorithm is guaranteed to converge to a global minimum from any initial state of the system (self-stabilization) and remains correct under various types of schedulers. On the negative side, we show that in the presence of cycles, no uniform algorithm exists that guarantees optimality even under a sequential asynchronous scheduler. An asynchronous scheduler can activate only one unit at a time while a synchronous scheduler can activate any number of units in a single time step. In addition, no uniform algorithm exists to optimize even acyclic networks when the scheduler is synchronous. Finally, we show how the algorithm can be improved using the cycle-cutset scheme. The general algorithm, called activate-with-cutset, improves over activate and has some performance guarantees that are related to the size of the network's cycle-cutset.

connectionist energy minimization, constraint-based reasoning, machine learning, (2 more...)

doi: 10.1613/jair.130

10142

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.53)
Information Technology > Artificial Intelligence > Machine Learning (0.53)

Journal of Artificial Intelligence ResearchOct-1-1995

Diffusion of Context and Credit Information in Markovian Models

Bengio, Y., Frasconi, P.

This paper studies the problem of ergodicity of transition probability matrices in Markovian models, such as hidden Markov models (HMMs), and how it makes very difficult the task of learning to represent long-term context for sequential data. This phenomenon hurts the forward propagation of long-term context information, as well as learning a hidden state representation to represent long-term context, which depends on propagating credit information backwards in time. Using results from Markov chain theory, we show that this problem of diffusion of context and credit is reduced when the transition probabilities approach 0 or 1, i.e., the transition probability matrices are sparse and the model essentially deterministic. The results found in this paper apply to learning approaches based on continuous optimization, such as gradient descent and the Baum-Welch algorithm.

artificial intelligence, context and credit information, machine learning, (16 more...)

doi: 10.1613/jair.233

10143

Genre: Research Report (0.53)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

artificial intelligence, management and information, National Information Infrastructure, (4 more...)

The Role of Intelligent Systems in the National Information Infrastructure

Weld, Daniel S.

AI MagazineSep-15-1995

This report stems from a workshop that was organized by the Association for the Advancement of Artificial Intelligence (AAAI) and cosponsored by the Information Technology and Organizations Program of the National Science Foundation. The purpose of the workshop was twofold: first, to increase awareness among the artificial intelligence (AI) community of opportunities presented by the National Information Infrastructure (NII) activities, in particular, the Information Infrastructure and Tech-nology Applications (IITA) component of the High Performance Computing and Communications Program; and second, to identify key contributions of research in AI to the NII and IITA.

AI Magazine

Industry: Information Technology (0.75)

Technology: Information Technology > Artificial Intelligence (1.00)

AAAI Spring Symposia Report, artificial intelligence, management and information

The 1995 AAAI Spring Symposia Reports

AAAI,

AI MagazineSep-15-1995

The Association for the Advancement of Artificial Intelligence held its 1995 Spring Symposium Series on March 27 to 29 at Stanford University. This article contains summaries of the nine symposia that were conducted: (1) Empirical Methods in Discourse Interpretation and Generation; (2) Extending Theories of Action: Formal Theory and Practical Applications; (3) Information Gathering from Heterogeneous, Distributed Environments; (4) Integrated Planning Applications; (5) Interactive Story Systems: Plot and Character; (6) Lessons Learned from Implemented Software Architectures for Physical Agents; (7) Representation and Acquisition of Lexical Knowledge: Polysemy, Ambiguity, and Generativity; (8) Representing Mental States and Mechanisms; and (9) Systematic Methods of Scientific Discovery.

AI Magazine

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.73)