Goto

Collaborating Authors

 Asia


Neural Learning in Structured Parameter Spaces - Natural Riemannian Gradient

Neural Information Processing Systems

Shun-ichi Amari RIKEN Frontier Research Program, RIKEN, Hirosawa 2-1, Wako-shi 351-01, Japan amari@zoo.riken.go.jp Abstract The parameter space of neural networks has a Riemannian metric structure.The natural Riemannian gradient should be used instead of the conventional gradient, since the former denotes the true steepest descent direction of a loss function in the Riemannian space. The behavior of the stochastic gradient learning algorithm is much more effective if the natural gradient is used. The present paper studies the information-geometrical structure of perceptrons and other networks, and prove that the online learning method based on the natural gradient is asymptotically as efficient as the optimal batch algorithm. Adaptive modification of the learning constant is proposed and analyzed in terms of the Riemannian measure andis shown to be efficient. The natural gradient is finally applied to blind separation of mixtured independent signal sources. 1 Introd uction Neural learning takes place in the parameter space of modifiable synaptic weights of a neural network.


Learning Exact Patterns of Quasi-synchronization among Spiking Neurons from Data on Multi-unit Recordings

Neural Information Processing Systems

This paper develops arguments for a family of temporal log-linear models to represent spatiotemporal correlations among the spiking events in a group of neurons. The models can represent not just pairwise correlations but also correlations of higher order. Methods are discussed for inferring the existence or absence of correlations and estimating their strength. A frequentist and a Bayesian approach to correlation detection are compared.


Corpus-Based Approaches to Semantic Interpretation in NLP

AI Magazine

In recent years, there has been a flurry of research into empirical, corpus-based learning approaches to natural language processing (NLP). Most empirical NLP work to date has focused on relatively low-level language processing such as part-of-speech tagging, text segmentation, and syntactic parsing. The success of these approaches has stimulated research in using empirical learning techniques in other facets of NLP, including semantic analysis -- uncovering the meaning of an utterance. This article is an introduction to some of the emerging research in the application of corpus-based learning techniques to problems in semantic interpretation. In particular, we focus on two important problems in semantic interpretation, namely, word-sense disambiguation and semantic parsing.


ICMAS '96: Norms, Obligations, and Conventions

AI Magazine

In adjacent agents from dropping their commitments; (held in Kyoto, Japan, on 10-13 December domains (logical philosophy, social or better, how to regulate 1996). Both the program committee philosophy, decision theory), both legal agents dropping their commitments and the contributors included and social norms have received to a joint action to not disrupt the scientists from different backgrounds considerable, if not satisfactory, attention. The discussion addressed on, has contributed dramatically to These tasks have now entered the several issues: (1) What is the the attention given by the scientific MAS field's common knowledge. Often action is reduced to decision authorization, access regulation, For example, the existence of so-called (that is, a choice among one's privacy maintenance, respect of decency, Georgeff 1991) have shown that we and Tennenholtz 1992). Why? Don't we need a reciprocity.


Dynamic Non-Bayesian Decision Making

Journal of Artificial Intelligence Research

The model of a non-Bayesian agent who faces a repeated game with incomplete information against Nature is an appropriate tool for modeling general agent-environment interactions. In such a model the environment state (controlled by Nature) may change arbitrarily, and the feedback/reward function is initially unknown. The agent is not Bayesian, that is he does not form a prior probability neither on the state selection strategy of Nature, nor on his reward function. A policy for the agent is a function which assigns an action to every history of observations and actions. Two basic feedback structures are considered. In one of them -- the perfect monitoring case -- the agent is able to observe the previous environment state as part of his feedback, while in the other -- the imperfect monitoring case -- all that is available to the agent is the reward obtained. Both of these settings refer to partially observable processes, where the current environment state is unknown. Our main result refers to the competitive ratio criterion in the perfect monitoring case. We prove the existence of an efficient stochastic policy that ensures that the competitive ratio is obtained at almost all stages with an arbitrarily high probability, where efficiency is measured in terms of rate of convergence. It is further shown that such an optimal policy does not exist in the imperfect monitoring case. Moreover, it is proved that in the perfect monitoring case there does not exist a deterministic policy that satisfies our long run optimality criterion. In addition, we discuss the maxmin criterion and prove that a deterministic efficient optimal strategy does exist in the imperfect monitoring case under this criterion. Finally we show that our approach to long-run optimality can be viewed as qualitative, which distinguishes it from previous work in this area.


Applied AI News

AI Magazine

The system generates traffic flow measurements that enable traffic operations centers to monitor traffic movement and better respond to accidents Wal-Mart Stores (Bentonville, Ark.) Tektronix (Wilsonville, Ore.), a and congestion. This system, which manage its automated storage and models for its computer-assisted includes fuzzy logic and neural network retrieval system. The systems will Mexico), a producer of metals, has Calif.) is using visualization and digital monitor satellite signals in near real implemented an intelligent system to prototyping software for vehicle time, alerting operators to out-of-tolerance improve its zinc yield. The advanced design and manufacturing within its conditions and the presence of control expert system provides operator new concurrent engineering system. The application was developed and virtual manufacturing.


Worldwide Perspectives and Trends in Expert Systems: An Analysis Based on the Three World Congresses on Expert Systems

AI Magazine

Some people believe that the expert system field is dead, yet others believe it is alive and well. To gain a better insight into these possible views, the first three world congresses on expert systems (which typically attract representatives from some 45-50 countries) are used to determine the health of the global expert system field in terms of applied technologies, applications, and management. This article highlights some of these findings.


Flaw Selection Strategies for Partial-Order Planning

Journal of Artificial Intelligence Research

Several recent studies have compared the relative efficiency of alternative flaw selection strategies for partial-order causal link (POCL) planning. We review this literature, and present new experimental results that generalize the earlier work and explain some of the discrepancies in it. In particular, we describe the Least-Cost Flaw Repair (LCFR) strategy developed and analyzed by Joslin and Pollack (1994), and compare it with other strategies, including Gerevini and Schubert's (1996) ZLIFO strategy. LCFR and ZLIFO make very different, and apparently conflicting claims about the most effective way to reduce search-space size in POCL planning. We resolve this conflict, arguing that much of the benefit that Gerevini and Schubert ascribe to the LIFO component of their ZLIFO strategy is better attributed to other causes. We show that for many problems, a strategy that combines least-cost flaw selection with the delay of separable threats will be effective in reducing search-space size, and will do so without excessive computational overhead. Although such a strategy thus provides a good default, we also show that certain domain characteristics may reduce its effectiveness.


A Complete Classification of Tractability in RCC-5

Journal of Artificial Intelligence Research

We investigate the computational properties of the spatial algebra RCC-5 which is a restricted version of the RCC framework for spatial reasoning. The satisfiability problem for RCC-5 is known to be NP-complete but not much is known about its approximately four billion subclasses. We provide a complete classification of satisfiability for all these subclasses into polynomial and NP-complete respectively. In the process, we identify all maximal tractable subalgebras which are four in total.