AITopics

Inst., 19600 NW vonNeumann Dr, Beaverton, OR 97006 Abstract We propose a very simple, and well principled way of computing the optimal step size in gradient descent algorithms. The online version is very efficient computationally, and is applicable to large backpropagation networks trained on large data sets. The main ingredient is a technique for estimating the principal eigenvalue(s) and eigenvector(s) of the objective function's second derivative matrix (Hessian),which does not require to even calculate the Hessian. Severalother applications of this technique are proposed for speeding up learning, or for eliminating useless parameters. 1 INTRODUCTION Choosing the appropriate learning rate, or step size, in a gradient descent procedure such as backpropagation, is simultaneously one of the most crucial and expertintensive partof neural-network learning. We propose a method for computing the best step size which is both well-principled, simple, very cheap computationally, and, most of all, applicable to online training with large networks and data sets.

artificial intelligence, hessian, neural network, (14 more...)

Country:

North America > United States > Oregon > Washington County > Beaverton (0.24)
North America > United States > Massachusetts > Hampshire County (0.14)

Industry: Education (0.74)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.58)

Siu, Kai-Yeung, Roychowdhury, Vwani, Kailath, Thomas

Computing with Almost Optimal Size Neural Networks

Artificial neural networks are comprised of an interconnected collection of certain nonlinear devices; examples of commonly used devices include linear threshold elements, sigmoidal elements and radial-basis elements. We employ results from harmonic analysis and the theory of rational approximation toobtain almost tight lower bounds on the size (i.e.

artificial intelligence, neural network, threshold circuit, (16 more...)

Country:

North America > United States > Indiana > Tippecanoe County (0.14)
North America > United States > California > Orange County > Irvine (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Das, Sreerupa, Giles, C. Lee, Sun, Guo-Zheng

Using Prior Knowledge in a NNPDA to Learn Context-Free Languages

Language inference and automata induction using recurrent neural networks has gained considerable interest in the recent years. Nevertheless, success of these models hasbeen mostly limited to regular languages. Additional information in form of a priori knowledge has proved important and at times necessary for learning complex languages(Abu-Mostafa 1990; AI-Mashouq and Reed, 1991; Omlin and Giles, 1992; Towell, 1990). They have demonstrated that partial information incorporated in a connectionist model guides the learning process through constraints for efficient learning and better generalization. 'Ve have previously shown that the NNPDA model can learn Deterministic Context 65 66 Das, Giles, and Sun Output

artificial intelligence, neural network, nnpda, (13 more...)

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
North America > United States > Colorado > Boulder County > Boulder (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Input Reconstruction Reliability Estimation

Pomerleau, Dean A.

This paper describes a technique called Input Reconstruction Reliability Estimation (IRRE) for determining the response reliability of a restricted class of multi-layer perceptrons (MLPs). The technique uses a network's ability to accurately encode the input pattern in its internal representation as a measure of its reliability. The more accurately a network is able to reconstruct the input pattern from its internal representation, the more reliable the network is considered to be. IRRE is provides a good estimate of the reliability of MLPs trained for autonomous driving. Results are presented in which the reliability estimates provided by IRRE are used to select between networks trained for different driving situations. 1 Introduction In many real world domains it is important to know the reliability of a network's response since a single network cannot be expected to accurately handle all the possible inputs.

artificial intelligence, neural network, representation, (16 more...)

Country: North America > United States (0.69)

Industry:

Transportation > Ground > Road (0.36)
Information Technology (0.36)
Automobiles & Trucks (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.54)

A Hybrid Neural Net System for State-of-the-Art Continuous Speech Recognition

Zavaliagkos, G., Zhao, Y., Schwartz, R., Makhoul, J.

Untill recently, state-of-the-art, large-vocabulary, continuous speech recognition (CSR) has employed Hidden Markov Modeling (HMM) to model speech sounds. In an attempt to improve over HMM we developed a hybrid system that integrates HMM technology with neural We present the concept of a "Segmental Neural Net"networks.

hybrid system, neural network, speech recognition, (18 more...)

Country: North America > United States > Massachusetts (0.29)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.49)

Dayan, Peter, Hinton, Geoffrey E.

Feudal Reinforcement Learning

One way to speed up reinforcement learning is to enable learning to happen simultaneously at multiple resolutions in space and time. This paper shows how to create a Q-Iearning managerial hierarchy how to set tasks to their submanagersin which high level managers learn how to satisfy them. Sub-managerswho, in turn, learn understand their managers' commands. Theyneed not initially simply learn to maximise their reinforcement in the context of the current command. We illustrate the system using a simple maze task .. As the system learns how to get around, satisfying commands at the multiple than standard, flat, Q-Iearninglevels, it explores more efficiently and builds a more comprehensive map. 1 INTRODUCTION Straightforward reinforcement learning has been quite successful at some relatively thecomplex tasks like playing backgammon (Tesauro, 1992).

agent, artificial intelligence, reinforcement learning, (16 more...)

Country:

North America > United States (0.69)
North America > Canada > Ontario > Toronto (0.15)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Green Engineering AI Tools Benefit the Environment

Hedberg, Sara R.

artificial intelligence, environmental compatibility, green engineering ai tool benefit, (13 more...)

For over a decade now, AI techniques have been applied to some of the hardest problems faced by business today, often with stellar results and a tenfold-plus return on investment. One of the major problems faced by businesses in the 1990s is how to produce environmentally friendly products and stay profitable. A pioneering consortium at Carnegie Mellon University (CMU) is using AI, combined with operations research, environmental science, public policy, and other disciplines, to build tools for green engineering. Green engineering is an approach to product development that balances environmental compatibility against economic profitability. It looks at the entire life cycle of the product, from design to disposal, and seeks to extend this life cycle through remanufacturing, reusing, and recycling products and components.

Country: North America > United States > Florida > Alachua County > Gainesville (0.17)

Industry: Law (0.35)

Technology:

Information Technology > Artificial Intelligence > Applied AI (0.35)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.31)

Burton, Stu, Swanson, Kent, Leonard, Lisa

Quality and Knowledge in Software Engineering

Celite corporation and Andersen Consulting have developed an advanced approach to traditional software development called the application software factory (ASF)." The approach is an integration of technology and total quality "management" techniques that includes the use of an expert system to guide module design and perform "module programming." The expert system component is called the knowledge-based design assistant and its inclusion in the ASF methodology" has significantly reduced module development time, training time, and module and communication errors.

expert system, module, software engineering, (20 more...)

Country:

North America > United States > California (0.14)
North America > United States > Colorado (0.14)

Industry: Energy > Oil & Gas (0.94)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Knowledge Management (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)

Similarity in Cognition: A Review of Similarity and Analogical Reasoning

Sun, Ron

Analogical although analogy can help, as note that although still in its infancy reasoning is thus achieved in such well as hamper, learning. The role of and somewhat simplistic in character, systems by mainly keeping the analogy in learning is discussed by connectionist research might prove abstract relational microfeatures. Ann Brown and by Rand Spiro et al., to have an edge in tackling these Rumelhart proposes another way and the role of analogy in knowledge problems. The research described in for achieving analogical reasoning, acquisition is discussed by Brian Ross this book presents a grand challenge that is, "soft clamp," in which input and by John Bransford et al.; Stella and a future prospect for AI clamps can be overridden, and the Vosniadou studies the developmental researchers (traditional or connectionistic) rule of thumb is that the more concrete change in the use of analogy. Because in their endeavor to find a a feature is, the easier it can be part 3 of the book is of marginal better and more cognitively plausible overridden. The system finds the interest to AI, I do not discuss it any representation scheme.

book review, neural network, representation, (17 more...)

Country: North America > United States > Minnesota > Hennepin County > Minneapolis (0.15)

Genre:

Summary/Review (1.00)
Collection > Book (0.49)
Research Report > Experimental Study (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Analogical Reasoning (0.87)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.68)

Hanks, Steve, Pollack, Martha E., Cohen, Paul R.

Benchmarks, Test Beds, Controlled Experimentation, and the Design of Agent Architectures

The methodological underpinnings of AI are slowly changing. Benchmarks, test beds, and controlled experimentation are becoming more common. Although we are optimistic that this change can solidify the science of AI, we also recognize a set of difficult issues concerning the appropriate use of this methodology. We discuss these issues as they relate to research on agent design. We survey existing test beds for agents and argue for appropriate caution in their use. We end with a debate on the proper role of experimental methodology in the design and validation of planning agents.

agent, planning & scheduling, us government, (18 more...)