 Pollack, Jordan B.


Why did TD-Gammon Work?

Neural Information Processing Systems

Although TD-Gammon is one of the major successes in machine learning, it has not led to similarly impressive breakthroughs in temporal difference learning for other applications or even other games. We were able to replicate some of the success of TD-Gammon, developing a competitive evaluation function on a 4000-parameter feed-forward neural network, without using back-propagation, reinforcement, or temporal difference learning methods. Instead we apply simple hill-climbing in a relative fitness environment. These results and further analysis suggest that the surprising success of Tesauro's program had more to do with the co-evolutionary structure of the learning task and the dynamics of the backgammon game itself.

1 INTRODUCTION

It took great chutzpah for Gerald Tesauro to start wasting computer cycles on temporal difference learning in the game of Backgammon (Tesauro, 1992). After all, the dream of computers mastering a domain by self-play or "introspection" had been around since the early days of AI, forming part of Samuel's checker player (Samuel, 1959) and used in Donald Michie's MENACE tic-tac-toe learner (Michie, 1961). However, such self-conditioning systems, with weak or nonexistent internal representations, had generally been limited by problems of scale and abandoned by the field of AI.
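
A minimal sketch of the kind of hill-climbing in a relative fitness environment the abstract describes (Python; the mutation scale, match length, and the conservative averaging step are illustrative assumptions, and play_game is a toy stand-in for a real backgammon match, not the authors' code):

    import numpy as np

    def play_game(a, b, rng):
        # Toy stand-in for a full backgammon game between two evaluation
        # networks; returns 1 if `a` wins. A real run would play a game
        # out using each weight vector as the position evaluator.
        target = np.linspace(-1.0, 1.0, a.size)   # arbitrary "skill" proxy
        noise = rng.normal(0.0, 1.0)              # games are stochastic
        return int(np.sum((b - target)**2) - np.sum((a - target)**2) + noise > 0)

    def hillclimb_selfplay(n_params=4000, generations=5000,
                           sigma=0.05, n_games=4, seed=0):
        rng = np.random.default_rng(seed)
        champion = rng.normal(0.0, 0.1, n_params)  # evaluation-net weights
        for _ in range(generations):
            mutant = champion + rng.normal(0.0, sigma, n_params)
            # Relative fitness: no external score, only head-to-head play.
            wins = sum(play_game(mutant, champion, rng) for _ in range(n_games))
            if wins > n_games // 2:
                champion += 0.05 * (mutant - champion)  # move toward winner
        return champion

The point of the setup is that fitness is defined only relative to the current champion, so the learner and the environment it is evaluated against co-evolve.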


Structural and Behavioral Evolution of Recurrent Networks

Neural Information Processing Systems

This paper introduces GNARL, an evolutionary program which induces recurrent neural networks that are structurally unconstrained. In contrast to constructive and destructive algorithms, GNARL employs a population of networks and uses a fitness function's unsupervised feedback to guide search through network space. Annealing is used in generating both gaussian weight changes and structural modifications. Applying GNARL to a complex search and collection task demonstrates that the system is capable of inducing networks with complex internal dynamics.
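
A sketch of the annealed mutation idea (Python): tying the size of gaussian weight perturbations and the number of structural edits to how far a network's fitness is from the best attainable follows the abstract, but the particular temperature formula and step sizes here are assumptions:

    import numpy as np

    def anneal_mutate(weights, fitness, f_max, rng,
                      w_scale=1.0, max_edits=3):
        # Instantaneous temperature: poor networks are "hot" and receive
        # large mutations; near-optimal networks are perturbed gently.
        temperature = 1.0 - fitness / f_max
        new_weights = weights + rng.normal(0.0, w_scale * temperature,
                                           weights.shape)
        # Structural search: the same temperature also scales how many
        # nodes or links get added or deleted this generation.
        n_edits = int(rng.integers(0, int(np.ceil(max_edits * temperature)) + 1))
        return new_weights, n_edits

Because both kinds of change shrink as fitness improves, the population settles into fine-tuning without the network topology ever being fixed in advance.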


Language Induction by Phase Transition in Dynamical Recognizers

Neural Information Processing Systems

A higher order recurrent neural network architecture learns to recognize and generate languages after being "trained" on categorized exemplars. Studying these networks from the perspective of dynamical systems yields two interesting discoveries: First, a longitudinal examination of the learning process illustrates a new form of mechanical inference: Induction by phase transition. A small weight adjustment causes a "bifurcation" in the limit behavior of the network.
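
The higher-order architecture can be read as an input-driven dynamical system: each symbol selects, via the second-order weights, the squashed affine map applied to the state. A minimal recognizer in that style (the shapes, bias handling, and accept rule are assumptions for illustration):

    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    def recognize(W, z0, string):
        # W[sym] is an (n_state, n_state + 1) matrix: with one-hot input,
        # second-order weights reduce to one transition map per symbol.
        z = z0
        for sym in string:                           # sym indexes the alphabet
            z = sigmoid(W[sym] @ np.append(z, 1.0))  # iterate the map
        return z[0] > 0.5                            # accept/reject from one unit

Under this reading, a small change to W can switch the iterated map between qualitatively different limit behaviors, which is the bifurcation the abstract refers to.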


Back Propagation is Sensitive to Initial Conditions

Neural Information Processing Systems

This paper explores the effect of initial weight selection on feed-forward networks learning simple functions with the back-propagation technique.
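
One way to see the effect is to train the same tiny network on XOR from many random initial weight draws and record when, or whether, each run converges. Everything below (architecture, learning rate, weight range, convergence test) is an illustrative assumption, not the paper's protocol:

    import numpy as np

    def train_xor(seed, hidden=2, lr=1.0, epochs=10000, tol=0.1):
        # Plain batch back-propagation on XOR; returns the epoch at which
        # every output is within `tol` of its target, or None on failure.
        rng = np.random.default_rng(seed)
        X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
        T = np.array([[0], [1], [1], [0]], dtype=float)
        W1 = rng.uniform(-0.5, 0.5, (2, hidden)); b1 = np.zeros(hidden)
        W2 = rng.uniform(-0.5, 0.5, (hidden, 1)); b2 = np.zeros(1)
        sig = lambda x: 1.0 / (1.0 + np.exp(-x))
        for epoch in range(epochs):
            H = sig(X @ W1 + b1)
            Y = sig(H @ W2 + b2)
            if np.all(np.abs(Y - T) < tol):
                return epoch
            dY = (Y - T) * Y * (1 - Y)        # output-layer delta
            dH = (dY @ W2.T) * H * (1 - H)    # hidden-layer delta
            W2 -= lr * H.T @ dY; b2 -= lr * dY.sum(axis=0)
            W1 -= lr * X.T @ dH; b1 -= lr * dH.sum(axis=0)
        return None

    # Different seeds give wildly different convergence times, and some
    # runs never converge at all from this weight range.
    print([train_xor(s) for s in range(10)])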


Implications of Recursive Distributed Representations

Neural Information Processing Systems

I will describe my recent results on the automatic development of fixed-width recursive distributed representations of variable-sized hierarchical data structures. One implication of this work is that certain types of AI-style data structures can now be represented in fixed-width analog vectors. Simple inferences can be performed using the type of pattern associations that neural networks excel at. Another implication arises from noting that these representations become self-similar in the limit. Once this door to chaos is opened.
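
The mechanism behind these representations is an autoencoder applied recursively: two fixed-width child vectors are compressed into one fixed-width parent vector, so arbitrarily shaped binary trees fold into a single vector. A skeletal version follows (the random weights are stand-ins for illustration; in practice the compressor and reconstructor are trained jointly over the corpus of trees):

    import numpy as np

    def make_raam(width, seed=0):
        # Compressor squeezes two `width`-vectors into one; reconstructor
        # expands one back into two. Random weights stand in for training.
        rng = np.random.default_rng(seed)
        We = rng.normal(0.0, 0.5, (2 * width + 1, width))   # +1 bias row
        Wd = rng.normal(0.0, 0.5, (width + 1, 2 * width))
        sig = lambda x: 1.0 / (1.0 + np.exp(-x))
        enc = lambda l, r: sig(np.append(np.concatenate([l, r]), 1.0) @ We)
        dec = lambda p: np.split(sig(np.append(p, 1.0) @ Wd), 2)
        return enc, dec

    def encode_tree(tree, enc):
        # Fold a binary tree (tuples of tuples, with `width`-vector leaves)
        # into one fixed-width vector, whatever the tree's size or shape.
        if isinstance(tree, tuple):
            return enc(encode_tree(tree[0], enc), encode_tree(tree[1], enc))
        return tree

    enc, dec = make_raam(width=8)
    leaf = lambda i: np.eye(8)[i]                          # toy terminal codes
    v = encode_tree((leaf(0), (leaf(1), leaf(2))), enc)    # (A (B C)) -> R^8

Decoding applies dec repeatedly to unfold the vector back into children; the self-similarity the abstract notes arises from iterating these maps toward their limit.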