AITopics | comparison training

Connectionist Learning of Expert Preferences by Comparison Training

Neural Information Processing SystemsApr-6-2023, 19:59:00 GMT

A new training paradigm, caned the "eomparison pa.radigm," is introduced for tasks in which a. network must learn to choose a prdcrred pattern from a set of n alternatives, based on examplcs of Imma.n expert prderences. In this pa.radigm, the inpu t to the network consists of t.wo uf the n alterna tives, and the trained output is the expert's judgement of which pa.ttern is better. This para.digm is applied to the lea,rning of hackgammon, a difficult board ga.me in wllieh the expert selects a move from a. set, of legal mm·es. Furthermorf', it is possible to set up the network so tha.t it always produces consisten t rank-orderings .

comparison training, connectionist learning, expert preference, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

Comparison Training for a Rescheduling Problem in Neural Networks

Neural Information Processing SystemsApr-6-2023, 18:57:41 GMT

Many events such as flight delays or the absence of a member require the crew pool rescheduling team to change the initial schedule (rescheduling). In this paper, we show that the neural network comparison paradigm applied to the backgammon game by Tesauro (Tesauro and Se(cid:173) jnowski, 1989) can also be applied to the rescheduling problem of an aircrew pool. Indeed both problems correspond to choosing the best solut.ion The paper explains from a math(cid:173) ematical point of view the architecture and the learning strategy of the backpropagation neural network used for the best choice prob(cid:173) lem. We also show how the learning phase of the network can be accelerated.

comparison training, neural network, rescheduling problem, (1 more...)

Neural Information Processing Systems

Industry: Transportation > Air (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Large-Scale Optimization for Evaluation Functions with Minimax Search

Hoki, K., Kaneko, T.

Journal of Artificial Intelligence ResearchMar-30-2014

This paper presents a new method, Minimax Tree Optimization (MMTO), to learn a heuristic evaluation function of a practical alpha-beta search program. The evaluation function may be a linear or non-linear combination of weighted features, and the weights are the parameters to be optimized. To control the search results so that the move decisions agree with the game records of human experts, a well-modeled objective function to be minimized is designed. Moreover, a numerical iterative method is used to nd local minima of the objective function, and more than forty million parameters are adjusted by using a small number of hyper parameters. This method was applied to shogi, a major variant of chess in which the evaluation function must handle a larger state space than in chess. Experimental results show that the large-scale optimization of the evaluation function improves the playing strength of shogi programs, and the new method performs signicantly better than other methods. Implementation of the new method in our shogi program Bonanza made substantial contributions to the program's rst-place nish in the 2013 World Computer Shogi Championship. Additionally, we present preliminary evidence of broader applicability of our method to other two-player games such as chess.

evaluation function, mmto, partial derivative, (14 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4217

AI Access Foundation

10871

Journal of Artificial Intelligence Research

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
North America > United States > New York > Suffolk County > Commack (0.04)
(4 more...)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment > Games > Chess (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback

Comparison Training for a Rescheduling Problem in Neural Networks

Keymeulen, Didier, Gerlache, Martine de

Neural Information Processing SystemsDec-31-1994

Many events such as flight delays or the absence of a member require the crew pool rescheduling team to change the initial schedule (rescheduling). In this paper, we show that the neural network comparison paradigm applied to the backgammon game by Tesauro (Tesauro and Sejnowski, 1989) can also be applied to the rescheduling problem of an aircrew pool. Indeed both problems correspond to choosing the best solut.ion

best choice problem, best profile, neural network, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Mateo County > San Mateo (0.05)
Europe > France (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Industry:

Transportation > Air (0.51)
Leisure & Entertainment > Games (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Comparison Training for a Rescheduling Problem in Neural Networks

Keymeulen, Didier, Gerlache, Martine de

Neural Information Processing SystemsDec-31-1994

Many events such as flight delays or the absence of a member require the crew pool rescheduling team to change the initial schedule (rescheduling). In this paper, we show that the neural network comparison paradigm applied to the backgammon game by Tesauro (Tesauro and Sejnowski, 1989) can also be applied to the rescheduling problem of an aircrew pool. Indeed both problems correspond to choosing the best solut.ion

best choice problem, best profile, neural network, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Mateo County > San Mateo (0.05)
Europe > France (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Industry:

Transportation > Air (0.51)
Leisure & Entertainment > Games (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Comparison Training for a Rescheduling Problem in Neural Networks

Keymeulen, Didier, Gerlache, Martine de

Neural Information Processing SystemsDec-31-1994

Many events such as flight delays or the absence of a member require the crew pool rescheduling team to change the initial schedule (rescheduling). In this paper, we show that the neural network comparison paradigm applied to the backgammon game by Tesauro (Tesauro and Sejnowski, 1989)can also be applied to the rescheduling problem of an aircrew pool. Indeed both problems correspond to choosing the best solut.ion

artificial intelligence, machine learning, neural network, (16 more...)

Neural Information Processing Systems

Country:

Europe > Belgium (0.15)
North America > United States (0.14)

Industry:

Transportation > Air (0.51)
Leisure & Entertainment > Games (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Connectionist Learning of Expert Preferences by Comparison Training

Tesauro, Gerald

Neural Information Processing SystemsDec-31-1989

A new training paradigm, caned the "eomparison pa.radigm," is introduced for tasks in which a. network must learn to choose a prdcrred pattern from a set of n alternatives, based on examplcs of Imma.n expert prderences. In this pa.radigm, the inpu t to the network consists of t.wo uf the n alterna tives, and the trained output is the expert's judgement of which pa.ttern is better. This para.digm is applied to the lea,rning of hackgammon, a difficult board ga.me in wllieh the expert selects a move from a. set, of legal mm·es.

alterna, compa, connectionist learning, (12 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Connectionist Learning of Expert Preferences by Comparison Training

Tesauro, Gerald

Neural Information Processing SystemsDec-31-1989

A new training paradigm, caned the "eomparison pa.radigm," is introduced for tasks in which a. network must learn to choose a prdcrred pattern from a set of n alternatives, based on examplcs of Imma.n expert prderences. In this pa.radigm, the inpu t to the network consists of t.wo uf the n alterna tives, and the trained output is the expert's judgement of which pa.ttern is better. This para.digm is applied to the lea,rning of hackgammon, a difficult board ga.me in wllieh the expert selects a move from a. set, of legal mm·es.

alterna, compa, connectionist learning, (12 more...)

Neural Information Processing Systems

Country: North America > United States (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback