Not enough data to create a plot.
Try a different view from the menu above.
Miikkulainen, Risto
Evolving Adaptive Poker Players for Effective Opponent Exploitation
Li, Xun (The University of Texas at Austin) | Miikkulainen, Risto (The University of Texas at Austin)
In many imperfect information games, the ability to exploit the opponent is crucial for achieving high performance. For instance, skilled poker players usually capitalize on various weaknesses in their opponentsโ playing patterns and styles to maximize their earnings. Therefore, it is important to enable computer players in such games to identify flaws in opponent strategies and adapt their behaviors to exploit these flaws. This paper presents a genetic algorithm to evolve adaptive LSTM (Long Short Term Memory) poker players featuring effective opponent exploitation. Experimental results in heads-up no-limit Texas Holdโem demonstrate that adaptive LSTM players are able to obtain 40% to 1360% more earnings than cutting-edge game theoretic poker players against opponents with various flawed strategies. In addition, experimental results indicate that adaptive LSTM players evolved through playing against simple and weak rule-based opponents can achieve comparable performance against top game-theoretic poker players. The approach introduced in this paper is a promising start for building adaptive computer players for imperfect information games.
Object-Model Transfer in the General Video Game Domain
Braylan, Alexander Eric (University of Texas at Austin) | Miikkulainen, Risto (University of Texas at Austin)
A transfer learning approach is presented to address the challenge of training video game agents with limited data. The approach decomposes games into objects, learns object models, and transfers models from known games to unfamiliar games to guide learning. Experiments show that the approach improves prediction accuracy over a comparable control, leading to more efficient exploration. Training of game agents is thus accelerated by transferring object models from previously learned games.
Reuse of Neural Modules for General Video Game Playing
Braylan, Alexander (The University of Texas at Austin) | Hollenbeck, Mark (The University of Texas at Austin) | Meyerson, Elliot (The University of Texas at Austin) | Miikkulainen, Risto (The University of Texas at Austin)
A general approach to knowledge transfer is introduced in which an agent controlled by a neural network adapts how it reuses existing networks as it learns in a new domain. Networks trained for a new domain can improve their performance by routing activation selectively through previously learned neural structure, regardless of how or for what it was learned. A neuroevolution implementation of this approach is presented with application to high-dimensional sequential decision-making domains. This approach is more general than previous approaches to neural transfer for reinforcement learning. It is domain-agnostic and requires no prior assumptions about the nature of task relatedness or mappings. The method is analyzed in a stochastic version of the Arcade Learning Environment, demonstrating that it improves performance in some of the more complex Atari 2600 games, and that the success of transfer can be predicted based on a high-level characterization of game dynamics.
Neural-Symbolic Learning and Reasoning: Contributions and Challenges
Garcez, Artur d' (City University London) | Avila (Universitaet Onsnabrueck) | Besold, Tarek R. (KU Leuven) | Raedt, Luc de (University of St. Andrews) | Fรถldiak, Peter (Wright State University) | Hitzler, Pascal (Stanford University) | Icard, Thomas (Universitaet Osnabrueck) | Kรผhnberger, Kai-Uwe (Institute of Informatics, UFRGS) | Lamb, Luis C. (University of Texas at Austin) | Miikkulainen, Risto (Acadia University) | Silver, Daniel L.
The goal of neural-symbolic computation is to integrate robust connectionist learning and sound symbolic reasoning. With the recent advances in connectionist learning, in particular deep neural networks, forms of representation learning have emerged. However, such representations have not become useful for reasoning. Results from neural-symbolic computation have shown to offer powerful alternatives for knowledge representation, learning and reasoning in neural computation. This paper recalls the main contributions and discusses key challenges for neural-symbolic integration which have been identified at a recent Dagstuhl seminar.
Frame Skip Is a Powerful Parameter for Learning to Play Atari
Braylan, Alex (The University of Texas at Austin) | Hollenbeck, Mark (The University of Texas at Austin) | Meyerson, Elliot (The University of Texas at Austin) | Miikkulainen, Risto (The University of Texas at Austin)
We show that setting a reasonable frame skip can be critical to the performance of agents learning to play Atari 2600 games. In all of the six games in our experiments, frame skip is a strong determinant of success. For two of these games, setting a large frame skip leads to state-of-the-art performance.
GRADE: Machine Learning Support for Graduate Admissions
Waters, Austin (University of Texas at Austin) | Miikkulainen, Risto (University of Texas at Austin)
This article describes GRADE, a statistical machine learning system developed to support the work of the graduate admissions committee at the University of Texas at Austin Department of Computer Science (UTCS). In recent years, the number of applications to the UTCS PhD program has become too large to manage with a traditional review process. GRADE makes the review process more efficient by enabling reviewers to spend most of their time on applicants near the decision boundary and by focusing their attention on parts of each applicant's file that matter the most. An evaluation over two seasons of PhD admissions indicates that the system leads to dramatic time savings, reducing the total time spent on reviews by at least 74 percent.
GRADE: Machine Learning Support for Graduate Admissions
Waters, Austin (University of Texas at Austin) | Miikkulainen, Risto (University of Texas at Austin)
This article describes GRADE, a statistical machine learning system developed to support the work of the graduate admissions committee at the University of Texas at Austin Department of Computer Science (UTCS). In recent years, the number of applications to the UTCS PhD program has become too large to manage with a traditional review process. GRADE uses historical admissions data to predict how likely the committee is to admit each new applicant. It reports each prediction as a score similar to those used by human reviewers, and accompanies each by an explanation of what applicant features most influenced its prediction. GRADE makes the review process more efficient by enabling reviewers to spend most of their time on applicants near the decision boundary and by focusing their attention on parts of each applicantโs file that matter the most. An evaluation over two seasons of PhD admissions indicates that the system leads to dramatic time savings, reducing the total time spent on reviews by at least 74 percent.
GRADE: Machine Learning Support for Graduate Admissions
Waters, Austin (University of Texas at Austin) | Miikkulainen, Risto (University of Texas at Austin)
This paper describes GRADE, a statistical machine learning system developed to support the work of the graduate admissions committee at the University of Texas at Austin Department of Computer Science (UTCS). In recent years, the number of applications to the UTCS PhD program has become too large to manage with a traditional review process. GRADE uses historical admissions data to predict how likely the committee is to admit each new applicant. It reports each prediction as a score similar to those used by human reviewers, and accompanies each by an explanation of what applicant features most influenced its prediction. GRADE makes the review process more efficient by enabling reviewers to spend most of their time on applicants near the decision boundary and by focusing their attention on parts of each applicantโs file that matter the most. An evaluation over two seasons of PhD admissions indicates that the system leads to dramatic time savings, reducing the total time spent on reviews by at least 74%.
Latent Class Models for Algorithm Portfolio Methods
Silverthorn, Bryan (The University of Texas at Austin) | Miikkulainen, Risto (The University of Texas at Austin)
Different solvers for computationally difficult problems such as satisfiability (SAT) perform best on different instances. Algorithm portfolios exploit this phenomenon by predicting solvers' performance on specific problem instances, then shifting computational resources to the solvers that appear best suited. This paper develops a new approach to the problem of making such performance predictions: natural generative models of solver behavior. Two are proposed, both following from an assumption that problem instances cluster into latent classes: a mixture of multinomial distributions, and a mixture of Dirichlet compound multinomial distributions. The latter model extends the former to capture burstiness, the tendency of solver outcomes to recur. These models are integrated into an algorithm portfolio architecture and used to run standard SAT solvers on competition benchmarks. This approach is found competitive with the most prominent existing portfolio, SATzilla, which relies on domain-specific, hand-selected problem features; the latent class models, in contrast, use minimal domain knowledge. Their success suggests that these models can lead to more powerful and more general algorithm portfolio methods.
Intrusion Detection with Neural Networks
Ryan, Jake, Lin, Meng-Jang, Miikkulainen, Risto