Edmonton
Using Payoff-Similarity to Speed Up Search
Furtak, Timothy (University of Alberta) | Buro, Michael (University of Alberta)
Transposition tables are a powerful tool in search domains for avoiding duplicate effort and for guiding node expansions. Traditionally, however, they have only been applicable when the current state is exactly the same as a previously explored state. We consider a generalized transposition table, whereby a similarity metric that exploits local structure is used to compare the current state with a neighbourhood of previously seen states. We illustrate this concept and forward pruning based on function approximation in the domain of Skat, and show that we can achieve speedups of 16+ over standard methods.
Accelerating Best Response Calculation in Large Extensive Games
Johanson, Michael (University of Alberta) | Waugh, Kevin (Carnegie Mellon University) | Bowling, Michael (University of Alberta) | Zinkevich, Martin (Yahoo! Research)
One fundamental evaluation criteria of an AI technique is its performance in the worst-case. For static strategies in extensive games, this can be computed using a best response computation. Conventionally, this requires a full game tree traversal. For very large games, such as poker, that traversal is infeasible to perform on modern hardware. In this paper, we detail a general technique for best response computations that can often avoid a full game tree traversal. Additionally, our method is specifically well-suited for parallel environments. We apply this approach to computing the worst-case performance of a number of strategies in heads-up limit Texas hold'em, which, prior to this work, was not possible. We explore these results thoroughly as they provide insight into the effects of abstraction on worst-case performance in large imperfect information games. This is a topic that has received much attention, but could not previously be examined outside of toy domains.
Recap of the 2010 AI and Interactive Digital Entertainment Conference
Youngblood, G. Michael (University of North Carolina, Charlotte) | Bulitko, Vadim (University of Alberta) | Weber, Ben (University of California, Santa Cruz)
AIIDE 2010 was held October 11-13, 2010, at Stanford University ajacent to Palo Alto, California. The conference featured 17 paper presentations, 18 posters, 5 demos, 5 invited speakers, a panel on teaching game AI in academe, and the first StarCraft AI competition. Led by the conference chair, Michael Youngblood (University of North Carolina at Charlotte), and the program chair, Vadim Bulitko (University of Alberta), the three days of AIIDE contained a dense and exciting agenda highlighting new research and revealing how AI is applied in many commercial endeavors. The first day was kicked off with an invited talk from Chris Jurney, lead developer of Double Fine Productions, who detailed his work on the nonplayer character pathfinding of Dawn of War II during his time at Relic Entertainment. The morning was completed by research presentations on behavioral techniques with notable work on producing realistic behaviors through alibi generation (Ben Sunshine-Hill and Norman Badler, University of Pennsylvania), which has been widely discussed in the community since, and Ben Weber's (University of California, Santa Cruz) work applying goal-driven autonomy to playing StarCraft (awarded AIIDE 2010 Best Student Paper).
Abstract: Block A* and Any-Angle Path-Planning
Yap, Peter Kai Yue (University of Alberta) | Burch, Neil (University of Alberta) | Holte, Robert C. (University of Alberta) | Schaeffer, Jonathan (University of Alberta)
We present three new ideas for grid-based path-planning algorithms that improve the search speed and quality of the paths found. First, we introduce a new type of database, the Local Distance Database (LDDB), that contains distances between boundary points of a local neighborhood. Second, an LDDB-based algorithm is introduced, called Block A*, that calculates the optimal path between start and goal locations given the local distances stored in the LDDB. Third, our experimental results for any-angle path planning in a wide varietyof test domains, including real game maps, show that Block A* is faster than both A* and the previously best grid-based any-angle search algorithm, Theta*.
Probably Approximately Correct Heuristic Search
Stern, Roni (Ben Gurion University) | Felner, Ariel (Ben Gurion University) | Holte, Robert (University of Alberta)
A* is a best-first search algorithm that returns an optimal solution. w-admissible algorithms guarantee that the returned solution is no larger than w times the optimal solution. In this paper we introduce a generalization of the w-admissibility concept that we call PAC search, which is inspired by the PAC learning framework in Machine Learning. The task of a PAC search algorithm is to find a solution that is w-admissible with high probability. In this paper we formally define PAC search, and present a framework for PAC search algorithms that can work on top of any search algorithm that produces a sequence of solutions. Experimental results on the 15-puzzle demonstrate that our framework activated on top of Anytime Weighted A* (AWA*) expands significantly less nodes than regular AWA* while returning solutions that have almost the same quality.
State-Set Search
Pang, Bo (University of Alberta) | Holte, Robert C. (University of Alberta)
State-set search is state space search when the states being manipulated by the search algorithm are sets of states from some underlying state space. State-set search arises commonly in planning and abstraction systems, but this paper provides the first formal, general analysis of state-set search. We show that the state-set distance computed by planning systems is different than that computed by abstraction systems and introduce a distance in between the two, dww, the maximum admissible distance. We introduce the concept of a multi-abstraction, which maps a state to more than one abstract state in the same abstract space, describe the first implementation of a multi-abstraction system that computes dww, and give initial experimental evidence that it can be superior to domain abstraction.
Improved Prediction of IDA*'s Performance via Epsilon-Truncation
Lelis, Levi (University of Alberta) | Zilles, Sandra (University of Regina) | Holte, Robert Craig (University of Alberta)
Korf, Reid, and Edelkamp launched a line of research aimed at predicting how many nodes IDA* will expand with a given cost bound. This paper advances this line of research in three ways. First, we identify a source of prediction error that has hitherto been overlooked. We call it the ``discretization effect''. Second, we disprove the intuitively appealing idea that a ``more informed'' prediction system cannot make worse predictions than a ``less informed'' one. More informed systems are more susceptible to the discretization effect, and in several of our experiments the more informed system makes poorer predictions. Our third contribution is a method, called ``$\epsilon$-truncation'', which makes a prediction system less informed, in a carefully chosen way, so as to improve its predictions by reducing the discretization effect. In our experiments $\epsilon$-truncation rarely degraded predictions; in the vast majority of cases it improved predictions, often substantially.
Predicting Solution Cost with Conditional Probabilities
Lelis, Levi (University of Alberta) | Stern, Roni (Ben Gurion University) | Arfaee, Shahab Jabbari (University of Alberta)
Classical heuristic search algorithms find the solution cost of a problem while finding the path from the start state to a goal state. However, there are applications in which finding the path is not needed. In this paper we propose an algorithm that accurately and efficiently predicts the solution cost of a problem without finding the actual solution. We show empirically that our predictor makes more accurate predictions when compared to the bootstrapped heuristic, which is known to be a very accurate inadmissible heuristic. In addition, we show how our prediction algorithm can be used to enhance heuristic search algorithms. Namely, we use our predictor to calculate a bound for a bounded best-first search algorithm and to tune the w-value of Weighted IDA*. In both cases major search speedups were observed.
Faster Optimal and Suboptimal Hierarchical Search
Leighton, Michael J. (University of New Hampshire) | Ruml, Wheeler ( University of New Hampshire ) | Holte, Robert C. (University of Alberta)
In problem domains for which an informed admissible heuristic function is not available, one attractive approach is hierarchical search. Hierarchical search uses search in an abstracted version of the problem to dynamically generate heuristic values. This paper makes two contributions to hierarchical search. First, we propose a simple modification to the state-of-the-art algorithm Switchback that reduces the number of expansions (and hence the running time) by approximately half, while maintaining its guarantee of optimality. Second, we propose a new algorithm for suboptimal hierarchical search, called Switch. Empirical results suggest that Switch yields faster search than straightforward modifications of Switchback, such as weighting the heuristic or greedy search. The success of Switch illustrates the potential for further research on specifically suboptimal hierarchical search.
A Polynomial-Time Algorithm for Non-Optimal Multi-Agent Pathfinding
Khorshid, Mokhtar M. (University of Alberta) | Holte, Robert C. (University of Alberta) | Sturtevant, Nathan R. (University of Denver)
Multi-agent pathfinding, where multiple agents must travel to their goal locations without getting stuck, has been studied in both theoretical and practical contexts, with a variety of both optimal and sub-optimal algorithms proposed for solving problems. Recent work has shown that there is a linear-time check for whether a multi-agent pathfinding problem can be solved in a tree, however this was not used to actually produce solutions. In this paper we provide a constructive proof of how to solve multi-agent pathfinding problems in a tree that culminates in a novel approach that we call the tree-based agent swapping strategy (TASS). Experimental results showed that TASS can find solutions to the multi-agent pathfinding problem on a highly crowded tree with 1000 nodes and 996 agents in less than 8 seconds. These results are far more efficient and general than existing work, suggesting that TASS is a productive line of study for multi-agent pathfinding.