Search in Imperfect Information Games
arXiv.org Artificial Intelligence
From the very dawn of the field, search with value functions has been a fundamental concept in computer games research. Turing's chess algorithm from 1950 could think two moves ahead, and Shannon's work on chess from 1950 includes an extensive section on evaluation functions to be used within a search. Samuel's checkers program from 1959 already combined search with value functions learned through self-play and bootstrapping. TD-Gammon improved on those ideas, using neural networks to learn such complex value functions, which were again used within search. The combination of decision-time search and value functions has been present in the remarkable milestones where computers bested their human counterparts in long-standing, challenging games: Deep Blue for chess and AlphaGo for Go. Until recently, this powerful framework of search aided by (learned) value functions was limited to perfect-information games. Because many interesting problems do not give the agent perfect information about the environment, this was an unfortunate limitation. This thesis introduces the reader to sound search for imperfect-information games.
Nov-10-2021
- Country:
- North America
- United States
- Texas (0.04)
- New York (0.04)
- Michigan (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Massachusetts > Middlesex County
- Belmont (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- Trinidad and Tobago > Trinidad
- Canada > Alberta
- Europe
- Czechia > Prague (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Genre:
- Research Report > New Finding (0.92)
- Industry:
- Leisure & Entertainment > Games
- Computer Games (1.00)
- Chess (1.00)
- Technology:
- Information Technology
- Game Theory (1.00)
- Artificial Intelligence
- Games > Poker (1.00)
- Representation & Reasoning
- Search (1.00)
- Agents (1.00)
- Optimization (0.92)
- Machine Learning
- Statistical Learning (1.00)
- Reinforcement Learning (1.00)
- Neural Networks > Deep Learning (0.67)