Goto

Collaborating Authors

 critical position





Game Solving with Online Fine-Tuning

arXiv.org Artificial Intelligence

Game solving is a similar, yet more difficult task than mastering a game. Solving a game typically means to find the game-theoretic value (outcome given optimal play), and optionally a full strategy to follow in order to achieve that outcome. The AlphaZero algorithm has demonstrated super-human level play, and its powerful policy and value predictions have also served as heuristics in game solving. However, to solve a game and obtain a full strategy, a winning response must be found for all possible moves by the losing player. This includes very poor lines of play from the losing side, for which the AlphaZero self-play process will not encounter. AlphaZero-based heuristics can be highly inaccurate when evaluating these out-of-distribution positions, which occur throughout the entire search. To address this issue, this paper investigates applying online fine-tuning while searching and proposes two methods to learn tailor-designed heuristics for game solving. Our experiments show that using online fine-tuning can solve a series of challenging 7x7 Killall-Go problems, using only 23.54% of computation time compared to the baseline without online fine-tuning. Results suggest that the savings scale with problem size. Our method can further be extended to any tree search algorithm for problem solving. Our code is available at https://rlg.iis.sinica.edu.tw/papers/neurips2023-online-fine-tuning-solver.


In Manufacturing, Closing A Skills Gap Requires Fixing An Image Problem

Huffington Post - Tech news and opinion

For starters, technology careers in our industry are flourishing. The digital transformation of manufacturing has opened doors to STEM-rooted careers in the fields of software development, mechanical engineering and computational data sciences. This expertise is absolutely critical in further advancing both conventional manufacturing processes, like injection molding and CNC machining as well as industrial 3D printing. The proliferation of industrial robotic equipment, automation controls, digital scanning devices, and M2M learnings has and will continue to create an increased demand for a highly-skilled workforce capable of operating and maintaining this type of equipment. What's more, a career in manufacturing can be lucrative.


The Propagation Depth of Local Consistency

arXiv.org Artificial Intelligence

We establish optimal bounds on the number of nested propagation steps in $k$-consistency tests. It is known that local consistency algorithms such as arc-, path- and $k$-consistency are not efficiently parallelizable. Their inherent sequential nature is caused by long chains of nested propagation steps, which cannot be executed in parallel. This motivates the question "What is the minimum number of nested propagation steps that have to be performed by $k$-consistency algorithms on (binary) constraint networks with $n$ variables and domain size $d$?" It was known before that 2-consistency requires $\Omega(nd)$ and 3-consistency requires $\Omega(n^2)$ sequential propagation steps. We answer the question exhaustively for every $k\geq 2$: there are binary constraint networks where any $k$-consistency procedure has to perform $\Omega(n^{k-1}d^{k-1})$ nested propagation steps before local inconsistencies were detected. This bound is tight, because the overall number of propagation steps performed by $k$-consistency is at most $n^{k-1}d^{k-1}$.