AITopics

1901.01855

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Kroer, Christian, Sandholm, Tuomas

Limited Lookahead in Imperfect-Information Games

arXiv.org Artificial IntelligenceFeb-17-2019

Limited lookahead has been studied for decades in complete-information games. We initiate a new direction via two simultaneous deviation points: generalization to incomplete-information games and a game-theoretic approach. We study how one should act when facing an opponent whose lookahead is limited. We study this for opponents that differ based on their lookahead depth, based on whether they, too, have incomplete information, and based on how they break ties. We characterize the hardness of finding a Nash equilibrium or an optimal commitment strategy for either player, showing that in some of these variations the problem can be solved in polynomial time while in others it is PPAD-hard or NP-hard. We proceed to design algorithms for computing optimal commitment strategies---for when the opponent breaks ties favorably, according to a fixed rule, or adversarially. We then experimentally investigate the impact of limited lookahead. The limited-lookahead player often obtains the value of the game if she knows the expected values of nodes in the game tree for some equilibrium---but we prove this is not sufficient in general. Finally, we study the impact of noise in those estimates and different lookahead depths. This uncovers an incomplete-information game lookahead pathology.

artificial intelligence, information, machine learning, (17 more...)

1902.06335

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > Texas (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.67)

arXiv.org Artificial IntelligenceFeb-16-2019

Re-determinizing Information Set Monte Carlo Tree Search in Hanabi

Goodman, James

This technical report documents the winner of the Computational Intelligence in Games(CIG) 2018 Hanabi competition. We introduce Re-determinizing IS-MCTS, a novel extension of Information Set Monte Carlo Tree Search (IS-MCTS) \cite{IS-MCTS} that prevents a leakage of hidden information into opponent models that can occur in IS-MCTS, and is particularly severe in Hanabi. Re-determinizing IS-MCTS scores higher in Hanabi for 2-4 players than previously published work. Given the 40ms competition time limit per move we use a learned evaluation function to estimate leaf node values and avoid full simulations during MCTS. For the Mixed track competition, in which the identity of the other players is unknown, a simple Bayesian opponent model is used that is updated as each game proceeds.

artificial intelligence, information, machine learning, (18 more...)

1902.06075

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment > Games (1.00)
Leisure & Entertainment > Sports (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

AI ClassicsFeb-14-2019, 17:54:29 GMT

Readings in Medical Artificial Intelligence: The First Decade

William J. Clancey

A survey of early work exploring how AI can be used in medicine, with somewhat more technical expositions than in the complementary volume Artificial Intelligence in Medicine."Each chapter is preceded by a brief introduction that outlines our view of its contribution to the field, the reason it was selected for inclusion in this volume, an overview of its content, and a discussion of how the work evolved after the article appeared and how it relates to other chapters in the book.

diagnostic medicine, university of pittsburgh, university of wisconsin, (106 more...)

AI Classics

Country:

North America > United States > California (1.00)
North America > Canada (0.92)
Europe > United Kingdom (0.67)

Genre:

Overview (1.20)
Research Report > Experimental Study (1.00)
Research Report > New Finding (1.00)
(4 more...)

Industry:

Health & Medicine > Therapeutic Area > Internal Medicine (1.02)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.01)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.01)
(27 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.06)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.03)
Information Technology > Knowledge Management > Knowledge Engineering (1.02)
(18 more...)

Kleinberg, Robert, Leyton-Brown, Kevin, Lucier, Brendan, Graham, Devon

Procrastinating with Confidence: Near-Optimal, Anytime, Adaptive Algorithm Configuration

arXiv.org Artificial IntelligenceFeb-14-2019

Algorithm configuration methods optimize the performance of a parameterized heuristic algorithm on a given distribution of problem instances. Recent work introduced an algorithm configuration procedure ('Structured Procrastination') that provably achieves near optimal performance with high probability and with nearly minimal runtime in the worst case. It also offers an $\textit{anytime}$ property: it keeps tightening its optimality guarantees the longer it is run. Unfortunately, Structured Procrastination is not $\textit{adaptive}$ to characteristics of the parameterized algorithm: it treats every input like the worst case. Follow-up work ('Leaps and Bounds') achieves adaptivity but trades away the anytime property. This paper introduces a new algorithm configuration method, 'Structured Procrastination with Confidence', that preserves the near-optimality and anytime properties of Structured Procrastination while adding adaptivity. In particular, the new algorithm will perform dramatically faster in settings where many algorithm configurations perform poorly; we show empirically that such settings arise frequently in practice.

artificial intelligence, data mining, machine learning, (17 more...)

1902.05454

Country: North America > Canada > British Columbia (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.46)

arXiv.org Machine LearningFeb-13-2019

On Reinforcement Learning Using Monte Carlo Tree Search with Supervised Learning: Non-Asymptotic Analysis

Shah, Devavrat, Xie, Qiaomin, Xu, Zhi

Inspired by the success of AlphaGo Zero (AGZ) which utilizes Monte Carlo Tree Search (MCTS) with Supervised Learning via Neural Network to learn the optimal policy and value function, in this work, we focus on establishing formally that such an approach indeed finds optimal policy asymptotically, as well as establishing non-asymptotic guarantees in the process. We shall focus on infinite-horizon discounted Markov Decision Process to establish the results. To start with, it requires establishing the MCTS's claimed property in the literature that for any given query state, MCTS provides approximate value function for the state with enough simulation steps of MDP. We provide non-asymptotic analysis establishing this property by analyzing a non-stationary multi-arm bandit setup. Our proof suggests that MCTS needs to be utilized with polynomial rather than logarithmic "upper confidence bound" for establishing its desired performance -- interestingly enough, AGZ chooses such polynomial bound. Using this as a building block, combined with nearest neighbor supervised learning, we argue that MCTS acts as a "policy improvement" operator; it has a natural "bootstrapping" property to iteratively improve value function approximation for all states, due to combining with supervised learning, despite evaluating at only finitely many states. In effect, we establish that to learn $\varepsilon$ approximation of value function in $\ell_\infty$ norm, MCTS combined with nearest-neighbors requires samples scaling as $\widetilde{O}\big(\varepsilon^{-(d+4)}\big)$, where $d$ is the dimension of the state space. This is nearly optimal due to a minimax lower bound of $\widetilde{\Omega}\big(\varepsilon^{-(d+2)}\big).$

algorithm, mct, node, (14 more...)

1902.05213

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New York > Tompkins County > Ithaca (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Games > Go (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

arXiv.org Machine LearningFeb-13-2019

ATMSeer: Increasing Transparency and Controllability in Automated Machine Learning

Wang, Qianwen, Ming, Yao, Jin, Zhihua, Shen, Qiaomu, Liu, Dongyu, Smith, Micah J., Veeramachaneni, Kalyan, Qu, Huamin

To relieve the pain of manually selecting machine learning algorithms and tuning hyperparameters, automated machine learning (AutoML) methods have been developed to automatically search for good models. Due to the huge model search space, it is impossible to try all models. Users tend to distrust automatic results and increase the search budget as much as they can, thereby undermining the efficiency of AutoML. To address these issues, we design and implement ATMSeer, an interactive visualization tool that supports users in refining the search space of AutoML and analyzing the results. To guide the design of ATMSeer, we derive a workflow of using AutoML based on interviews with machine learning experts. A multi-granularity visualization is proposed to enable users to monitor the AutoML process, analyze the searched models, and refine the search space in real time. We demonstrate the utility and usability of ATMSeer through two case studies, expert interviews, and a user study with 13 end users.

algorithm, atmseer, automl process, (14 more...)

doi: 10.1145/3290605.3300911

1902.05009

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > Scotland > City of Glasgow > Glasgow (0.04)
Asia > China > Hong Kong (0.04)
(4 more...)

Genre:

Research Report (1.00)
Questionnaire & Opinion Survey (1.00)
Personal > Interview (0.48)

Industry:

Health & Medicine > Consumer Health (0.68)
Health & Medicine > Therapeutic Area > Neurology (0.54)
Health & Medicine > Therapeutic Area > Musculoskeletal (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.79)

AI ClassicsFeb-12-2019, 22:35:30 GMT

Readings in Medical Artificial Intelligence

JANICE S. AIKINS Dr. Aikins received her Ph.D. in computer science from Stanford University in 1980. She is currently a research computer scientist at IBM's Palo Alto Scientific Center. She specializes in designing systems with an emphasis on the explicit representation of control knowledge in expert systems. ROBERT L. BLUM Dr. Blum received his M.D. from the University of California Medical School at San Francisco in 1973. From 1973 to 1976 he did an internship and residency in the Department of Internal Medicine at the Kaiser Foundation Hospital in Oakland, California, where he was chief resident in 1976.

diagnostic medicine, university of pittsburgh, university of wisconsin, (98 more...)

AI Classics

Country:

North America > United States > California > Santa Clara County (0.34)
North America > United States > California > San Francisco County > San Francisco (0.34)
North America > United States > California > Alameda County (0.33)

Genre:

Research Report > Experimental Study (1.00)
Summary/Review (1.00)
Research Report > New Finding (1.00)
(4 more...)

Industry:

Health & Medicine > Therapeutic Area > Internal Medicine (1.22)
Health & Medicine > Health Care Providers & Services (1.21)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.01)
(23 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.26)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.21)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.03)
(15 more...)

arXiv.org Machine LearningFeb-12-2019

A^2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes

Xu, Kui, Wang, Zhe, Shi, Jiangping, Li, Hongsheng, Zhang, Qiangfeng Cliff

Constructing of molecular structural models from Cryo-Electron Microscopy (Cryo-EM) density volumes is the critical last step of structure determination by Cryo-EM technologies. Methods have evolved from manual construction by structural biologists to perform 6D translation-rotation searching, which is extremely compute-intensive. In this paper, we propose a learning-based method and formulate this problem as a vision-inspired 3D detection and pose estimation task. We develop a deep learning framework for amino acid determination in a 3D Cryo-EM density volume. We also design a sequence-guided Monte Carlo Tree Search (MCTS) to thread over the candidate amino acids to form the molecular structure. This framework achieves 91% coverage on our newly proposed dataset and takes only a few minutes for a typical structure with a thousand amino acids. Our method is hundreds of times faster and several times more accurate than existing automated solutions without any human intervention.

acid, amino acid, detection, (15 more...)

1901.00785

Country:

Asia > China > Hong Kong (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Education > Health & Safety > School Nutrition (0.99)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.89)

de Franca, Fabricio Olivetti, Aldeia, Guilherme Seidyo Imai

Interaction-Transformation Evolutionary Algorithm for Symbolic Regression

arXiv.org Machine LearningFeb-11-2019

Abstract--The Interaction-Transformation (IT) is a new representation for Symbolic Regression that restricts the search space into simpler, but expressive, function forms. This representation has the advantage of creating a smoother search space unlike the space generated by Expression Trees, the common representation used in Genetic Programming. This paper introduces an Evolutionary Algorithmcapable of evolving a population of IT expressions supported only by the mutation operator. The results show that this representation is capable of finding better approximations to real-world data sets when compared to traditional approaches and a state-of-the-art Genetic Programming algorithm. I. INTRODUCTION Regression analysis has the objective of describing the relationship between measurable variables [1]. This analysis can be used to make predictions of not yet observed samples, to study a system's behavior or to calculate the statistical properties of such system. F. O. de Franca is with Federal University of ABC, Center for Mathematics, Computationand Cognition, Heuristics, Analysis and Learning Laboratory, São Paulo, Brazil, email: folivetti@ufabc.edu.br,

algorithm, expression, representation, (13 more...)

1902.03983

Country: South America > Brazil > São Paulo (0.25)

Genre:

Research Report > New Finding (0.54)
Research Report > Experimental Study (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.94)