Adaptive Sampling for Discovery

Xu, Ziping; Shim, Eunjae; Tewari, Ambuj; Zimmerman, Paul

Jan-2-2023–arXiv.org Artificial Intelligence

In this paper, we study a sequential decision-making problem called Adaptive Sampling for Discovery (ASD). Starting with a large unlabeled dataset, algorithms for ASD adaptively label the points with the goal of maximizing the sum of responses. This problem has wide applications to real-world discovery problems, for example, drug discovery with the help of machine learning models. ASD algorithms face the well-known exploration-exploitation dilemma: the algorithm needs to choose points that yield information to improve model estimates, but it also needs to exploit the current model by labeling points it predicts will have high responses. We rigorously formulate the problem and propose a general information-directed sampling (IDS) algorithm. We provide theoretical guarantees for the performance of IDS in linear, graph, and low-rank models. The benefits of IDS are shown in both simulation experiments and real-data experiments for discovering chemical reaction conditions.
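The exploration-exploitation trade-off described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a Bayesian linear model with a Gaussian prior and known noise variance, picks points deterministically, and measures information as the gain about the model parameter rather than about the optimal point. The names `ids_style_discovery`, `label_fn`, `noise_var`, `prior_var`, and `n_samples` are hypothetical and chosen only for illustration.

```python
import numpy as np


def ids_style_discovery(X, label_fn, budget, noise_var=1.0, prior_var=1.0,
                        n_samples=64, seed=0):
    """Label `budget` rows of X one at a time, without replacement.

    Assumes budget <= len(X). Returns (chosen indices, sum of responses).
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    precision = np.eye(d) / prior_var  # Gaussian posterior precision of theta
    b = np.zeros(d)                    # running sum of x * y / noise_var
    unlabeled = np.ones(n, dtype=bool)
    chosen, total = [], 0.0

    for _ in range(budget):
        cov = np.linalg.inv(precision)
        cov = (cov + cov.T) / 2        # symmetrize against round-off
        mean = cov @ b
        idx = np.flatnonzero(unlabeled)
        Xu = X[idx]

        # Exploitation term: Monte Carlo estimate of each point's expected
        # shortfall relative to the best remaining point.
        thetas = rng.multivariate_normal(mean, cov, size=n_samples)
        preds = Xu @ thetas.T                      # shape (m, n_samples)
        regret = (preds.max(axis=0) - preds).mean(axis=1)

        # Exploration term: information gained about theta by labeling x,
        # 0.5 * log(1 + x^T cov x / noise_var) under the Gaussian model.
        var = np.einsum("ij,jk,ik->i", Xu, cov, Xu)
        info = 0.5 * np.log1p(var / noise_var)

        # IDS-style rule (simplified): minimize squared regret per unit of
        # information gained.
        ratio = regret ** 2 / np.maximum(info, 1e-12)
        pick = int(idx[np.argmin(ratio)])

        # Query the label and update the posterior.
        y = float(label_fn(pick))
        x = X[pick]
        precision += np.outer(x, x) / noise_var
        b += x * y / noise_var
        unlabeled[pick] = False
        chosen.append(pick)
        total += y

    return chosen, total


# Usage on synthetic data (all values here are made up for illustration).
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))
theta_true = rng.normal(size=5)
responses = X @ theta_true + 0.1 * rng.normal(size=200)
chosen, total = ids_style_discovery(X, lambda i: responses[i],
                                    budget=20, noise_var=0.01)
```

A point with low estimated regret but almost no information gain can still lose to a slightly worse point that reduces model uncertainty; the ratio encodes that balance. A purely greedy rule would instead take the arg max of the posterior mean prediction at every step.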

artificial intelligence, data mining, machine learning

Genre:
- Research Report (0.83)

Industry:
- Energy > Oil & Gas
  - Upstream (0.34)
- Health & Medicine > Pharmaceuticals & Biotechnology (0.70)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning
    - Statistical Learning (0.47)
  - Data Science > Data Mining
    - Big Data (0.46)
