adaptive experimental design
Minimax and Bayes Optimal Best-arm Identification: Adaptive Experimental Design for Treatment Choice
This study investigates adaptive experimental design for treatment choice, also known as fixed-budget best-arm identification. We consider an adaptive procedure consisting of a treatment-allocation phase followed by a treatment-choice phase, and we design an adaptive experiment for this setup to efficiently identify the best treatment arm, defined as the one with the highest expected outcome. In our designed experiment, the treatment-allocation phase consists of two stages. The first stage is a pilot phase, where we allocate each treatment arm uniformly with equal proportions to eliminate clearly suboptimal arms and estimate outcome variances. In the second stage, we allocate treatment arms in proportion to the variances estimated in the first stage. After the treatment-allocation phase, the procedure enters the treatment-choice phase, where we choose the treatment arm with the highest sample mean as our estimate of the best treatment arm. We prove that this single design is simultaneously asymptotically minimax and Bayes optimal for the simple regret, with upper bounds that match our lower bounds up to exact constants. Therefore, our designed experiment achieves the sharp efficiency limits without requiring separate tuning for minimax and Bayesian objectives.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- North America > United States > New York (0.04)
- (2 more...)
Review for NeurIPS paper: Adaptive Experimental Design with Temporal Interference: A Maximum Likelihood Approach
Weaknesses: - Can we interpret the results as follows: If the TAR assumption is satisfied with positive limits, and we use MLE, then temporal interference does not cause bias. If this interpretation is correct, then it would be illuminating if the authors provide the intuitive connection between the TAR assumption and temporal interference. It is not clear if the estimations that the authors have required are feasible if the state space is large. The next natural question is how robust the results are if we use other methods for estimation. This could have been shown by providing some simulations, which is a part missing from the manuscript.
Review for NeurIPS paper: Adaptive Experimental Design with Temporal Interference: A Maximum Likelihood Approach
The paper studied the online experimental design problem where there are temporal dependencies between the two control policies/treatments. The novelty of the problem setup and the theoretical analysis in the paper are appreciated by all the reviewers. Although the analysis is the main contribution, the paper would be much stronger if there are meaningful experiments on toy problems to showcase the performance the online MLE-based approach vs the standard experimental design approaches.
Adaptive Experimental Design with Temporal Interference: A Maximum Likelihood Approach
Suppose an online platform wants to compare a treatment and control policy (e.g., two different matching algorithms in a ridesharing system, or two different inventory management algorithms in an online retail site). Standard experimental approaches to this problem are biased (due to temporal interference between the policies), and not sample efficient. We study optimal experimental design for this setting. We view testing the two policies as the problem of estimating the steady state difference in reward between two unknown Markov chains (i.e., policies). We assume estimation of the steady state reward for each chain proceeds via nonparametric maximum likelihood, and search for consistent (i.e., asymptotically unbiased) experimental designs that are efficient (i.e., asymptotically minimum variance).
Adaptive Experimental Design and Counterfactual Inference
Fiez, Tanner, Gamez, Sergio, Chen, Arick, Nassif, Houssam, Jain, Lalit
Yet, experimenters are steadily shifting toward Adaptive Experimental Design (AED) methods with the goal of increasing testing throughput or reducing the cost of experimentation. AED promises to use a fraction of the impressions that traditional A/B/N tests require to yield high confidence inferences or to directly drive business impact. In this paper, we share lessons learned regarding the challenges and pitfalls of naively using adaptive experimentation systems in industrial settings where non-stationarity is the norm rather than the exception. Moreover, we provide perspectives on the proper objectives and system specifications in these settings. This culminates in a high level presentation of an AED framework for counterfactual inference. To provide a robust and flexible tool for experimenters with performance certificates at minimal cost, our methodology combines cumulative gain estimators, always-valid confidence intervals, and an elimination algorithm.
- North America > United States > Washington > King County > Seattle (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Taxonomy of A Decision Support System for Adaptive Experimental Design in Field Robotics
Gregory, Jason M., Al-Hussaini, Sarah, Agha-mohammadi, Ali-akbar, Gupta, Satyandra K.
Experimental design in field robotics is an adaptive human-in-the-loop decision-making process in which an experimenter learns about system performance and limitations through interactions with a robot in the form of constructed experiments. This can be challenging because of system complexity, the need to operate in unstructured environments, and the competing objectives of maximizing information gain while simultaneously minimizing experimental costs. Based on the successes in other domains, we propose the use of a Decision Support System (DSS) to amplify the human's decision-making abilities, overcome their inherent shortcomings, and enable principled decision-making in field experiments. In this work, we propose common terminology and a six-stage taxonomy of DSSs specifically for adaptive experimental design of more informative tests and reduced experimental costs. We construct and present our taxonomy using examples and trends from DSS literature, including works involving artificial intelligence and Intelligent DSSs. Finally, we identify critical technical gaps and opportunities for future research to direct the scientific community in the pursuit of next-generation DSSs for experimental design.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- Africa > Eswatini > Manzini > Manzini (0.04)
- North America > United States > Maryland > Prince George's County > Adelphi (0.04)
- (3 more...)
- Health & Medicine (1.00)
- Government (0.68)