Adaptive Policies for Sequential Sampling under Incomplete Information and a Cost Constraint

Burnetas, Apostolos, Kanavetas, Odysseas

Jan-19-2012–arXiv.org Machine Learning

In this paper we consider the problem of sequential sampling from k independent statistical populations with unknown distributions. The objective is to maximize the expected outcome per period achieved over infinite horizon, under a constraint that the expected sampling cost per period does not exceed an upper bound. The introduction of a sampling cost introduces a new dimension in the standard tradeoff between experimentation and profit maximization faced in problems of control under incomplete information. The sampling cost may prohibit using populations with high mean outcomes because their sampling cost may be too high. Instead, the decision maker must identify the subset of populations with the best combination of outcome versus cost and allocate the sampling effort among them in an optimal manner. 1 From the mathematical point of view, this class of problems incorporates statistical methodologies into mathematical programming problems.

artificial intelligence, constraint, machine learning, (18 more...)

arXiv.org Machine Learning

Jan-19-2012

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Representation & Reasoning (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found