Improving the Knowledge Gradient Algorithm

Neural Information Processing Systems

The knowledge gradient (KG) algorithm is a popular policy for the best arm identification (BAI) problem. It is built on the simple idea of always choosing the measurement that yields the greatest expected one-step improvement in the estimate of the best mean of the arms.