Improving the Knowledge Gradient Algorithm
Neural Information Processing Systems
The knowledge gradient (KG) algorithm is a popular policy for the best arm identification (BAI) problem. It is built on the simple idea of always choosing the measurement that yields the greatest expected one-step improvement in the estimate of the best mean of the arms.
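As a rough illustration of the one-step-improvement idea, the sketch below computes the classical KG factor for each arm and picks the largest. It assumes independent Gaussian beliefs over the arm means and a known, common sampling-noise variance; these modeling choices, and the names `kg_factors`, `kg_choose`, and `noise_var`, are illustrative assumptions rather than anything specified in the abstract. It shows the basic KG rule only, not the improved algorithm the paper proposes.

```python
import math


def _normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def _normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def kg_factors(means, variances, noise_var):
    """Expected one-step gain in the estimated best mean, per arm.

    Assumes (hypothetically) independent Gaussian posteriors with the
    given means and strictly positive variances, and Gaussian sampling
    noise with known variance `noise_var`. Requires at least two arms.
    """
    factors = []
    for i, (mu, var) in enumerate(zip(means, variances)):
        # Standard deviation of the change in arm i's posterior mean
        # caused by one more sample of arm i.
        sigma_tilde = var / math.sqrt(var + noise_var)
        # Best posterior mean among the competing arms.
        best_other = max(m for j, m in enumerate(means) if j != i)
        # Normalized (non-positive) gap to the best competitor.
        z = -abs(mu - best_other) / sigma_tilde
        factors.append(sigma_tilde * (z * _normal_cdf(z) + _normal_pdf(z)))
    return factors


def kg_choose(means, variances, noise_var):
    """Index of the arm with the largest KG factor."""
    factors = kg_factors(means, variances, noise_var)
    return max(range(len(factors)), key=factors.__getitem__)


# Two arms with equal posterior means: the more uncertain arm is sampled.
chosen = kg_choose([0.0, 0.0], [1.0, 4.0], noise_var=1.0)
```

With equal posterior means the gap term vanishes, so the rule simply favors the arm whose estimate would move the most after one more sample.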
Feb-16-2026, 22:49:18 GMT