Improving the Knowledge Gradient Algorithm
–Neural Information Processing Systems
The knowledge gradient (KG) algorithm is a popular policy for the best arm identification (BAI) problem. It is built on the simple idea of always choosing the measurement that yields the greatest expected one-step improvement in the estimate of the best mean of the arms.
Neural Information Processing Systems
Oct-9-2025, 06:39:20 GMT