NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL

Dec-23-2025, 17:32:48 GMT–Neural Information Processing Systems

The Whittle index policy is a powerful tool for obtaining asymptotically optimal solutions to the notoriously intractable problem of restless bandits. However, finding the Whittle indices remains difficult for many practical restless bandits with convoluted transition kernels. This paper proposes NeurWIN, a neural Whittle index network that seeks to learn the Whittle indices for any restless bandit by leveraging mathematical properties of the Whittle indices. We show that a neural network that produces the Whittle index is also one that produces the optimal control for a set of Markov decision problems.
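As a rough illustration of how a Whittle index policy is typically used once indices are available, the sketch below activates the arms with the largest indices under an activation budget, and shows the threshold rule that links an index to the control of a single arm facing an activation cost. The function names, the example index values, and the budget are hypothetical and do not come from the paper; this is not the NeurWIN training procedure.

```python
from typing import List, Sequence


def whittle_policy(indices: Sequence[float], budget: int) -> List[int]:
    """Return the positions of the `budget` arms with the largest indices."""
    # Rank arm positions by index value, highest first.
    order = sorted(range(len(indices)), key=lambda i: indices[i], reverse=True)
    # Keep the top `budget` arms and report them in ascending position order.
    return sorted(order[:budget])


def activate_under_cost(index: float, activation_cost: float) -> bool:
    """Threshold rule for one arm: activate when the index exceeds the cost.

    Assumption: the arm is indexable. At equality the controller is
    indifferent, so this sketch arbitrarily chooses not to activate.
    """
    return index > activation_cost


# Hypothetical index values for four arms in their current states.
hypothetical_indices = [0.7, -0.2, 1.5, 0.1]
chosen = whittle_policy(hypothetical_indices, budget=2)
print(chosen)
```

In a learned setting, the hypothetical index values would instead be the outputs of the index network evaluated at each arm's current state.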

artificial intelligence, machine learning, reinforcement learning

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.39)