Context-dependent upper-confidence bounds for directed exploration

Raksha Kumaraswamy, Matthew Schlegel, Adam White, Martha White

Neural Information Processing Systems 

To achieve such a goal, directed exploration strategies are key.