Context-dependent upper-confidence bounds for directed exploration