Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels