Music Generation using Human-In-The-Loop Reinforcement Learning