Research Papers on Off-Policy Reinforcement Learning