Beyond DQN/A3C: A Survey in Advanced Reinforcement Learning