Deep Reinforcement Learning for 5*5 Multiplayer Go