Reinforcement Learning for ConnectX