Generalised agent for solving higher board states of tic tac toe using Reinforcement Learning