Reinforcement Learning to solve Rubik's cube (and other complex problems!)