Towards a reinforcement learning de novo genome assembler