Towards a fully RL-based Market Simulator