Deep Reinforcement Learning for Modelling Protein Complexes