Reward Model Overoptimisation in Iterated RLHF