were also surprised that they perform poorly, so we changed to optimizing individual rewards (easier settings) to