Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning