Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization