Red Teaming for Large Language Models At Scale: Tackling Hallucinations on Mathematics Tasks