Identifying Fairness Issues in Automatically Generated Testing Content