Measuring Stereotype and Deviation Biases in Large Language Models