Evaluation of Large Language Models: STEM Education and Gender Stereotypes