"You Are Rejected!": An Empirical Study of Large Language Models Taking Hiring Evaluations