Evaluating LLM-Generated Versus Human-Authored Responses in Role-Play Dialogues