Real or Robotic? Assessing Whether LLMs Accurately Simulate Qualities of Human Responses in Dialogue