How Stylistic Similarity Shapes Preferences in Dialogue Dataset with User and Third Party Evaluations