Revealing User Familiarity Bias in Task-Oriented Dialogue via Interactive Evaluation