An Empirical Study on the Overlapping Problem of Open-Domain Dialogue Datasets