main questions raised in the reviews. 2 Reviewer

Neural Information Processing Systems 

We thank the Reviewers for their thoughtful assessment of our work and valuable comments. We will work on improving the writing for the final version, as suggested. The test can naturally be applied at any point of the training process to see if overfitting has happened. We used different random seeds for each training process. Indeed, hyperparameter selection is one of the potential sources of overfitting.