e33d974aae13e4d877477d51d8bafdc4-AuthorFeedback.pdf

Neural Information Processing Systems 

We would like to thank all five (!) reviewers for their detailed reviews and their suggestions / questions, which will help In the following we will try to address the main points raised. Due to space constraints, we had unfortunately shortened this part of the paper too much, as we now realize. 'not enough' on the other features it depends on, we call this'overfitting towards the identity function' in this paper. B (which is controlled by the value of dropout-probability p, or Λ), see Eq. 6. We find it remarkable in l. 154-6 (reviewer 4) that training (diagonal removed) differs from prediction (with diagonal).

Similar Docs  Excel Report  more

TitleSimilaritySource
None found