Thanks for the suggestion. We will

Neural Information Processing Systems 

Genral response: We thank all reviewers for their constructive comments. Below is our response for common questions. BERT models; and (iii) is more environmentally friendly due to weight sharing. Q1."whether this approach can be adapted to work during the pre-training phase" Q1."paper quite dense and hard to read...rely on various complicated procedures", "if there is a We will continue thinking about simplifying the method. Q2."T able 4, what exactly is'fine-tuning'?":This is the'fine-tuning' mentioned in Lines 138-139 in Section 2.2. I can't work out what this sentence means."

Similar Docs  Excel Report  more

TitleSimilaritySource
None found