A Data and

Neural Information Processing Systems 

We tried removing and keeping the comments in the code from our training data. As shown in Table 6, keeping the comments gives better results overall. Detailed statistics of the resulting dataset can be found in Table 3. We give the size in GigaBytes, the number of files and functions, and the number of tokens. We show two versions of the same Python function and their common tokenization.

Duplicate Docs Excel Report

Similar Docs  Excel Report  more

TitleSimilaritySource
None found