0346c148ba1c21c6b4780a961ea141dc-Supplemental-Conference.pdf

Neural Information Processing Systems 

Table 7: Extensions of Table 1 with more details of prompts used to generate class-conditioned texts for different GLUE tasks. SST-2 and CoLA are single-sequence classification tasks and the rest are sequence-pair classification tasks. Generation for CoLA does not use prompts but by varying sampling temperatures. Text generation with CTRL [23] requires starting with control codes, and we use the ones that correspond to the pretraining corpus where the first sequence is sampled: For MNLI, RTE and MRPC, the first sequence is sampled from Wikipedia; for QNLI and QQP, the first sequence is sampled from OpenWebText [17]. The prompts used for SST-2 are part of the CTRL [23] codes. Furthermore, xg contradiction There is a rumor that xs.