Supplementary Material: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers

Wenhui Wang, Furu Wei

Neural Information Processing Systems 

Given an input passage and an answer, the task is to generate a question that asks for the answer.
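
To make the task definition concrete, the following is a minimal sketch of how one question-generation example could be represented and turned into a model input. The `QGExample` container, the `build_source` helper, the `[SEP]` separator, and the passage-then-answer ordering are all assumptions made for illustration and are not taken from the paper; the example text is a toy instance.

```python
from dataclasses import dataclass


@dataclass
class QGExample:
    """One question-generation example (hypothetical container, not from the paper)."""
    passage: str        # input passage
    answer: str         # answer span the question should ask for
    question: str = ""  # generation target; left empty at inference time


def build_source(example: QGExample, sep: str = " [SEP] ") -> str:
    """Join passage and answer into a single source string.

    The separator and the passage-then-answer order are assumptions
    made for illustration only.
    """
    return example.passage.strip() + sep + example.answer.strip()


# Toy example: the model would read `source` and be trained to emit `question`.
example = QGExample(
    passage="The Eiffel Tower was completed in 1889.",
    answer="1889",
    question="When was the Eiffel Tower completed?",
)
source = build_source(example)
```

Here the passage and the answer together form the input, and the question is the output sequence, which matches the direction of the task as stated above.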
