Efficient transfer learning for NLP with ELECTRA
–arXiv.org Artificial Intelligence
Scope of Reproducibility Clark et al. [2020] claims that the ELECTRA approach is highly efficient in NLP performances relative to computation budget. As such, this study focus on this claim, summarized by the following question: Can we use ELECTRA to achieve close to SOTA performances for NLP in low-resource settings, in term of compute cost? Methodology This replication study has been conducted by fully reimplementing the small variant of the original ELECTRA model (Clark et al. [2020]). All experiments are performed on single GPU computers. GLUE benchmark dev set (Wang et al. [2018]) is used for models evaluation and compared with the original paper.
arXiv.org Artificial Intelligence
Apr-6-2021
- Country:
- North America > Canada
- Europe > Belgium
- Brussels-Capital Region > Brussels (0.04)
- Genre:
- Research Report (0.82)
- Technology: