Introduction to Neural Transfer Learning with Transformers for Social Science Text Analysis
–arXiv.org Artificial Intelligence
Transformer-based models for transfer learning have the potential to achieve high prediction accuracies on text-based supervised learning tasks with relatively few training data instances. These models are thus likely to benefit social scientists that seek to have as accurate as possible text-based measures but only have limited resources for annotating training data. To enable social scientists to leverage these potential benefits for their research, this paper explains how these methods work, why they might be advantageous, and what their limitations are. Additionally, three Transformer-based models for transfer learning, BERT (Devlin et al. 2019), RoBERTa (Liu et al. 2019), and the Longformer (Beltagy et al. 2020), are compared to conventional machine learning algorithms on three applications. Across all evaluated tasks, textual styles, and training data set sizes, the conventional models are consistently outperformed by transfer learning with Transformers, thereby demonstrating the benefits these models can bring to text-based social science research.
arXiv.org Artificial Intelligence
Aug-31-2022
- Country:
- North America
- United States > California
- Santa Clara County > Palo Alto (0.04)
- Canada > Ontario
- Toronto (0.04)
- United States > California
- Europe
- Ireland (0.04)
- France (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Italy > Marche
- Ancona Province > Ancona (0.04)
- Germany > North Rhine-Westphalia
- Upper Bavaria > Munich (0.04)
- North America
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Education (0.92)
- Government > Regional Government (0.45)
- Technology:
- Information Technology > Artificial Intelligence
- Natural Language
- Text Processing (1.00)
- Large Language Model (1.00)
- Information Extraction (0.92)
- Grammars & Parsing (0.92)
- Machine Learning
- Transfer Learning (1.00)
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Inductive Learning (1.00)
- Natural Language
- Information Technology > Artificial Intelligence