Pareto Probing: Trading Off Accuracy for Complexity
Pimentel, Tiago, Saphra, Naomi, Williams, Adina, Cotterell, Ryan
–arXiv.org Artificial Intelligence
The question of how to probe contextual word representations for linguistic structure in a way that is both principled and useful has seen significant attention recently in the NLP literature. In our contribution to this discussion, we argue for a probe metric that reflects the fundamental trade-off between probe complexity and performance: the Pareto hypervolume. To measure complexity, we present a number of parametric and non-parametric metrics. Our experiments using Pareto hypervolume as an evaluation metric show that probes often do not conform to our expectations -- e.g., why should the non-contextual fastText representations encode more morpho-syntactic information than the contextual BERT representations? These results suggest that common, simplistic probing tasks, such as part-of-speech labeling and dependency arc labeling, are inadequate to evaluate the linguistic structure encoded in contextual word representations. This leads us to propose full dependency parsing as a probing task. In support of our suggestion that harder probing tasks are necessary, our experiments with dependency parsing reveal a wide gap in syntactic knowledge between contextual and non-contextual representations.
arXiv.org Artificial Intelligence
Dec-4-2023
- Country:
- Oceania > Australia
- North America > United States
- Minnesota > Hennepin County > Minneapolis (0.14)
- Europe
- Czechia > Prague (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.14)
- Switzerland > Zürich
- Zürich (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Italy > Tuscany
- Florence (0.04)
- Finland > Southwest Finland
- Turku (0.04)
- Asia
- China > Hong Kong (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Japan > Honshū
- Tōhoku > Iwate Prefecture
- Morioka (0.04)
- Kansai > Osaka Prefecture
- Osaka (0.04)
- Tōhoku > Iwate Prefecture
- Genre:
- Research Report > New Finding (0.48)
- Technology: