Average Is Not Enough: Caveats of Multilingual Evaluation

Jan-3-2023–arXiv.org Artificial Intelligence

We believe that this to improvements of various multilingual technologies, is an often overlooked tool in our research toolkit such as machine translation (Arivazhagan that should be used more to ensure that we are et al., 2019), multilingual language models (Devlin able to properly interpret results from multilingual et al., 2019; Conneau and Lample, 2019), crosslingual evaluation and detect various linguistic biases and transfer learning (Pikuliak et al., 2021) or problems. In addition to this discussion, which language independent representations (Ruder et al., we consider a contribution in itself, we also propose 2019). It is now possible to create well-performing a visualization based on URIEL typological multilingual methods for many tasks. When dealing database (Littell et al., 2017) as an example of such with multilingual methods, we need to be able qualitative analysis, and we show that it is able to to evaluate how good they really are, i.e. how effective discover linguistic biases in published results.

computational linguistic, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Jan-3-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > Minnesota
    - Hennepin County > Minneapolis (0.14)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Spain > Valencian Community
    - Valencia Province > Valencia (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Greece > Attica
    - Athens (0.04)
- Asia
  - Indonesia > Bali (0.04)
  - China > Hong Kong (0.04)

Genre:
- Research Report > New Finding (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found