sacred text
Modeling the Sacred: Considerations when Using Religious Texts in Natural Language Processing
This position paper concerns the use of religious texts in Natural Language Processing (NLP), which is of special interest to the Ethics of NLP. Religious texts are expressions of culturally important values, and machine learned models have a propensity to reproduce cultural values encoded in their training data. Furthermore, translations of religious texts are frequently used by NLP researchers when language data is scarce. This repurposes the translations from their original uses and motivations, which often involve attracting new followers. This paper argues that NLP's use of such texts raises considerations that go beyond model biases, including data provenance, cultural contexts, and their use in proselytism. We argue for more consideration of researcher positionality, and of the perspectives of marginalized linguistic and religious communities.
- Oceania > Australia (0.14)
- North America > Canada > Ontario > Toronto (0.04)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- (9 more...)
- Law > Civil Rights & Constitutional Law (0.69)
- Health & Medicine (0.68)
- Law > International Law (0.46)
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
Aars, Corinne, Adams, Lauren, Tian, Xiaokan, Wang, Zhaoyu, Wismer, Colton, Wu, Jason, Rivas, Pablo, Sooksatra, Korn, Fendt, Matthew
This study presents the development and evaluation of a ByT5-based multilingual translation model tailored for translating the Bible into underrepresented languages. Utilizing the comprehensive Johns Hopkins University Bible Corpus, we trained the model to capture the intricate nuances of character-based and morphologically rich languages. Our results, measured by the BLEU score and supplemented with sample translations, suggest the model can improve accessibility to sacred texts. It effectively handles the distinctive biblical lexicon and structure, thus bridging the linguistic divide. The study also discusses the model's limitations and suggests pathways for future enhancements, focusing on expanding access to sacred literature across linguistic boundaries.
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
Bible is providing data to help create AI that can can convert texts
Scientists are now using the Bible to help algorithms perfect their language skills. An AI has been trained on various versions of the sacred text so it can convert written works into different styles for different audiences. Each version of the Bible contains more than 31,000 verses that the researchers used to produce over 1.5 million unique pairings of source and target verses. The Bible is helping algorithms perfect their translation skills. Internet tools that translate text between languages like English and Spanish are widely available.