Detecting Structured Language Alternations in Historical Documents by Combining Language Identification with Fourier Analysis
Sirin, Hale, Li, Sabrina, Lippincott, Tom
–arXiv.org Artificial Intelligence
In this study, we present a generalizable workflow to identify documents in a historic language with a nonstandard language and script combination, Armeno-Turkish. We introduce the task of detecting distinct patterns of multilinguality based on the frequency of structured language alternations within a document.
arXiv.org Artificial Intelligence
Jan-25-2024
- Country:
- Africa > Middle East (0.04)
- Asia > Middle East
- Republic of Türkiye > Istanbul Province > Istanbul (0.05)
- Europe
- Austria > Vienna (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.05)
- Norway > Eastern Norway
- Genre:
- Research Report (0.71)
- Technology: