Historical Ink: 19th Century Latin American Spanish Newspaper Corpus with LLM OCR Correction
Manrique-Gómez, Laura, Montes, Tony, Manrique, Rubén
arXiv.org Artificial Intelligence
Newspapers, as key historical resources, contain a diverse range of information about political, economic, and cultural processes and are abundant due to focused efforts to preserve them within national archives. Indeed, the discipline of Digital Humanities, which emphasizes the incorporation of digital tools in humanities and social sciences research, has spent much of the past three decades on the task of digitization, resulting in a wealth of curated digital collections (Berry and Fagerjord, 2017; Dobson, 2019). However, digitizing these corpora has brought plenty of challenges in transcribing the images into machine-readable texts.

Another substantial project is the "Digging into Data Challenge". A part of the Transatlantic Partnership for Social Sciences and Humanities 2016, this initiative yielded a vast collection of 19th-century press materials known as "Atlas - Oceanic Exchanges. Tracing Global Information Networks in Historical Papers" (Exchanges). Other significant works include "Viral Texts: Mapping Networks of Reprinting in 19th-Century Newspapers and Magazines" (Cordell and Smith), a project that investigates 19th-century journalistic reports to understand the culture of reprinting in the United States before the Civil War, and ...
Jul-3-2024