Goto

Collaborating Authors

 intertextuality


Quantitative Intertextuality from the Digital Humanities Perspective: A Survey

Duan, Siyu

arXiv.org Artificial Intelligence

The connection between texts is referred to as intertextuality in literary theory, which served as an important theoretical basis in many digital humanities studies. Over the past decade, advancements in natural language processing have ushered intertextuality studies into the quantitative age. Large-scale intertextuality research based on cutting-edge methods has continuously emerged. This paper provides a roadmap for quantitative intertextuality studies, summarizing their data, methods, and applications. Drawing on data from multiple languages and topics, this survey reviews methods from statistics to deep learning. It also summarizes their applications in humanities and social sciences research and the associated platform tools. Driven by advances in computer technology, more precise, diverse, and large-scale intertext studies can be anticipated. Intertextuality holds promise for broader application in interdisciplinary research bridging AI and the humanities.


Modelling Intertextuality with N-gram Embeddings

Xing, Yi

arXiv.org Artificial Intelligence

An intertextual link between Frances Burney's Cecilia and Jane Austen's Pride and Prejudice established by semantically similar trigrams Intertextuality, the allusive relationship between literary texts, is a fundamental concept in literary studies. It is the idea that texts are not isolated entities, but are interconnected through a network of references, allusions, and influences. Intertextuality is a key aspect of both literary creativity and interpretation, and it has been a popular research topic since it was put forward by the French semiotician Julia Kristeva in the 1960s ( Kristeva, 2024, 1968). Traditionally, the analysis of intertextuality has been a qualitative and interpretative endeavour, relying on close reading and critical judgement, and focusing only on a small number of texts.


Characterizing the Effects of Translation on Intertextuality using Multilingual Embedding Spaces

McGovern, Hope, Sirin, Hale, Lippincott, Tom

arXiv.org Artificial Intelligence

Rhetorical devices are difficult to translate, but they are crucial to the translation of literary documents. We investigate the use of multilingual embedding spaces to characterize the preservation of intertextuality, one common rhetorical device, across human and machine translation. To do so, we use Biblical texts, which are both full of intertextual references and are highly translated works. We provide a metric to characterize intertextuality at the corpus level and provide a quantitative analysis of the preservation of this rhetorical device across extant human translations and machine-generated counterparts. We go on to provide qualitative analysis of cases wherein human translations over- or underemphasize the intertextuality present in the text, whereas machine translations provide a neutral baseline. This provides support for established scholarship proposing that human translators have a propensity to amplify certain literary characteristics of the original manuscripts.


Latent Structures of Intertextuality in French Fiction

Barré, Jean

arXiv.org Artificial Intelligence

Intertextuality is a key concept in literary theory that challenges traditional notions of text, signification or authorship. It views texts as part of a vast intertextual network that is constantly evolving and being reconfigured. This paper argues that the field of computational literary studies is the ideal place to conduct a study of intertextuality since we have now the ability to systematically compare texts with each others. Specifically, we present a work on a corpus of more than 12.000 French fictions from the 18th, 19th and early 20th century. We focus on evaluating the underlying roles of two literary notions, sub-genres and the literary canon in the framing of textuality. The article attempts to operationalize intertextuality using state-of-the-art contextual language models to encode novels and capture features that go beyond simple lexical or thematic approaches. Previous research (Hughes, 2012) supports the existence of a literary "style of a time", and our findings further reinforce this concept. Our findings also suggest that both subgenres and canonicity play a significant role in shaping textual similarities within French fiction. These discoveries point to the importance of considering genre and canon as dynamic forces that influence the evolution and intertextual connections of literary works within specific historical contexts.


Investigating Expert-in-the-Loop LLM Discourse Patterns for Ancient Intertextual Analysis

Umphrey, Ray, Roberts, Jesse, Roberts, Lindsey

arXiv.org Artificial Intelligence

This study explores the potential of large language models (LLMs) for identifying and examining intertextual relationships within biblical, Koine Greek texts. By evaluating the performance of LLMs on various intertextuality scenarios the study demonstrates that these models can detect direct quotations, allusions, and echoes between texts. The LLM's ability to generate novel intertextual observations and connections highlights its potential to uncover new insights. However, the model also struggles with long query passages and the inclusion of false intertextual dependences, emphasizing the importance of expert evaluation. The expert-in-the-loop methodology presented offers a scalable approach for intertextual research into the complex web of intertextuality within and beyond the biblical corpus.

  Country:
  Genre: Research Report (0.50)

Graecia capta ferum victorem cepit. Detecting Latin Allusions to Ancient Greek Literature

Riemenschneider, Frederick, Frank, Anette

arXiv.org Artificial Intelligence

Intertextual allusions hold a pivotal role in Classical Philology, with Latin authors frequently referencing Ancient Greek texts. Until now, the automatic identification of these intertextual references has been constrained to monolingual approaches, seeking parallels solely within Latin or Greek texts. In this study, we introduce SPhilBERTa, a trilingual Sentence-RoBERTa model tailored for Classical Philology, which excels at cross-lingual semantic comprehension and identification of identical sentences across Ancient Greek, Latin, and English. We generate new training data by automatically translating English texts into Ancient Greek. Further, we present a case study, demonstrating SPhilBERTa's capability to facilitate automated detection of intertextual parallels. Our models and resources are available at https://github.com/Heidelberg-NLP/ancient-language-models.


Neural Kimono, or Intertextuality of AI

#artificialintelligence

Semiotician Julia Kristeva wrote in her essay "Word, Dialogue and Novel" [PDF] about texts, they were "absorption and transformation of another [texts]". She popularized the term "Intertextuality", which is There are no original texts; they are based on different previous works by continuously metamorphosing our narratives. Texts are textures -- they are material of our modern writing-based storytelling with its patterns, topics, ideas, and indeed have other features of everlasting change compared to everchanging stories, told over generations in Ancient times (in the way of "Chinese Whisper" game, with changing contents during timely transformed contexts). With the post-authorial storytelling in the times of AI, we have to rethink intertextuality as a generative driver. If the author-based text was created by the conscious and subconscious human body of thoughts (author's knowledge, experience, biases, stereotypes, preoccupations, distorted memories), Artificial Intelligence as creator is trained on previous datasets and generates new works by re-interpreting, re-combining known features, patterns, perspectives (like in self-attention based language model GPT-3).


Bioinformatics and Classical Literary Study

Chaudhuri, Pramit, Dexter, Joseph P.

arXiv.org Artificial Intelligence

This paper describes the Quantitative Criticism Lab, a collaborative initiative between classicists, quantitative biologists, and computer scientists to apply ideas and methods drawn from the sciences to the study of literature. A core goal of the project is the use of computational biology, natural language processing, and machine learning techniques to investigate authorial style, intertextuality, and related phenomena of literary significance. As a case study in our approach, here we review the use of sequence alignment, a common technique in genomics and computational linguistics, to detect intertextuality in Latin literature. Sequence alignment is distinguished by its ability to find inexact verbal similarities, which makes it ideal for identifying phonetic echoes in large corpora of Latin texts. Although especially suited to Latin, sequence alignment in principle can be extended to many other languages.


Automated Attribution and Intertextual Analysis

Brofos, James, Kannan, Ajay, Shu, Rui

arXiv.org Machine Learning

In this work, we employ quantitative methods from the realm of statistics and machine learning to develop novel methodologies for author attribution and textual analysis. In particular, we develop techniques and software suitable for applications to Classical study, and we illustrate the efficacy of our approach in several interesting open questions in the field. We apply our numerical analysis techniques to questions of authorship attribution in the case of the Greek tragedian Euripides, to instances of intertextuality and influence in the poetry of the Roman statesman Seneca the Younger, and to cases of "interpolated" text with respect to the histories of Livy.