Death of the Novel(ty): Beyond n-Gram Novelty as a Metric for Textual Creativity
Saakyan, Arkadiy, Kim, Najoung, Muresan, Smaranda, Chakrabarty, Tuhin
arXiv.org Artificial Intelligence
N-gram novelty is widely used to evaluate language models' ability to generate text outside of their training data. More recently, it has also been adopted as a metric for measuring textual creativity. However, theoretical work on creativity suggests that this approach may be inadequate, as it does not account for creativity's dual nature: novelty (how original the text is) and appropriateness (how sensical and pragmatic it is). We investigate the relationship between this notion of creativity and n-gram novelty through 7542 expert writer annotations (n=26) of novelty, pragmaticality, and sensicality via close reading of human and AI-generated text. We find that while n-gram novelty is positively associated with expert writer-judged creativity, ~91% of top-quartile expressions by n-gram novelty are not judged as creative, cautioning against relying on n-gram novelty alone. Furthermore, unlike in human-written text, higher n-gram novelty in open-source LLMs correlates with lower pragmaticality. In an exploratory study with frontier closed-source models, we additionally confirm that they are less likely to produce creative expressions than humans. Using our dataset, we test whether zero-shot, few-shot, and finetuned models are able to identify creative expressions (a positive aspect of writing) and non-pragmatic ones (a negative aspect). Overall, frontier LLMs perform well above chance but leave room for improvement, struggling especially to identify non-pragmatic expressions. We further find that LLM-as-a-Judge novelty scores from the best-performing model were predictive of expert writer preferences.
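For readers unfamiliar with the metric, here is a minimal sketch of one common way to compute n-gram novelty: the fraction of a text's n-grams that do not appear in a reference corpus. The abstract does not give the authors' exact definition, so the function names (`ngrams`, `ngram_novelty`), the lowercased whitespace tokenization, and the default `n=3` are all assumptions made for illustration.

```python
from typing import Iterable, List, Set, Tuple


def ngrams(tokens: List[str], n: int) -> List[Tuple[str, ...]]:
    """Return all contiguous n-grams of a token list, in order."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_novelty(text: str, reference_corpus: Iterable[str], n: int = 3) -> float:
    """Fraction of n-grams in `text` that never occur in `reference_corpus`.

    Assumption: tokens are lowercased, whitespace-separated words. A real
    evaluation would likely use a proper tokenizer and a much larger corpus.
    """
    # Collect every n-gram seen anywhere in the reference corpus.
    seen: Set[Tuple[str, ...]] = set()
    for doc in reference_corpus:
        seen.update(ngrams(doc.lower().split(), n))

    candidate = ngrams(text.lower().split(), n)
    if not candidate:
        # Text shorter than n tokens has no n-grams; report zero novelty.
        return 0.0

    novel = sum(1 for gram in candidate if gram not in seen)
    return novel / len(candidate)


if __name__ == "__main__":
    corpus = ["the cat sat on the mat"]
    print(ngram_novelty("the cat sat on the moon", corpus, n=3))
```

A score near 1.0 only says that the wording is unseen in the reference corpus. As the abstract cautions, most high-novelty expressions were not judged creative by expert writers, so a score like this says nothing about whether the text is sensical or pragmatic.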
Sep-29-2025
- Country:
  - Asia
    - Middle East
      - Jordan (0.04)
      - UAE > Abu Dhabi Emirate
        - Abu Dhabi (0.04)
    - Singapore (0.04)
    - Thailand > Bangkok
      - Bangkok (0.04)
  - Europe
    - Austria > Vienna (0.14)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
      - Greater London > London (0.04)
      - Oxfordshire > Oxford (0.04)
  - North America > United States
    - California > Ventura County
      - Thousand Oaks (0.04)
    - Florida > Miami-Dade County
      - Miami (0.04)
    - Minnesota (0.04)
    - New York > Suffolk County
      - Stony Brook (0.04)
- Genre:
  - Research Report
    - Experimental Study (1.00)
    - New Finding (0.95)
- Industry:
  - Health & Medicine (0.93)
- Technology: