JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Afzal, Anum, Mercier, Alexandre, Matthes, Florian
–arXiv.org Artificial Intelligence
Online platforms are increasingly interested in using Data-to-Text technologies to generate content and help their users. Unfortunately, traditional generative methods often fall into repetitive patterns, resulting in monotonous galleries of texts after only a few iterations. In this paper, we investigate LLM-based data-to-text approaches to automatically generate marketing texts that are of sufficient quality and diverse enough for broad adoption. We leverage Language Models such as T5, GPT -3.5, GPT -4, and LLaMa2 in conjunction with fine-tuning, few-shot, and zero-shot approaches to set a baseline for diverse marketing texts. We also introduce a metric JaccDiv to evaluate the diversity of a set of texts. This research extends its relevance beyond the music industry, proving beneficial in various fields where repetitive automated content generation is prevalent.
arXiv.org Artificial Intelligence
Apr-30-2025
- Country:
- Asia
- Europe
- Germany
- Bavaria > Upper Bavaria
- Munich (0.04)
- North Rhine-Westphalia > Upper Bavaria
- Munich (0.04)
- Bavaria > Upper Bavaria
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Netherlands (0.04)
- Norway > Norwegian Sea (0.04)
- Spain > Galicia
- A Coruña Province > Santiago de Compostela (0.04)
- Germany
- North America
- Canada > Ontario
- Toronto (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States (0.04)
- Canada > Ontario
- South America
- Genre:
- Research Report (1.00)
- Technology: