CovScore: Evaluation of Multi-Document Abstractive Title Set Generation
–arXiv.org Artificial Intelligence
This paper introduces CovScore, an automatic reference-less methodology for evaluating thematic title sets extracted from a corpus of documents. While such extraction methods are widely used, how to evaluate their effectiveness remains an open question. Moreover, some existing practices rely heavily on slow and laborious human annotation procedures. Inspired by recently introduced LLM-based judge methods, we propose a novel methodology that decomposes quality into five main metrics, each covering a different aspect of evaluation. This framing simplifies and expedites the manual evaluation process and enables automatic and independent LLM-based evaluation. As a test case, we apply our approach to a corpus of Holocaust survivor testimonies, motivated both by its relevance to title set extraction and by the moral significance of this pursuit. We validate the methodology by experimenting with naturalistic and synthetic title set generation systems and comparing their performance under the methodology.
Jul-24-2024
- Country:
  - Asia
    - Middle East
      - Israel > Jerusalem District
        - Jerusalem (0.04)
      - Jordan (0.04)
    - Nepal (0.04)
  - Europe
    - Hungary > Budapest
      - Budapest (0.04)
    - Poland (0.05)
    - United Kingdom > England
      - Oxfordshire > Oxford (0.04)
  - North America > United States
    - Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
  - Research Report > New Finding (0.46)
- Industry:
  - Government (1.00)
  - Law > Civil Rights & Constitutional Law (0.46)
- Technology: