CovScore: Evaluation of Multi-Document Abstractive Title Set Generation

Jul-24-2024–arXiv.org Artificial Intelligence

This paper introduces CovScore, an automatic reference-less methodology for evaluating thematic title sets, extracted from a corpus of documents. While such extraction methods are widely used, evaluating their effectiveness remains an open question. Moreover, some existing practices heavily rely on slow and laborious human annotation procedures. Inspired by recently introduced LLM-based judge methods, we propose a novel methodology that decomposes quality into five main metrics along different aspects of evaluation. This framing simplifies and expedites the manual evaluation process and enables automatic and independent LLM-based evaluation. As a test case, we apply our approach to a corpus of Holocaust survivor testimonies, motivated both by its relevance to title set extraction and by the moral significance of this pursuit. We validate the methodology by experimenting with naturalistic and synthetic title set generation systems and compare their performance with the methodology.

annotator, arxiv preprint arxiv, testimony, (12 more...)

arXiv.org Artificial Intelligence

Jul-24-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Minnesota > Hennepin County > Minneapolis (0.14)
- Europe
  - Poland (0.05)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Hungary > Budapest
    - Budapest (0.04)
- Asia
  - Nepal (0.04)
  - Middle East
    - Jordan (0.04)
    - Israel > Jerusalem District
      - Jerusalem (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Government (1.00)
- Law > Civil Rights & Constitutional Law (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (0.96)
  - Machine Learning > Neural Networks
    - Deep Learning (0.96)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found