RISE: Leveraging Retrieval Techniques for Summarization Evaluation
–arXiv.org Artificial Intelligence
Evaluating automatically-generated text summaries is a challenging task. While there have been many interesting approaches, they still fall short of human evaluations. We present RISE, a new approach for evaluating summaries by leveraging techniques from information retrieval. RISE is first trained as a retrieval task using a dual-encoder retrieval setup, and can then be subsequently utilized for evaluating a generated summary given an input document, without gold reference summaries. RISE is especially well suited when working on new datasets where one may not have reference summaries available for evaluation. We conduct comprehensive experiments on the SummEval benchmark (Fabbri et al., 2021) and the results show that RISE has higher correlation with human evaluations compared to many past approaches to summarization evaluation. Furthermore, RISE also demonstrates data-efficiency and generalizability across languages.
arXiv.org Artificial Intelligence
May-22-2023
- Country:
- South America > Chile
- North America
- Dominican Republic (0.04)
- United States
- Pennsylvania (0.04)
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Europe
- Germany > Berlin (0.04)
- United Kingdom > England
- Lincolnshire > Scunthorpe (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy > Tuscany
- Florence (0.04)
- Asia
- China > Hong Kong (0.04)
- British Indian Ocean Territory > Diego Garcia (0.04)
- Genre:
- Research Report > New Finding (0.66)
- Technology: