Zero-shot Conversational Summarization Evaluations with small Large Language Models

Manuvinakurike, Ramesh, Sahay, Saurav, Manepalli, Sangeeta, Nachman, Lama

Nov-29-2023–arXiv.org Artificial Intelligence

However, their capabilities on conversational summarization remains under explored. In this work we evaluate LLMs ( 10 billion parameters) on conversational summarization and showcase their performance on various prompts. We show that the summaries generated by models depend on the instructions and the performance of LLMs vary with different instructions sometimes resulting steep drop in ROUGE scores if prompts are not selected carefully. We also evaluate the models with human evaluations and discuss the limitations of the models on conversational summarization.

dialogue, dialogue 0, summarize, (15 more...)

arXiv.org Artificial Intelligence

Nov-29-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.67)
- Europe
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
- Asia > China
  - Beijing > Beijing (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine (1.00)
- Government > Regional Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)