APRIL: Interactively Learning to Summarise by Combining Active Preference Learning and Reinforcement Learning
Gao, Yang, Meyer, Christian M., Gurevych, Iryna
arXiv.org Artificial Intelligence
We propose a method to perform automatic document summarisation without using reference summaries. Instead, our method interactively learns from users' preferences. The merit of preference-based interactive summarisation is that preferences are easier for users to provide than reference summaries. Existing preference-based interactive learning methods suffer from high sample complexity, i.e. they need to interact with the oracle for many rounds in order to converge. In this work, we propose a new objective function, which enables us to leverage active learning, preference learning and reinforcement learning techniques in order to reduce the sample complexity. Both simulation and real-user experiments suggest that our method significantly advances the state of the art. Our source code is freely available at https://github.com/UKPLab/
Aug-29-2018
- Country:
- Africa > Middle East
- Libya > Tripoli District > Tripoli (0.04)
- Asia
- Middle East > Qatar
- South Korea (0.04)
- Europe
- Germany
- Berlin (0.04)
- Hesse > Darmstadt Region
- Darmstadt (0.04)
- North Rhine-Westphalia > Cologne Region
- Bonn (0.04)
- Netherlands (0.04)
- Slovenia > Upper Carniola
- Municipality of Bled > Bled (0.04)
- Spain
- Andalusia > Granada Province
- Granada (0.04)
- Catalonia > Barcelona Province
- Barcelona (0.04)
- United Kingdom > Scotland (0.04)
- North America
- Canada > British Columbia
- United States
- Arizona > Maricopa County
- Phoenix (0.04)
- California > Los Angeles County
- Long Beach (0.04)
- Indiana (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- New York (0.04)
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Technology: