APRIL: Interactively Learning to Summarise by Combining Active Preference Learning and Reinforcement Learning

Gao, Yang, Meyer, Christian M., Gurevych, Iryna

Aug-29-2018–arXiv.org Artificial Intelligence

We propose a method to perform automatic document summarisation without using reference summaries. Instead, our method interactively learns from users' preferences. The merit of preference-based interactive summarisation is that preferences are easier for users to provide than reference summaries. Existing preference-based interactive learning methods suffer from high sample complexity, i.e. they need to interact with the oracle for many rounds in order to converge. In this work, we propose a new objective function, which enables us to leverage active learning, preference learning and reinforcement learning techniques in order to reduce the sample complexity. Both simulation and real-user experiments suggest that our method significantly advances the state of the art. Our source code is freely available at https://github.com/UKPLab/

artificial intelligence, machine learning, reinforcement learning, (15 more...)

arXiv.org Artificial Intelligence

Aug-29-2018

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America
  - United States
    - New York (0.04)
    - Indiana (0.04)
    - Massachusetts > Hampshire County
      - Amherst (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California > Los Angeles County
      - Long Beach (0.04)
    - Arizona > Maricopa County
      - Phoenix (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.14)
- Europe
  - United Kingdom > Scotland (0.04)
  - Netherlands (0.04)
  - Spain
    - Catalonia > Barcelona Province
      - Barcelona (0.04)
    - Andalusia > Granada Province
      - Granada (0.04)
  - Slovenia > Upper Carniola
    - Municipality of Bled > Bled (0.04)
  - Germany
    - Berlin (0.04)
    - North Rhine-Westphalia > Cologne Region
      - Bonn (0.04)
    - Hesse > Darmstadt Region
      - Darmstadt (0.04)
- Asia
  - South Korea (0.04)
  - Middle East > Qatar
    - Ad-Dawhah > Doha (0.04)
- Africa > Middle East
  - Libya > Tripoli District > Tripoli (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Government
  - Immigration & Customs (0.93)
  - Regional Government > North America Government
    - United States Government (0.68)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found