PoBRL: Optimizing Multi-Document Summarization by Blending Reinforcement Learning Policies
Su, Andy, Su, Difei, Mulvey, John M., Poor, H. Vincent
–arXiv.org Artificial Intelligence
We propose a novel reinforcement learning based framework PoBRL for solving multi-document summarization. PoBRL jointly optimizes over the following three objectives necessary for a high-quality summary: importance, relevance, and length. Our strategy decouples this multi-objective optimization into different subproblems that can be solved individually by reinforcement learning. Utilizing PoBRL, we then blend each learned policies together to produce a summary that is a concise and complete representation of the original input. Our empirical analysis shows state-of-the-art performance on several multi-document datasets. Human evaluation also shows that our method produces high-quality output.
arXiv.org Artificial Intelligence
May-17-2021
- Country:
- Oceania > Australia
- North America
- Canada > British Columbia (0.04)
- United States
- District of Columbia > Washington (0.04)
- Colorado (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > Los Angeles County
- Los Angeles (0.14)
- Long Beach (0.04)
- Europe
- Germany > Berlin (0.04)
- Netherlands (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy > Tuscany
- Florence (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Genre:
- Research Report (0.64)
- Technology: