Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting

Li, Yufei, Nham, John, Jawahar, Ganesh, Shu, Lei, Uthus, David, Sung, Yun-Hsuan, Yang, Chengrun, Rolnick, Itai, Qiao, Yi, Liu, Cong

Mar-9-2025–arXiv.org Artificial Intelligence

Generic text rewriting is a prevalent large language model (LLM) application that covers diverse real-world tasks, such as style transfer, fact correction, and email editing. These tasks vary in rewriting objectives (e.g., factual consistency vs. semantic preservation), making it challenging to develop a unified model that excels across all dimensions. Existing methods often specialize in either a single task or a specific objective, limiting their generalizability. In this work, we introduce a generic model proficient in factuality, stylistic, and conversational rewriting tasks. To simulate real-world user rewrite requests, we construct a conversational rewrite dataset, ChatRewrite, that presents ``natural''-sounding instructions, from raw emails using LLMs. Combined with other popular rewrite datasets, including LongFact for the factuality rewrite task and RewriteLM for the stylistic rewrite task, this forms a broad benchmark for training and evaluating generic rewrite models. To align with task-specific objectives, we propose Dr Genre, a Decoupled-reward learning framework for Generic rewriting, that utilizes objective-oriented reward models with a task-specific weighting. Evaluation shows that \approach delivers higher-quality rewrites across all targeted tasks, improving objectives including instruction following (agreement), internal consistency (coherence), and minimal unnecessary edits (conciseness).

email, instruction, rewrite instruction, (11 more...)

arXiv.org Artificial Intelligence

Mar-9-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Hawaii (0.04)
  - California > San Francisco County
    - San Francisco (0.04)
- Europe > Middle East
  - Malta > Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Asia
  - China (0.28)
  - East Asia (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre:
- Personal (1.00)
- Research Report (0.63)

Industry:
- Health & Medicine > Therapeutic Area (0.93)
- Leisure & Entertainment > Games
  - Computer Games (0.94)
- Government
  - Regional Government (0.68)
  - Military (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found