ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer
Saakyan, Arkadiy, Muresan, Smaranda
–arXiv.org Artificial Intelligence
While state-of-the-art language models excel at the style transfer task, current work does not address explainability of style transfer systems. Explanations could be generated using large language models such as GPT-3.5 and GPT-4, but the use of such complex systems is inefficient when smaller, widely distributed, and transparent alternatives are available. We propose a framework to augment and improve a formality style transfer dataset with explanations via model distillation from ChatGPT. To further refine the generated explanations, we propose a novel way to incorporate scarce expert human feedback using in-context learning (ICLEF: In-Context Learning from Expert Feedback) by prompting ChatGPT to act as a critic to its own outputs. We use the resulting dataset of 9,960 explainable formality style transfer instances (e-GYAFC) to show that current openly distributed instruction-tuned models (and, in some settings, ChatGPT) perform poorly on the task, and that fine-tuning on our high-quality dataset leads to significant improvements as shown by automatic evaluation. In human evaluation, we show that models much smaller than ChatGPT fine-tuned on our data align better with expert preferences. Finally, we discuss two potential applications of models fine-tuned on the explainable style transfer task: interpretable authorship verification and interpretable adversarial attacks on AI-generated text detectors.
arXiv.org Artificial Intelligence
Sep-15-2023
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Pennsylvania (0.04)
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Europe
- Italy
- Tuscany > Florence (0.04)
- Emilia-Romagna > Metropolitan City of Bologna
- Bologna (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy
- Asia
- China > Hong Kong (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.14)
- North America
- Genre:
- Research Report (0.64)
- Industry:
- Information Technology > Security & Privacy (0.34)
- Government > Military (0.34)
- Technology: