LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback
Banerjee, Tanushree, Zhu, Richard, Yang, Runzhe, Narasimhan, Karthik
–arXiv.org Artificial Intelligence
Large Language Models (LLMs) excel at generating human-like dialogues and comprehending text. However, understanding the subtleties of complex exchanges in language remains a challenge. We propose a bootstrapping framework that leverages self-generated feedback to enhance LLM reasoning capabilities for lie detection. The framework consists of three stages: suggestion, feedback collection, and modification. In the suggestion stage, a cost-effective language model generates initial predictions based on game state and dialogue. The feedback-collection stage involves a language model providing feedback on these predictions. In the modification stage, a more advanced language model refines the initial predictions using the auto-generated feedback. We investigate the application of the proposed framework for detecting betrayal and deception in Diplomacy games, and compare it with feedback from professional human players. The LLM-generated feedback exhibits superior quality and significantly enhances the performance of the model. Our approach achieves a 39% improvement over the zero-shot baseline in lying-F1 without the need for any training data, rivaling state-of-the-art supervised learning results.
arXiv.org Artificial Intelligence
Aug-25-2024
- Country:
- North America
- Europe
- Denmark (0.14)
- United Kingdom > England (0.06)
- Russia (0.06)
- Austria (0.05)
- North Sea (0.04)
- Norway (0.04)
- Sweden (0.04)
- Greece (0.04)
- Belgium (0.04)
- Bulgaria (0.04)
- Albania (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.05)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Italy > Friuli Venezia Giulia
- Trieste Province > Trieste (0.04)
- Atlantic Ocean > North Atlantic Ocean
- North Sea (0.04)
- English Channel (0.04)
- Asia
- Russia (0.06)
- Middle East
- Republic of Türkiye (0.05)
- Jordan (0.04)
- China > Beijing
- Beijing (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology: