InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding
–Neural Information Processing Systems
Soft prompt tuning achieves superior performances across a wide range of few-shot tasks. However, the performances of prompt tuning can be highly sensitive to the initialization of the prompts. We have also empirically observed that conventional prompt tuning methods cannot encode and learn sufficient task-relevant information from prompt tokens. In this work, we develop an information-theoretic framework that formulates soft prompt tuning as maximizing the mutual information between prompts and other model parameters (or encoded representations). This novel view helps us to develop a more efficient, accurate and robust soft prompt tuning method, InfoPrompt.
Neural Information Processing Systems
Jan-19-2025, 21:10:53 GMT
- Technology: