Ahead-of-Time P-Tuning
Gavrilov, Daniil, Balagansky, Nikita
–arXiv.org Artificial Intelligence
In this paper, we propose Ahead-of-Time (AoT) P-Tuning, a novel parameter-efficient fine-tuning method for pre-trained Language Models (LMs) that adds input-dependent bias before each Transformer layer. We evaluate AoT P-Tuning on GLUE and SuperGLUE benchmarking datasets using RoBERTa and DeBERTa models, showing that it outperforms BitFit and is comparable or better than other baseline methods for efficient fine-tuning. Additionally, we assess the inference overhead of AoT P-Tuning and demonstrate that it introduces negligible overhead compared to established baseline methods. Our method enables multi-task inference with a single backbone LM, making it a practical solution for real-world applications.
arXiv.org Artificial Intelligence
May-18-2023
- Country:
- South America > Brazil (0.04)
- Africa > Niger (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Massachusetts (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > San Diego County
- San Diego (0.04)
- Europe
- United Kingdom (0.04)
- Romania > Sud - Muntenia Development Region
- Giurgiu County > Giurgiu (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.46)
- Technology: