Visual Tuning
Yu, Bruce X. B., Chang, Jianlong, Wang, Haixin, Liu, Lingbo, Wang, Shijie, Wang, Zhiyu, Lin, Junfan, Xie, Lingxi, Li, Haojie, Lin, Zhouchen, Tian, Qi, Chen, Chang Wen
arXiv.org Artificial Intelligence
Fine-tuning visual models has been widely shown to deliver promising performance on many downstream visual tasks. With the surprising development of pre-trained visual foundation models, visual tuning has moved beyond the standard modus operandi of fine-tuning either the whole pre-trained model or just the fully connected layer. Instead, recent methods can outperform full tuning of all pre-trained parameters while updating far fewer of them, enabling edge devices and downstream applications to reuse the increasingly large foundation models deployed on the cloud. To help researchers get the full picture and future directions of visual tuning, this survey characterizes a large and thoughtful selection of recent works, providing a systematic and comprehensive overview of existing work and models. Specifically, it provides a detailed background of visual tuning and categorizes recent visual tuning techniques into five groups: fine-tuning, prompt tuning, adapter tuning, parameter tuning, and remapping tuning. Meanwhile, it offers some exciting research directions for prospective pre-training and various interactions in visual tuning.
May-10-2023
- Country:
  - Europe
    - Greece (0.04)
    - Switzerland > Zürich
      - Zürich (0.14)
    - Netherlands > North Holland
      - Amsterdam (0.04)
  - Asia
    - Middle East
      - Jordan (0.04)
      - Israel > Tel Aviv District
        - Tel Aviv (0.04)
    - China
      - Liaoning Province > Dalian (0.04)
      - Hong Kong (0.04)
- Genre:
  - Research Report (1.00)
  - Overview (1.00)
- Industry:
  - Information Technology > Security & Privacy (1.00)
  - Education (1.00)
  - Health & Medicine > Therapeutic Area (0.67)
- Technology:
  - Information Technology
    - Sensing and Signal Processing > Image Processing (1.00)
    - Security & Privacy (1.00)
    - Data Science (1.00)
    - Communications > Networks (1.00)
    - Artificial Intelligence
      - Vision (1.00)
      - Robots (1.00)
      - Representation & Reasoning (1.00)
      - Cognitive Science (1.00)
      - Natural Language > Large Language Model (0.93)
      - Machine Learning
        - Neural Networks > Deep Learning (1.00)
        - Statistical Learning (0.67)