Visual Prompt Tuning for Generative Transfer Learning

Sohn, Kihyuk, Hao, Yuan, Lezama, José, Polania, Luisa, Chang, Huiwen, Zhang, Han, Essa, Irfan, Jiang, Lu

Oct-3-2022–arXiv.org Artificial Intelligence

Transferring knowledge from an image synthesis model trained on a large dataset is a promising direction for learning generative image models from various domains efficiently. While previous works have studied GAN models, we present a recipe for learning vision transformers by generative knowledge transfer. We base our framework on state-of-the-art generative vision transformers that represent an image as a sequence of visual tokens to the autoregressive or non-autoregressive transformers. To adapt to a new domain, we employ prompt tuning, which prepends learnable tokens called prompt to the image token sequence, and introduce a new prompt design for our task. We study on a variety of visual domains, including visual task adaptation benchmark~\cite{zhai2019large}, with varying amount of training images, and show effectiveness of knowledge transfer and a significantly better image generation quality over existing works.

artificial intelligence, machine learning, transformer, (15 more...)

arXiv.org Artificial Intelligence

Oct-3-2022

arXiv.org PDF

Add feedback

Country:
- Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Genre:
- Research Report (0.64)
- Overview (0.46)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found