Thoughts on Objectives of Sparse and Hierarchical Masked Image Model

Miyazaki, Asahi, Okita, Tsuyoshi

arXiv.org Artificial Intelligence 

Masked image modeling is one of the most poplular objectives of training. Recently, the SparK model has been proposed with superior performance among self-supervised learning models. This paper proposes a new mask pattern for this SparK model, proposing it as the Mesh Mask-ed SparK model. We report the effect of the mask pattern used for image masking in pre-training on performance.