7e487c72fce6e45879a78ee0872d991d-Paper-Conference.pdf

Neural Information Processing Systems 

In essence, MAE proposed an asymmetric encoder-decoder architecture for MIM, where the encoder (e.g., a standard ViT model [17])operates onlyonvisible patches, andthelight-weight decoder recoversallpatches for maskprediction.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found