Neural Information Processing Systems
In essence, MAE proposed an asymmetric encoder-decoder architecture for MIM, where the encoder (e.g., a standard ViT model [17]) operates only on visible patches, and the light-weight decoder recovers all patches for mask prediction.
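The asymmetry described above can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: `toy_encoder` and `toy_decoder` are hypothetical stand-ins for the ViT encoder and the light-weight decoder, the 0.75 mask ratio is an assumed default, and the shared zero-valued mask token is an assumed placeholder for what would be a learned vector. Positional embeddings are omitted. The point is only the data flow: the encoder sees the visible subset, while the decoder sees a token for every patch position.

```python
import random


def random_masking(num_patches, mask_ratio, seed=0):
    """Partition patch indices into (visible, masked), both sorted."""
    rng = random.Random(seed)
    perm = list(range(num_patches))
    rng.shuffle(perm)
    num_keep = num_patches - int(num_patches * mask_ratio)
    return sorted(perm[:num_keep]), sorted(perm[num_keep:])


def toy_encoder(patches):
    # Hypothetical stand-in for a ViT encoder: a fixed per-patch transform.
    return [[2.0 * x for x in patch] for patch in patches]


def toy_decoder(tokens):
    # Hypothetical stand-in for the light-weight decoder.
    return [[x + 1.0 for x in token] for token in tokens]


def mae_forward(patches, mask_ratio=0.75, seed=0):
    """Run the asymmetric pipeline on a list of equal-length patch vectors."""
    num_patches, dim = len(patches), len(patches[0])
    visible, masked = random_masking(num_patches, mask_ratio, seed)

    # Encoder: operates only on the visible patches.
    encoded = toy_encoder([patches[i] for i in visible])

    # Decoder input: one token per patch position. Masked positions get an
    # assumed shared mask token; visible positions get their encoded token.
    mask_token = [0.0] * dim
    full_sequence = [list(mask_token) for _ in range(num_patches)]
    for index, token in zip(visible, encoded):
        full_sequence[index] = token

    # Decoder: produces an output for all patch positions.
    reconstruction = toy_decoder(full_sequence)
    return reconstruction, visible, masked, len(encoded)


if __name__ == "__main__":
    demo_patches = [[float(i)] * 4 for i in range(16)]
    recon, visible, masked, num_encoded = mae_forward(demo_patches)
    print("encoder inputs:", num_encoded, "decoder outputs:", len(recon))
```

In this sketch the encoder processes 4 of 16 patches while the decoder emits 16 outputs, which is where the compute saving of the asymmetric design would come from when the encoder is the larger network.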
Feb-10-2026, 04:05:20 GMT