Predicting masked tokens in stochastic locations improves masked image modeling

Open in new window