Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Neural Information Processing Systems 

Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for training models with broad, general capabilities for text, images, and other modalities.
