Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
–Neural Information Processing Systems
Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for training models with broad, general capabilities for text, images, and other modalities.
Neural Information Processing Systems
Feb-9-2025, 14:39:04 GMT
- Genre:
- Research Report > New Finding (0.46)