EgocentricVideo-LanguagePretraining

Neural Information Processing Systems 

As illustrated in Tab. 1, the formerly largest egocentric video dataset EPICKITCHENS-100 [14] focuses on kitchens scenarios and its size is far smaller than those of the 3rd-person pretraining sets WebVid-2M [3] and HowTo100M [10].

Similar Docs  Excel Report  more

TitleSimilaritySource
None found