DesCo: Learning Object Recognition with Rich Language Descriptions

Neural Information Processing Systems 

Recent development in vision-language approaches has instigated a paradigm shift in learning visual recognition models from language supervision. These approaches align objects with language queries (e.g. "a photo of a cat") and thus improve the models' adaptability to novel objects and domains.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found