Goto

Collaborating Authors

 negative caption




DesCo: Learning Object Recognition with Rich Language Descriptions

Neural Information Processing Systems

Recent development in vision-language approaches has instigated a paradigm shift in learning visual recognition models from language supervision. These approaches align objects with language queries (e.g. "a photo of a cat") and thus improve the models' adaptability to novel objects and domains.








DesCo: Learning Object Recognition with Rich Language Descriptions

Neural Information Processing Systems

Recent development in vision-language approaches has instigated a paradigm shift in learning visual recognition models from language supervision. These approaches align objects with language queries (e.g. "a photo of a cat") and thus improve the models' adaptability to novel objects and domains.