Zero-Shot Learning Through Cross-Modal Transfer
Socher, Richard, Ganjoo, Milind, Manning, Christopher D., Ng, Andrew
–Neural Information Processing Systems
This work introduces a model that can recognize objects in images even if no training data is available for the object class. The only necessary knowledge about unseen categories comes from unsupervised text corpora. Unlike previous zero-shot learning models, which can only differentiate between unseen classes, our model can operate on a mixture of objects, simultaneously obtaining state of the art performance on classes with thousands of training images and reasonable performance on unseen classes. This is achieved by seeing the distributions of words in texts as a semantic space for understanding what objects look like. Our deep learning model does not require any manually defined semantic or visual features for either words or images.
Neural Information Processing Systems
Feb-14-2020, 15:57:59 GMT
- Technology: