Adaptive Cross-Modal Few-shot Learning

Chen Xing, Negar Rostamzadeh, Boris Oreshkin, Pedro O. O. Pinheiro

Jan-27-2025, 09:31:17 GMT–Neural Information Processing Systems

Metric-based meta-learning techniques have successfully been applied to fewshot classification problems. In this paper, we propose to leverage cross-modal information to enhance metric-based few-shot learning methods. Visual and semantic feature spaces have different structures by definition. For certain concepts, visual features might be richer and more discriminative than text ones. While for others, the inverse might be true. Moreover, when the support from visual information is limited in image classification, semantic representations (learned from unsupervised text corpora) can provide strong prior knowledge and context to help learning.

artificial intelligence, latexit sha1, machine learning, (17 more...)

Neural Information Processing Systems

Jan-27-2025, 09:31:17 GMT

Conferences PDF

Add feedback

Country:
- North America > Canada (0.29)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)