zsl
HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning
Zero-shot learning (ZSL) tackles the problem of recognizing unseen classes by transferring semantic knowledge from seen classes to unseen ones. Typically, to guarantee effective knowledge transfer, a common (latent) space is adopted to associate the visual and semantic domains in ZSL. However, existing common space learning methods align the semantic and visual domains by merely mitigating distribution disagreement through one-step adaptation. This strategy is usually ineffective due to the heterogeneous nature of the feature representations in the two domains, which intrinsically contain both distribution and structure variations. To address this and advance ZSL, we propose a novel hierarchical semantic-visual adaptation (HSVA) framework.
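For intuition, below is a minimal PyTorch sketch of the one-step common-space baseline the abstract critiques: both domains are projected once into a shared space and aligned with a single distribution-matching loss. The dimensions, module names, and the linear-kernel MMD objective are illustrative assumptions, not HSVA's architecture (which instead performs structure and distribution adaptation in separate hierarchical steps).

```python
import torch
import torch.nn as nn

# Hypothetical one-step common-space baseline (NOT HSVA itself):
# each domain is projected once into a shared space, and alignment
# is attempted with a single distribution-matching loss.
class CommonSpaceBaseline(nn.Module):
    def __init__(self, visual_dim=2048, semantic_dim=312, common_dim=512):
        super().__init__()
        self.visual_enc = nn.Sequential(nn.Linear(visual_dim, common_dim), nn.ReLU())
        self.semantic_enc = nn.Sequential(nn.Linear(semantic_dim, common_dim), nn.ReLU())

    def forward(self, v, s):
        return self.visual_enc(v), self.semantic_enc(s)

def mmd_loss(x, y):
    # Linear-kernel MMD: match only the first moments of the two domains,
    # which ignores the structure variations the abstract points out.
    return (x.mean(dim=0) - y.mean(dim=0)).pow(2).sum()

model = CommonSpaceBaseline()
v = torch.randn(32, 2048)   # visual features (e.g. CNN embeddings)
s = torch.randn(32, 312)    # class semantic vectors (e.g. attributes)
zv, zs = model(v, s)
loss = mmd_loss(zv, zs)     # one-step distribution alignment only
```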
Training Dynamics Underlying Language Model Scaling Laws: Loss Deceleration and Zero-Sum Learning
Andrei Mircea, Supriyo Chakraborty, Nima Chitsazan, Milind Naphade, Sambit Sahu, Irina Rish, Ekaterina Lobacheva
This work aims to understand how scaling improves language models, specifically in terms of training dynamics. We find that language models undergo loss deceleration early in training: an abrupt slowdown in the rate of loss improvement, resulting in piecewise linear behaviour of the loss curve in log-log space. Scaling up the model mitigates this transition by (1) decreasing the loss at which deceleration occurs, and (2) improving the log-log rate of loss improvement after deceleration. We attribute loss deceleration to a type of degenerate training dynamics we term zero-sum learning (ZSL). In ZSL, per-example gradients become systematically opposed, leading to destructive interference in per-example changes in loss. As a result, improving loss on one subset of examples degrades it on another, bottlenecking overall progress. Loss deceleration and ZSL provide new insights into the training dynamics underlying language model scaling laws, and could potentially be targeted directly to improve language models independent of scale. We make our code and artefacts available at: https://github.com/mirandrom/zsl
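As a rough illustration of how one might probe zero-sum learning as described above, the sketch below computes per-example gradients for a standard PyTorch classifier and measures their mean pairwise cosine similarity; systematically negative values would indicate the destructive interference the abstract describes. This is an assumed diagnostic, not the paper's actual measurement code (see the linked repository for that).

```python
import torch
import torch.nn.functional as F

def per_example_grads(model, xs, ys):
    # Collect one flattened gradient vector per training example.
    # (A loop is slow but keeps the idea explicit.)
    grads = []
    for x, y in zip(xs, ys):
        model.zero_grad()
        loss = F.cross_entropy(model(x.unsqueeze(0)), y.unsqueeze(0))
        loss.backward()
        g = torch.cat([p.grad.flatten() for p in model.parameters()])
        grads.append(g.clone())
    return torch.stack(grads)

def mean_pairwise_cosine(grads):
    # Average off-diagonal cosine similarity between per-example gradients.
    # Values near or below zero suggest per-example updates are opposed,
    # i.e. the zero-sum learning regime described in the abstract.
    g = F.normalize(grads, dim=1)
    sim = g @ g.T
    n = sim.shape[0]
    return (sim.sum() - n) / (n * (n - 1))
```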
Reviews: Semantic-Guided Multi-Attention Localization for Zero-Shot Learning
The problem is relevant, and the method is based on an interesting attention-based idea of looking at different regions of the image for the task of ZSL. The losses used focus on (i) making each attention map peaky while keeping different maps diverse, (ii) an embedding-based softmax for better prediction, and (iii) a class-center triplet loss that pulls features closer to their respective class centers relative to the other class centers. Line 190 mentions that the image and parts are sent to "separate backbone networks", which implies that the network parameters are not shared. If that is the case, then the method has 3x the parameters of competing methods, i.e., a significantly higher-capacity network overall. What happens when the CNN parameters are shared? And what happens when the image-only baseline has a higher-capacity backbone network (which is also then end-to-end finetuned)?
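To make loss (iii) concrete, here is a minimal PyTorch sketch of a class-center triplet loss of the kind the review describes; the margin value, the Euclidean distance metric, and the center parameterization are assumptions rather than the reviewed paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def class_center_triplet(features, labels, centers, margin=0.5):
    # features: (batch, dim), labels: (batch,), centers: (num_classes, dim).
    # Pull each feature toward its own class center and push it away
    # from the closest other center by at least `margin` (assumed value).
    d = torch.cdist(features, centers)                 # (batch, num_classes)
    pos = d.gather(1, labels.unsqueeze(1)).squeeze(1)  # distance to own center
    d_masked = d.scatter(1, labels.unsqueeze(1), float('inf'))
    neg = d_masked.min(dim=1).values                   # closest other center
    return F.relu(pos - neg + margin).mean()
```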