AITopics | classifier agreement

Collaborating Authors

classifier agreement

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Your Classifier can Secretly Suffice Multi-Source Domain Adaptation

Neural Information Processing SystemsDec-23-2025, 22:15:58 GMT

Multi-Source Domain Adaptation (MSDA) deals with the transfer of task knowledge from multiple labeled source domains to an unlabeled target domain, under a domain-shift. Existing methods aim to minimize this domain-shift using auxiliary distribution alignment objectives. In this work, we present a different perspective to MSDA wherein deep models are observed to implicitly align the domains under label supervision. Thus, we aim to utilize implicit alignment without additional training objectives to perform adaptation. To this end, we use pseudo-labeled target samples and enforce a classifier agreement on the pseudo-labels, a process called Self-supervised Implicit Alignment (SImpAl). We find that SImpAl readily works even under category-shift among the source domains. Further, we propose classifier agreement as a cue to determine the training convergence, resulting in a simple training algorithm. We provide a thorough evaluation of our approach on five benchmarks, along with detailed insights into each component of our approach.

multi-source domain adaptation, name change, secretly suffice multi-source domain adaptation, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.41)

Add feedback

Supplementary: Y our Classifier can Secretly Suffice Multi-Source Domain Adaptation

Neural Information Processing SystemsOct-2-2025, 14:47:55 GMT

Owing to the limits of space, we present a summary of results on DomainNet in the paper. The results for the prior arts are reported from [9]. Finally, we study thresholding schemes. We find that SImpAl works well even under category-shift. Our approach exhibits a relatively lower drop in accuracy.

alignment, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

To all the reviewers

Neural Information Processing SystemsOct-2-2025, 14:47:36 GMT

We thank the reviewers for their valuable suggestions to improve the draft. We address the concerns below. Our prime contribution is in the form of insights that lead to a simple design, which makes our work different. Likewise, we show that even under category-shift (Sec. Sec. 2), which is relatively less explored in MSDA.

artificial intelligence, reviewer, suppl, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.32)

Add feedback

Linguistic Collapse: Neural Collapse in (Large) Language Models

Wu, Robert, Papyan, Vardan

arXiv.org Machine LearningMay-27-2024

Neural collapse ($\mathcal{NC}$) is a phenomenon observed in classification tasks where top-layer representations collapse into their class means, which become equinorm, equiangular and aligned with the classifiers. These behaviors -- associated with generalization and robustness -- would manifest under specific conditions: models are trained towards zero loss, with noise-free labels belonging to balanced classes, which do not outnumber the model's hidden dimension. Recent studies have explored $\mathcal{NC}$ in the absence of one or more of these conditions to extend and capitalize on the associated benefits of ideal geometries. Language modeling presents a curious frontier, as \textit{training by token prediction} constitutes a classification task where none of the conditions exist: the vocabulary is imbalanced and exceeds the embedding dimension; different tokens might correspond to similar contextual embeddings; and large language models (LLMs) in particular are typically only trained for a few epochs. This paper empirically investigates the impact of scaling the architectures and training of causal language models (CLMs) on their progression towards $\mathcal{NC}$. We find that $\mathcal{NC}$ properties that develop with scaling are linked to generalization. Moreover, there is evidence of some relationship between $\mathcal{NC}$ and generalization independent of scale. Our work therefore underscores the generality of $\mathcal{NC}$ as it extends to the novel and more challenging setting of language modeling. Downstream, we seek to inspire further research on the phenomenon to deepen our understanding of LLMs -- and neural networks at large -- and improve existing architectures based on $\mathcal{NC}$-related properties.

generalization, hidden dimension, neural collapse, (13 more...)

arXiv.org Machine Learning

2405.17767

Country: