AITopics | collapse

Collaborating Authors

collapse

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Structured Federated Learning through Clustered Additive Modeling

Neural Information Processing SystemsDec-26-2025, 06:53:32 GMT

Heterogeneous federated learning without assuming any structure is challenging due to the conflicts among non-identical data distributions of clients. In practice, clients often comprise near-homogeneous clusters so training a server-side model per cluster mitigates the conflicts. However, FL with client clustering often suffers from "clustering collapse'', i.e., one cluster's model excels on increasing clients, and reduces to single-model FL. Moreover, cluster-wise models hinder knowledge sharing between clusters and each model depends on fewer clients. Furthermore, the static clustering assumption on data may not hold for dynamically changing models, which are sensitive to cluster imbalance/initialization or outliers.

clustered additive modeling, name change, structured federated learning, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.42)

Add feedback

Neural (Tangent Kernel) Collapse

Neural Information Processing SystemsDec-24-2025, 13:27:34 GMT

This work bridges two important concepts: the Neural Tangent Kernel (NTK), which captures the evolution of deep neural networks (DNNs) during training, and the Neural Collapse (NC) phenomenon, which refers to the emergence of symmetry and structure in the last-layer features of well-trained classification DNNs. We adopt the natural assumption that the empirical NTK develops a block structure aligned with the class labels, i.e., samples within the same class have stronger correlations than samples from different classes. Under this assumption, we derive the dynamics of DNNs trained with mean squared (MSE) loss and break them into interpretable phases. Moreover, we identify an invariant that captures the essence of the dynamics, and use it to prove the emergence of NC in DNNs with block-structured NTK. We provide large-scale numerical experiments on three common DNN architectures and three benchmark datasets to support our theory.

collapse, name change, tangent kernel, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.61)

Add feedback

Linguistic Collapse: Neural Collapse in (Large) Language Models

Neural Information Processing SystemsMay-27-2025, 21:35:19 GMT

Neural collapse ( \mathcal{NC}) is a phenomenon observed in classification tasks where top-layer representations collapse into their class means, which become equinorm, equiangular and aligned with the classifiers.These behaviors -- associated with generalization and robustness -- would manifest under specific conditions: models are trained towards zero loss, with noise-free labels belonging to balanced classes, which do not outnumber the model's hidden dimension.Recent studies have explored \mathcal{NC} in the absence of one or more of these conditions to extend and capitalize on the associated benefits of ideal geometries.Language modeling presents a curious frontier, as \textit{training by token prediction} constitutes a classification task where none of the conditions exist: the vocabulary is imbalanced and exceeds the embedding dimension; different tokens might correspond to similar contextual embeddings; and large language models (LLMs) in particular are typically only trained for a few epochs.This paper empirically investigates the impact of scaling the architectures and training of causal language models (CLMs) on their progression towards \mathcal{NC} .We find that \mathcal{NC} properties that develop with scale (and regularization) are linked to generalization.Moreover, there is evidence of some relationship between \mathcal{NC} and generalization independent of scale.Our work thereby underscores the generality of \mathcal{NC} as it extends to the novel and more challenging setting of language modeling.Downstream, we seek to inspire further research on the phenomenon to deepen our understanding of LLMs -- and neural networks at large -- and improve existing architectures based on \mathcal{NC} -related properties.

collapse, linguistic collapse, mathcal, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents

Neural Information Processing SystemsMay-27-2025, 16:28:25 GMT

As AI systems pervade human life, ensuring that large language models (LLMs) make safe decisions remains a significant challenge. We introduce the Governance of the Commons Simulation (GovSim), a generative simulation platform designed to study strategic interactions and cooperative decision-making in LLMs. In GovSim, a society of AI agents must collectively balance exploiting a common resource with sustaining it for future use. This environment enables the study of how ethical considerations, strategic planning, and negotiation skills impact cooperative outcomes. We develop an LLM-based agent architecture and test it with the leading open and closed LLMs.

emergence, llm agent, sustainable cooperation, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Structured Federated Learning through Clustered Additive Modeling

Neural Information Processing SystemsJan-19-2025, 12:41:01 GMT

clustered additive modeling, fed-cam algorithm, structured federated learning, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.77)

Add feedback

Out-of-Distribution Detection with An Adaptive Likelihood Ratio on Informative Hierarchical VAE

Neural Information Processing SystemsOct-10-2024, 13:20:13 GMT

Unsupervised out-of-distribution (OOD) detection is essential for the reliability of machine learning. In the literature, existing work has shown that higher-level semantics captured by hierarchical VAEs can be used to detect OOD instances.However, we empirically show that, the inherent issue of hierarchical VAEs, i.e., posterior collapse'', would seriously limit their capacity for OOD detection.Based on a thorough analysis forposterior collapse'', we propose a novel informative hierarchical VAE to alleviate this issue through enhancing the connections between the data sample and its multi-layer stochastic latent representations during training.Furthermore, we propose a novel score function for unsupervised OOD detection, referred to as Adaptive Likelihood Ratio. With this score function, one can selectively aggregate the semantic information on multiple hidden layers of hierarchical VAEs, leading to a strong separability between in-distribution and OOD samples. Experimental results demonstrate that our method can significantly outperform existing state-of-the-art unsupervised OOD detection approaches.

adaptive likelihood ratio, informative hierarchical vae, out-of-distribution detection, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.84)

Add feedback

Robot Gets Tired After Day's Work, Collapses: Watch

#artificialintelligenceApr-12-2023, 03:40:15 GMT

Viral Video: It has been a long time since we started availing the services of robots, the electronic humans, perhaps the first ever non-official definition of the wonder machine. Now, robotics is very much in vogue and has mass use across industries. One of the key reasons to deploy these programmable machines is their high efficiency and the ability to work for longer hours than humans without getting tired. However, a video has surfaced showing a robot placing plastic containers on a conveyor belt. The video is in a time-lapse, suggesting that the robot has been on the job for hours and the last few frames show the real-time where the machine picks up a container and as soon as it lifts it, it collapses.

collapse, robot, video, (9 more...)

#artificialintelligence

Country: Asia > India (0.43)

Industry: Media (0.36)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback