AITopics | training class

How Classifier Features Transfer to Downstream: An Asymptotic Analysis in a Two-Layer Model

Neural Information Processing SystemsJun-14-2026, 10:31:20 GMT

Neural networks learn effective feature representations, which can be transferred to new tasks without additional training. While larger datasets are known to improve feature transfer, the theoretical conditions for the success of such transfer remain unclear. This work investigates feature transfer in networks trained for classification to identify the conditions that enable effective clustering in unseen classes. We first reveal that higher similarity between training and unseen distributions leads to improved Cohesion and Separability. We then show that feature expressiveness is enhanced when inputs are similar to the training classes, while the features of irrelevant inputs remain indistinguishable.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

Asia (0.28)
North America (0.27)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Leisure & Entertainment > Sports (1.00)
(5 more...)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(4 more...)

Add feedback

How Classifier Features Transfer to Downstream: An Asymptotic Analysis in a Two-Layer Model

Neural Information Processing SystemsJun-9-2026, 19:22:33 GMT

Neural networks learn effective feature representations, which can be transferred to new tasks without additional training. While larger datasets are known to improve feature transfer, the theoretical conditions for the success of such transfer remain unclear. This work investigates feature transfer in networks trained for classification to identify the conditions that enable effective clustering in unseen classes. We first reveal that higher similarity between training and unseen distributions leads to improved Cohesion and Separability. We then show that feature expressiveness is enhanced when inputs are similar to the training classes, while the features of irrelevant inputs remain indistinguishable.

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.41)

Add feedback

Learning to Propagate for Graph Meta-Learning

LU LIU, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

Neural Information Processing SystemsApr-30-2026, 19:37:30 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, prototype, (16 more...)

Neural Information Processing Systems

Country: Oceania > Australia (0.28)

Industry:

Information Technology (1.00)
Government (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Open Vocabulary 3D Occupancy Prediction from Images Supplementary Material

Neural Information Processing SystemsFeb-16-2026, 04:29:22 GMT

In this supplementary material, we first give additional details about the method in Sec. 1. Queries used for zero-shot semantic segmentation. We do this for all the annotated classes in the dataset (second column). One can see that, for example, class name'manmade' lacks descriptive specificity. In the text description of this class, we can find "... buildings, walls, guard rails, fences, poles, street signs, traffic lights ..." and more. Table 1: Queries used for zero-shot semantic segmentation.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Europe > Czechia > Prague (0.05)
Europe > France > Île-de-France > Paris > Paris (0.04)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Automobiles & Trucks (0.95)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
Information Technology > Artificial Intelligence > Machine Learning (0.47)

Add feedback

Learning to Propagate for Graph Meta-Learning

LU LIU, Tianyi Zhou, Guodong Long, Jing Jiang, Chengqi Zhang

Neural Information Processing SystemsFeb-11-2026, 07:30:51 GMT

Inmost meta-learning methods, tasks areimplicitly related bysharing parameters oroptimizer. We develop a novel meta-learner of this type for prototype based classification, in which a prototype is generated for each class, such that the nearest neighbor search among the prototypes produces an accurate classification.

artificial intelligence, machine learning, prototype, (16 more...)

Neural Information Processing Systems

Country:

Oceania > Australia (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

3322a9a72a1707de14badd5e552ff466-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 06:14:53 GMT

dataset, influence function, training sample, (15 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.15)
Asia > China > Shanghai > Shanghai (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California (0.04)

Genre:

Research Report > Experimental Study (0.94)
Research Report > New Finding (0.94)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Learning to Reconstruct Shapes from Unseen Classes

Xiuming Zhang, Zhoutong Zhang, Chengkai Zhang, Josh Tenenbaum, Bill Freeman, Jiajun Wu

Neural Information Processing SystemsNov-20-2025, 15:01:27 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, reconstruction, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback

Open Vocabulary 3D Occupancy Prediction from Images Supplementary Material

Neural Information Processing SystemsOct-9-2025, 02:53:11 GMT

In this supplementary material, we first give additional details about the method in Sec. 1. Queries used for zero-shot semantic segmentation. We do this for all the annotated classes in the dataset (second column). One can see that, for example, class name'manmade' lacks descriptive specificity. In the text description of this class, we can find "... buildings, walls, guard rails, fences, poles, street signs, traffic lights ..." and more. Table 1: Queries used for zero-shot semantic segmentation.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Europe > Czechia > Prague (0.05)
Europe > France > Île-de-France > Paris > Paris (0.04)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Automobiles & Trucks (0.95)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
Information Technology > Artificial Intelligence > Machine Learning (0.47)

Add feedback

Boosting Open Set Recognition Performance through Modulated Representation Learning

Kundu, Amit Kumar, Patil, Vaishnavi S, Jaja, Joseph

arXiv.org Artificial IntelligenceSep-30-2025

The open set recognition (OSR) problem aims to identify test samples from novel semantic classes that are not part of the training classes, a task that is crucial in many practical scenarios. However, the existing OSR methods use a constant scaling factor (the temperature) to the logits before applying a loss function, which hinders the model from exploring both ends of the spectrum in representation learning -- from instance-level to semantic-level features. In this paper, we address this problem by enabling temperature-modulated representation learning using a set of proposed temperature schedules, including our novel negative cosine schedule. Our temperature schedules allow the model to form a coarse decision boundary at the beginning of training by focusing on fewer neighbors, and gradually prioritizes more neighbors to smooth out the rough edges. This gradual task switching leads to a richer and more generalizable representation space. While other OSR methods benefit by including regularization or auxiliary negative samples, such as with mix-up, thereby adding a significant computational overhead, our schedules can be folded into any existing OSR loss function with no overhead. We implement the novel schedule on top of a number of baselines, using cross-entropy, contrastive and the ARPL loss functions and find that it boosts both the OSR and the closed set performance in most cases, especially on the tougher semantic shift benchmarks. Project codes will be available.

machine learning, natural language, recognition, (19 more...)

arXiv.org Artificial Intelligence

2505.18137

Genre: Research Report (0.64)

Technology: