AITopics | unlabelled data

SEAL: Semantic-Aware Hierarchical Learning for Generalized Category Discovery

Neural Information Processing Systems, Jun-23-2026, 01:31:49 GMT

This paper investigates the problem of Generalized Category Discovery (GCD). Given a partially labelled dataset, GCD aims to categorize all unlabelled images, regardless of whether they belong to known or unknown classes. Existing approaches typically depend on either single-level semantics or manually designed abstract hierarchies, which limit their generalizability and scalability. To address these limitations, we introduce a SEmantic-aware hierArchical Learning framework (SEAL), guided by naturally occurring and easily accessible hierarchical structures. Within SEAL, we propose a Hierarchical Semantic-Guided Soft Contrastive Learning approach that exploits hierarchical similarity to generate informative soft negatives, addressing the limitations of conventional contrastive losses that treat all negatives equally. Furthermore, a Cross-Granularity Consistency (CGC) module is designed to align the predictions from different levels of granularity. SEAL consistently achieves state-of-the-art performance on fine-grained benchmarks, including the SSB benchmark, Oxford-Pet, and the Herbarium19 dataset, and further demonstrates generalization on coarse-grained datasets.
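As a rough illustration of what "soft negatives" could mean, the sketch below scales each negative's term in an InfoNCE-style loss by a weight derived from how much of a label hierarchy it shares with the anchor. This is not the SEAL loss, whose exact form the abstract does not give. The function names, the path-overlap similarity, the `1 - similarity` weighting, and the example label paths are all assumptions made for illustration.

```python
import math

def hierarchical_similarity(path_a, path_b):
    """Fraction of hierarchy levels (root first) that two label paths share."""
    shared = 0
    for a, b in zip(path_a, path_b):
        if a != b:
            break
        shared += 1
    return shared / max(len(path_a), len(path_b))

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def soft_contrastive_loss(anchor, positive, negatives, neg_weights, temperature=0.1):
    """InfoNCE-style loss where each negative's term is scaled by a weight in [0, 1].

    Setting every weight to 1 recovers the usual loss that treats all negatives equally.
    """
    pos = math.exp(dot(anchor, positive) / temperature)
    neg = sum(w * math.exp(dot(anchor, n) / temperature)
              for n, w in zip(negatives, neg_weights))
    return -math.log(pos / (pos + neg))

# Hypothetical label paths: (coarse, middle, fine).
anchor_path = ("bird", "sparrow", "house sparrow")
negative_paths = [("bird", "sparrow", "tree sparrow"), ("bird", "gull", "herring gull")]

# Assumed weighting: semantically closer negatives are pushed away less.
weights = [1.0 - hierarchical_similarity(anchor_path, p) for p in negative_paths]

anchor, positive = (1.0, 0.0), (0.8, 0.6)
negatives = [(0.6, 0.8), (0.0, 1.0)]
soft = soft_contrastive_loss(anchor, positive, negatives, weights)
uniform = soft_contrastive_loss(anchor, positive, negatives, [1.0, 1.0])
```

Because every weight is at most 1, the soft loss is never larger than the uniform one for the same embeddings; how the weights should actually be derived from the hierarchy is the design question the paper addresses.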

artificial intelligence, deep learning, machine learning, (15 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology > Security & Privacy (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Prediction-powered Inference by Mixture of Experts

Gu, Yanwu, Kong, Linglong, Xia, Dong

arXiv.org Machine Learning, May-1-2026

The rapidly expanding artificial intelligence (AI) industry has produced diverse yet powerful prediction tools, each with its own network architecture, training strategy, data-processing pipeline, and domain-specific strengths. These tools create new opportunities for semi-supervised inference, in which labeled data are limited and expensive to obtain, whereas unlabeled data are abundant and widely available. Given a collection of predictors, we treat them as a mixture of experts (MOE) and introduce an MOE-powered semi-supervised inference framework built upon prediction-powered inference (PPI). Motivated by the variance reduction principle underlying PPI, the proposed framework seeks the mixture of experts that achieves the smallest possible variance. Compared with standard PPI, the MOE-powered inference framework adapts to the unknown performance of individual predictors, benefits from their collective predictive power, and enjoys a best-expert guarantee. The framework is flexible and applies to mean estimation, linear regression, quantile estimation, and general M-estimation. We develop non-asymptotic theory for the MOE-powered inference framework and establish upper bounds on the coverage error of the resulting confidence intervals. Numerical experiments demonstrate the practical effectiveness of MOE-powered inference and corroborate our theoretical findings.
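For the mean-estimation case, the idea can be sketched with the standard PPI estimator (the mean of predictions on unlabeled data plus a bias correction from labeled data) applied to a weighted combination of experts. The code below is a minimal sketch, not the authors' procedure: the restriction to two experts, the grid search over weights, and all function names are assumptions. It also ignores the effect of choosing weights on the same data used for estimation, which the paper's theory would have to account for.

```python
import statistics

def mix(preds_per_expert, weights):
    """Weighted combination of expert predictions, one value per sample."""
    return [sum(w * p for w, p in zip(weights, row)) for row in zip(*preds_per_expert)]

def ppi_mean(y_lab, f_lab, f_unlab):
    """PPI mean estimate: prediction mean on unlabeled data plus a labeled-data correction."""
    rectifier = statistics.fmean([y - f for y, f in zip(y_lab, f_lab)])
    return statistics.fmean(f_unlab) + rectifier

def ppi_variance(y_lab, f_lab, f_unlab):
    """Plug-in variance estimate of the PPI mean: Var(f)/N + Var(Y - f)/n."""
    resid = [y - f for y, f in zip(y_lab, f_lab)]
    return (statistics.variance(f_unlab) / len(f_unlab)
            + statistics.variance(resid) / len(resid))

def best_two_expert_mixture(y_lab, experts_lab, experts_unlab, steps=20):
    """Grid-search convex weights (w, 1 - w) that minimize the estimated variance."""
    best = None
    for i in range(steps + 1):
        w = (i / steps, 1 - i / steps)
        v = ppi_variance(y_lab, mix(experts_lab, w), mix(experts_unlab, w))
        if best is None or v < best[0]:
            best = (v, w)
    variance, weights = best
    estimate = ppi_mean(y_lab, mix(experts_lab, weights), mix(experts_unlab, weights))
    return estimate, variance, weights

# Toy data (made up): expert A tracks the labels closely, expert B is nearly constant.
y_lab = [1.0, 2.0, 3.0, 4.0]
experts_lab = [[1.1, 1.9, 3.2, 3.8], [2.4, 2.5, 2.6, 2.5]]
experts_unlab = [[0.9, 2.1, 3.0, 4.2, 5.1, 1.8], [2.5, 2.4, 2.6, 2.5, 2.7, 2.4]]
estimate, variance, weights = best_two_expert_mixture(y_lab, experts_lab, experts_unlab)
```

Since the grid includes the endpoints (1, 0) and (0, 1), the selected mixture's estimated variance is never worse than that of either expert used alone, which mirrors the best-expert property the abstract mentions.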

artificial intelligence, estimator, machine learning, (18 more...)

arXiv.org Machine Learning

2604.27892

Country:

North America (0.45)
Asia (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.92)

3a61ed715ee66c48bacf237fa7bb5289-Paper.pdf

Neural Information Processing Systems, Apr-25-2026, 12:40:57 GMT

artificial intelligence, feature extractor, machine learning, (13 more...)

Neural Information Processing Systems

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

0918183ced31affb7ce0345e45ac1943-Paper-Conference.pdf

Neural Information Processing Systems, Apr-24-2026, 11:08:27 GMT

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report (0.94)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation - Supplementary Material - Bingchen Zhao, Kai Han

Neural Information Processing Systems, Feb-11-2026, 00:37:31 GMT

It can be seen that, except for the extreme case of a very small k (e.g. k = 1), the results are generally stable, further corroborating the robustness of ranking statistics. We also carry out experiments using "hard" and "soft" cosine similarity. For the "hard" cosine similarity, we simply apply a threshold (0.9 in our experiments) to the score to get binary pseudo labels, while for the "soft" cosine similarity, we directly take the score as soft pseudo labels. We choose to use soft ranking statistics because we believe the continuous similarity better reflects the actual similarity of objects than the binary score. This is important for pairs with a similarity score around 0.5, for which the binary score is not very reliable.
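The difference between the two pseudo-labelling schemes can be shown in a few lines. This is a minimal sketch: the 0.9 threshold comes from the text, while the function names, the use of `>=` rather than `>` at the threshold, and the example vectors are assumptions.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

def hard_pseudo_label(score, threshold=0.9):
    """Binary pairwise label: 1.0 if the pair is treated as same-class, else 0.0."""
    return 1.0 if score >= threshold else 0.0

def soft_pseudo_label(score):
    """Soft pairwise label: the similarity score is used directly."""
    return score

# Hypothetical pair of feature vectors with moderate similarity (about 0.707).
score = cosine_similarity((1.0, 0.0), (1.0, 1.0))
hard = hard_pseudo_label(score)   # collapses to 0.0, discarding the partial similarity
soft = soft_pseudo_label(score)   # keeps the continuous value
```

For a pair like this one, the hard label reports "different" with full confidence, whereas the soft label preserves the fact that the two samples are moderately similar.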

artificial intelligence, dataset, statistics, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.91)

c203d8a151612acf12457e4d67635a95-Paper.pdf

Neural Information Processing Systems, Feb-11-2026, 00:37:27 GMT

novel category discovery, ranking statistics, unlabelled data, (11 more...)

Neural Information Processing Systems

Country: Asia > China > Hong Kong (0.04)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Neural Information Processing Systems, Feb-8-2026, 06:54:52 GMT

Our model is trained in an EM-type manner, alternating between increasing the 3D pose invariance of the feature extractor and annotating unlabelled data through neural view synthesis and matching.

artificial intelligence, inductive learning, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Maryland (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.30)

0918183ced31affb7ce0345e45ac1943-Paper-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 08:56:25 GMT

dataset, international conference, learning, (13 more...)

Neural Information Processing Systems

Country:

Africa (0.04)
Europe > France (0.04)
Asia > Nepal (0.04)
Asia > Indonesia (0.04)

Genre: Research Report (0.94)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

Neural Information Processing Systems, Dec-24-2025, 00:13:58 GMT

We study the problem of learning to estimate the 3D object pose from a few labelled examples and a collection of unlabelled data. Our main contribution is a learning framework, neural view synthesis and matching, that can transfer the 3D pose annotation from the labelled to unlabelled images reliably, despite unseen 3D views and nuisance variations such as the object shape, texture, illumination or scene context. In our approach, objects are represented as 3D cuboid meshes composed of feature vectors at each mesh vertex. The model is initialized from a few labelled images and is subsequently used to synthesize feature representations of unseen 3D views. The synthesized views are matched with the feature representations of unlabelled images to generate pseudo-labels of the 3D pose.

name change, neural view synthesis and matching, semi-supervised few-shot learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Wasserstein distance based semi-supervised manifold learning and application to GNSS multi-path detection

Blais, Antoine, Couëllan, Nicolas

arXiv.org Machine Learning, Dec-8-2025

The main objective of this study is to propose an optimal transport based semi-supervised approach to learn from scarce labelled image data using deep convolutional networks. The principle lies in implicit graph-based transductive semi-supervised learning where the similarity metric between image samples is the Wasserstein distance. This metric is used in the label propagation mechanism during learning. We apply and demonstrate the effectiveness of the method on a GNSS real life application. More specifically, we address the problem of multi-path interference detection. Experiments are conducted under various signal conditions. The results show that for specific choices of hyperparameters controlling the amount of semi-supervision and the level of sensitivity to the metric, the classification accuracy can be significantly improved over the fully supervised training method.
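To make the role of the metric concrete, the toy sketch below propagates a label to an unlabelled sample by weighting labelled samples with a kernel of the Wasserstein distance. It is only an illustration of distance-based label propagation, not the paper's method, which works implicitly inside deep convolutional network training. The 1D equal-size simplification, the exponential kernel, the bandwidth `sigma`, the function names, and the class names are all assumptions.

```python
import math
from collections import defaultdict

def wasserstein_1d(a, b):
    """W1 distance between two equal-size 1D empirical samples.

    For equal sizes this is the mean absolute difference of the sorted values.
    """
    if len(a) != len(b):
        raise ValueError("this sketch assumes equal sample sizes")
    return sum(abs(x - y) for x, y in zip(sorted(a), sorted(b))) / len(a)

def propagate_label(sample, labelled, sigma=1.0):
    """Pick the label whose labelled samples have the largest total affinity.

    labelled is a list of (values, label) pairs; affinity is exp(-W1 / sigma).
    """
    votes = defaultdict(float)
    for ref, label in labelled:
        votes[label] += math.exp(-wasserstein_1d(sample, ref) / sigma)
    return max(votes, key=votes.get)

# Hypothetical labelled set with made-up class names and values.
labelled = [([0.0, 1.0, 2.0], "clean"), ([5.0, 6.0, 7.0], "multipath")]
predicted = propagate_label([0.1, 0.9, 2.1], labelled)
```

A smaller `sigma` makes the vote depend almost entirely on the nearest labelled samples, while a larger one averages over more of the labelled set.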

application, sup, wasserstein distance, (15 more...)

arXiv.org Machine Learning

2512.05567

Country: