AITopics

2502.16535

Country:

North America > United States (0.14)
Europe > France (0.14)
Europe > Spain (0.14)
(2 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.66)
Research Report > New Finding (0.46)

Industry:

Law Enforcement & Public Safety (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Ruah, Clement, Sifaou, Houssem, Simeone, Osvaldo, Al-Hashimi, Bashir

Context-Aware Doubly-Robust Semi-Supervised Learning

arXiv.org Artificial IntelligenceFeb-21-2025

--The widespread adoption of artificial intelligence (AI) in next-generation communication systems is challenged by the heterogeneity of traffic and network conditions, which call for the use of highly contextual, site-specific, data. A promising solution is to rely not only on real-world data, but also on synthetic pseudo-data generated by a network digital twin (NDT). However, the effectiveness of this approach hinges on the accuracy of the NDT, which can vary widely across different contexts. T o address this problem, this paper introduces context-aware doubly-robust (CDR) learning, a novel semi-supervised scheme that adapts its reliance on the pseudo-data to the different levels of fidelity of the NDT across contexts. CDR is evaluated on the task of downlink beamforming, showing superior performance compared to previous state-of-the-art semi-supervised approaches.

artificial intelligence, arxiv preprint arxiv, machine learning, (14 more...)

2502.15577

Country: Europe > United Kingdom > England (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.42)

Tarubinga, Ebenezer, Espinoza, Jenifer Kalafatovich

Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation

arXiv.org Artificial IntelligenceFeb-20-2025

Semi-supervised semantic segmentation (SSSS) aims to improve segmentation performance by utilising unlabeled data alongside limited labeled samples. Existing SSSS methods often face challenges such as coupling, where over-reliance on initial labeled data leads to suboptimal learning; confirmation bias, where incorrect predictions reinforce themselves repeatedly; and boundary blur caused by insufficient boundary-awareness and ambiguous edge information. To address these issues, we propose CW-BASS, a novel framework for SSSS. In order to mitigate the impact of incorrect predictions, we assign confidence weights to pseudo-labels. Additionally, we leverage boundary-delineation techniques, which, despite being extensively explored in weakly-supervised semantic segmentation (WSSS) remain under-explored in SSSS. Specifically, our approach: (1) reduces coupling through a confidence-weighted loss function that adjusts the influence of pseudo-labels based on their predicted confidence scores, (2) mitigates confirmation bias with a dynamic thresholding mechanism that learns to filter out pseudo-labels based on model performance, (3) resolves boundary blur with a boundary-aware module that enhances segmentation accuracy near object boundaries, and (4) reduces label noise with a confidence decay strategy that progressively refines pseudo-labels during training. Extensive experiments on the Pascal VOC 2012 and Cityscapes demonstrate that our method achieves state-of-the-art performance. Moreover, using only 1/8 or 12.5\% of labeled data, our method achieves a mIoU of 75.81 on Pascal VOC 2012, highlighting its effectiveness in limited-label settings.

machine learning, natural language, semantic segmentation, (12 more...)

2502.15152

Genre: Research Report > New Finding (0.93)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Chung, Jichan, Chen, Irene Y.

Enhancing Semi-supervised Learning with Noisy Zero-shot Pseudolabels

arXiv.org Artificial IntelligenceFeb-18-2025

The growing scale of machine learning applications has made data labeling costs a critical bottleneck in deploying ML systems [1, 2, 3]. Semi-supervised learning (SSL) addresses this challenge by leveraging unlabeled data alongside limited labeled examples [4]. Traditional SSL approaches like pseudo-labeling and consistency regularization have demonstrated strong performance across domains, particularly in computer vision and natural language processing [5, 6, 4]. Recent advances in foundation models have enabled zero-shot inference on novel tasks without taskspecific training [7, 8]. These models can generate predictions for unseen tasks by leveraging their pretrained knowledge, offering a promising direction for reducing labeling requirements. Several works have proposed integrating these zero-shot capabilities into SSL frameworks [9, 10]. Current approaches primarily use foundation models as teacher networks for generating pseudo-labels through inference, which requires complex model distillation and introduces additional training overhead.

large language model, machine learning, natural language, (18 more...)

2502.12584

Country: North America > United States > California (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Asilis, Julian, Devic, Siddartha, Dughmi, Shaddin, Sharan, Vatsal, Teng, Shang-Hua

Proper Learnability and the Role of Unlabeled Data

arXiv.org Machine LearningFeb-14-2025

Proper learning refers to the setting in which learners must emit predictors in the underlying hypothesis class $H$, and often leads to learners with simple algorithmic forms (e.g. empirical risk minimization (ERM), structural risk minimization (SRM)). The limitation of proper learning, however, is that there exist problems which can only be learned improperly, e.g. in multiclass classification. Thus, we ask: Under what assumptions on the hypothesis class or the information provided to the learner is a problem properly learnable? We first demonstrate that when the unlabeled data distribution is given, there always exists an optimal proper learner governed by distributional regularization, a randomized generalization of regularization. We refer to this setting as the distribution-fixed PAC model, and continue to evaluate the learner on its worst-case performance over all distributions. Our result holds for all metric loss functions and any finite learning problem (with no dependence on its size). Further, we demonstrate that sample complexities in the distribution-fixed PAC model can shrink by only a logarithmic factor from the classic PAC model, strongly refuting the role of unlabeled data in PAC learning (from a worst-case perspective). We complement this with impossibility results which obstruct any characterization of proper learnability in the realizable PAC model. First, we observe that there are problems whose proper learnability is logically undecidable, i.e., independent of the ZFC axioms. We then show that proper learnability is not a monotone property of the underlying hypothesis class, and that it is not a local property (in a precise sense). Our impossibility results all hold even for the fundamental setting of multiclass classification, and go through a reduction of EMX learning (Ben-David et al., 2019) to proper classification which may be of independent interest.

artificial intelligence, machine learning, unlabeled data

arXiv.org Machine Learning

2502.10359

Genre: Research Report (1.00)

Industry: Education (0.53)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.80)

Neural Information Processing SystemsFeb-12-2025, 02:39:43 GMT

Supplementary Material: Learning Semantic-aware Normalization for Generative Adversarial Networks

It can be observed that features with low resolutions (e.g., 8 8 64 64) Figure 2 shows the semantic interpolation results. Table 1: Comparison of baseline, random grouping and semantic grouping (i.e., the proposed SGM) Table 2: Conduct semantic-aware control at different on LSUN CATS [26] in terms of FID. Figure 1: Visualization of the semantics learned in different resolutions. We show 16 groups in each layer with the resolution increasing from 8 8 to 256 256. The attention maps are obtained by averaging the feature maps in a group. It can be observed that features with low resolutions (i.e., 8 8 64 64) show better performance in learning semantics (e.g., eyes, mouths and hair). We can realize independent control on fine-grained semantics by conducting interpolation in latent space.

artificial intelligence, machine learning, resolution, (16 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.41)

Robertson, Sawyer Jack, Holtz, Chester, Wan, Zhengchao, Mishne, Gal, Cloninger, Alexander

Robust Graph-Based Semi-Supervised Learning via $p$-Conductances

arXiv.org Artificial IntelligenceFeb-12-2025

We study the problem of semi-supervised learning on graphs in the regime where data labels are scarce or possibly corrupted. We propose an approach called $p$-conductance learning that generalizes the $p$-Laplace and Poisson learning methods by introducing an objective reminiscent of $p$-Laplacian regularization and an affine relaxation of the label constraints. This leads to a family of probability measure mincut programs that balance sparse edge removal with accurate distribution separation. Our theoretical analysis connects these programs to well-known variational and probabilistic problems on graphs (including randomized cuts, effective resistance, and Wasserstein distance) and provides motivation for robustness when labels are diffused via the heat kernel. Computationally, we develop a semismooth Newton-conjugate gradient algorithm and extend it to incorporate class-size estimates when converting the continuous solutions into label assignments. Empirical results on computer vision and citation datasets demonstrate that our approach achieves state-of-the-art accuracy in low label-rate, corrupted-label, and partial-label regimes.

artificial intelligence, machine learning, robust graph-based semi-supervised learning

2502.08873

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.60)

Neural Information Processing SystemsFeb-11-2025, 23:45:11 GMT

Review for NeurIPS paper: Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks

The paper studies the problem of estimating the effect of continuous treatment variables. The authors propose a GAN-based framework to learns the distribution of the unobserved counterfactuals. The reviewers found the theoretical contribution as well as the simulation showing improvement over the pre-existing benchmarks satisfying. Estimating the effect of a treatment is a central problem to causal inference and as such this paper could be of interest to the broader NeurIPS audience.

continuous-valued intervention, generative adversarial network, neurips paper

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Neural Information Processing SystemsFeb-11-2025, 20:55:24 GMT

065e259a1d2d955e63b99aac6a3a3081-Paper-Conference.pdf

In the adversarial training framework of Carmon et al. (2019); Gowal et al. (2021), people use generated/real unlabeled data with pseudolabels to improve adversarial robustness. We provide statistical insights to explain why the artificially generated data improve adversarial training. In particular, we study how the attack strength and the quality of the unlabeled data affect adversarial robustness in this framework. Our results show that with a high-quality unlabeled data generator, adversarial training can benefit greatly from this framework under large attack strength, while a poor generator can still help to some extent. To make adaptions concerning the quality of generated data, we propose an algorithm that performs online adjustment to the weight between the labeled real data and the generated data, aiming to optimize the adversarial risk. Numerical studies are conducted to verify our theories and show the effectiveness of the proposed algorithm.

adversarial training, artificial intelligence, machine learning, (17 more...)

Genre: Research Report > New Finding (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.79)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Information Processing SystemsFeb-11-2025, 20:54:17 GMT

Review for NeurIPS paper: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

Four knowledgeable reviewers support acceptance for the contributions. Reviewers find that i) the proposed algorithm is simple; ii) efficient and empirical evaluation is very carefully designed with an extensive ablation study; iii) analysis on augmentation strategy and sharpening also provide good insights. Therefore, I also recommend acceptance. However, please consider revising your paper to address all the concerns and comments from the reviewers.

consistency and confidence, neurips paper, simplifying semi-supervised learning, (2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.40)