AITopics | Inductive Learning

Collaborating Authors

Inductive Learning

Inductive learning, or induction, is the process of creating generalizations from individual instances.

News Overviews Instructional Materials AI-Alerts Classics

A Scalable and High Availability Solution for Recommending Resolutions to Problem Tickets

Saragadam, Harish, Nayak, Chetana K, Bose, Joy

arXiv.org Artificial IntelligenceJul-30-2025

-- Resolution of i ncidents or problem tickets is a common theme in service industries in any sector, including billing and charging systems in telecom domain. Machine learning can help to identify patterns and suggest resolutions for the problem tickets, based on patterns in the historical data of the tickets . However, this process may be complicated due to a variety of phenomena such as data drift and issues such as missing data, lack of data pertaining to resolutions of past incidents, too many similar sound ing resolutions due to free text and similar sounding text . This paper proposes a robust ML - driven solution employing clustering, supervised learning, and advanced NLP models to tackle these challenges effectively. Building on previous work, w e demonstrate clustering - based resolution identification, supervised classification with LDA, Siamese networks, and One - shot learning, Index embedding . Additionally, we present a real - time dashboard and a highly available Kubernetes - based production deployment. Our experiments with both the open - source Bitext customer - support dataset and proprietary telecom datasets demonstrate high prediction accuracy. The problem of recommend ing resolutions for problem tickets or incidents on the basis of historical data is an important problem for service users, including telecom operators. Typically, service desks have dedicated manual teams that perform triaging of the issues and root cause analysis, and recommending a solution can take several hours end to end. Using machine learning models to recommend resolutions can save significant time and manpower of the operators by recommending solutions based on historical i ncident data. However, real - world application involves addressing several practical challenges: Diverse ticketing formats across service desks.

large language model, machine learning, resolution, (22 more...)

arXiv.org Artificial Intelligence

2507.19846

Country: Asia > India (0.15)

Genre:

Research Report (0.51)
Workflow (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

Transfer or Self-Supervised? Bridging the Performance Gap in Medical Imaging

Zhao, Zehui, Alzubaidi, Laith, Zhang, Jinglan, Duan, Ye, Naseem, Usman, Gu, Yuantong

arXiv.org Artificial IntelligenceJul-29-2025

Transfer Learning Using a light-weight model trained with target dataset directly can outperform the pre-trained TL model using natural images.[30] Self-Supervised Learning Pre-trained SSL model using natural images does not perform well with target COVID-19 samples and need further guidance from user.Problem Summary: domain discrepancy during pre-training willdegrade pre-trained model's performance[31] Transfer Learning Utilising pre-trained TL model does not bring significant improvement tothe target medical dataset with an imbalanced sample distribution.[32] Self-Supervised Learning The imbalanced source and target datasets lead to poor model performance even after self-supervised pre-training.Problem Summary: neither TL or SSL methods show improvedperformance towards imbalanced datasets[33] Transfer Learning The complexity of model and pre-training process makes it hard to understand the results and reduce the reliability of predictions.[34] Self-Supervised Learning The pre-training process of SSL model is fully unsupervised, which raised the concern for whether the model have fully understand the target dataset or is making predictions based on random factors.Problem Summary: the complexity of knowledge transferringprocess raised concerns of model reliabilityTable 1: Four main issues that constrained the application of pre-train methods in the medical field are summarised here: 1. the performance gap between TL and SSL in different data modalities, 2. the domain mismatch gap between source and target domain, 3. the challenge of data imbalance scenarios, 4. the difficulty in model explainability and analysis.

artificial intelligence, inductive learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2407.05592

Country:

Oceania > Australia > Queensland (0.04)
Oceania > Australia > New South Wales (0.04)
North America > United States > South Carolina (0.04)
Europe > France > Grand Est > Bas-Rhin > Strasbourg (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Therapeutic Area > Gastroenterology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Efficient Learning for Product Attributes with Compact Multimodal Models

Kulkarni, Mandar

arXiv.org Artificial IntelligenceJul-29-2025

Image-based product attribute prediction in e-commerce is a crucial task with numerous applications. The supervised fine-tuning of Vision Language Models (VLMs) faces significant scale challenges due to the cost of manual or API based annotation. In this paper, we investigate label-efficient semi-supervised fine-tuning strategies for compact VLMs (2B-3B parameters) that leverage unlabeled product listings through Direct Preference Optimization (DPO). Beginning with a small, API-based, annotated, and labeled set, we first employ PEFT to train low-rank adapter modules. T o update the adapter weights with unlabeled data, we generate multiple reasoning-and-answer chains per unlabeled sample and segregate these chains into preferred and dispreferred based on self-consistency. W e then fine-tune the model with DPO loss and use the updated model for the next iteration. By using PEFT fine-tuning with DPO, our method achieves efficient convergence with minimal compute overhead. On a dataset spanning twelve e-commerce verticals, DPO-based fine-tuning, which utilizes only unlabeled data, demonstrates a significant improvement over the supervised model. Moreover, experiments demonstrate that accuracy with DPO training improves with more unlabeled data, indicating that a large pool of unlabeled samples can be effectively leveraged to improve performance.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2507.19679

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.71)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.78)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.47)

Add feedback

Scale-Consistent Learning for Partial Differential Equations

Li, Zongyi, Lanthaler, Samuel, Deng, Catherine, Chen, Michael, Wang, Yixuan, Azizzadenesheli, Kamyar, Anandkumar, Anima

arXiv.org Artificial IntelligenceJul-28-2025

Machine learning (ML) models have emerged as a promising approach for solving partial differential equations (PDEs) in science and engineering. Previous ML models typically cannot generalize outside the training data; for example, a trained ML model for the Navier-Stokes equations only works for a fixed Reynolds number ($Re$) on a pre-defined domain. To overcome these limitations, we propose a data augmentation scheme based on scale-consistency properties of PDEs and design a scale-informed neural operator that can model a wide range of scales. Our formulation leverages the facts: (i) PDEs can be rescaled, or more concretely, a given domain can be re-scaled to unit size, and the parameters and the boundary conditions of the PDE can be appropriately adjusted to represent the original solution, and (ii) the solution operators on a given domain are consistent on the sub-domains. We leverage these facts to create a scale-consistency loss that encourages matching the solutions evaluated on a given domain and the solution obtained on its sub-domain from the rescaled PDE. Since neural operators can fit to multiple scales and resolutions, they are the natural choice for incorporating scale-consistency loss during training of neural PDE solvers. We experiment with scale-consistency loss and the scale-informed neural operator model on the Burgers' equation, Darcy Flow, Helmholtz equation, and Navier-Stokes equations. With scale-consistency, the model trained on $Re$ of 1000 can generalize to $Re$ ranging from 250 to 10000, and reduces the error by 34% on average of all datasets compared to baselines.

artificial intelligence, inductive learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2507.18813

Genre: Research Report > Promising Solution (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.67)

Add feedback

JCAPT: A Joint Modeling Approach for CAPT

Yang, Tzu-Hsuan, He, Yue-Yang, Chen, Berlin

arXiv.org Artificial IntelligenceJul-28-2025

Effective pronunciation feedback is critical in second language (L2) learning, for which computer-assisted pronunciation training (CAPT) systems often encompass two key tasks: automatic pronunciation assessment (AP A) and mispronunciation detection and diagnosis (MDD). Recent work has shown that joint modeling of these two tasks can yield mutual benefits. Our unified framework leverages Mamba, a selective state space model (SSM), while integrating phonological features and think token strategies to jointly enhance interpretability and fine-grained temporal reasoning in AP A and MDD. To our knowledge, this is the first study to combine phonological attribution, SSM-based modeling, and prompting in CAPT. A series of experiments conducted on the speechocean762 benchmark demonstrate that our model consistently outperforms prior methods, particularly on the MDD task.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2506.19315

Country: Asia > Taiwan (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.70)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.48)
(3 more...)

Add feedback

CM-UNet: A Self-Supervised Learning-Based Model for Coronary Artery Segmentation in X-Ray Angiography

Challier, Camille, Sun, Xiaowu, Mahendiran, Thabo, Senouf, Ortal, De Bruyne, Bernard, Auberson, Denise, Müller, Olivier, Fournier, Stephane, Frossard, Pascal, Abbé, Emmanuel, Thanou, Dorina

arXiv.org Artificial IntelligenceJul-25-2025

Accurate segmentation of coronary arteries remains a significant challenge in clinical practice, hindering the ability to effectively diagnose and manage coronary artery disease. The lack of large, annotated datasets for model training exacerbates this issue, limiting the development of automated tools that could assist radiologists. To address this, we introduce CM-UNet, which leverages self-supervised pre-training on unannotated datasets and transfer learning on limited annotated data, enabling accurate disease detection while minimizing the need for extensive manual annotations. Fine-tuning CM-UNet with only 18 annotated images instead of 500 resulted in a 15.2% decrease in Dice score, compared to a 46.5% drop in baseline models without pre-training. This demonstrates that self-supervised learning can enhance segmentation performance and reduce dependence on large datasets. This is one of the first studies to highlight the importance of self-supervised learning in improving coronary artery segmentation from X-ray angiography, with potential implications for advancing diagnostic accuracy in clinical practice. By enhancing segmentation accuracy in X-ray angiography images, the proposed approach aims to improve clinical workflows, reduce radiologists' workload, and accelerate disease detection, ultimately contributing to better patient outcomes. The source code is publicly available at https://github.com/CamilleChallier/Contrastive-Masked-UNet.

artificial intelligence, machine learning, segmentation, (14 more...)

arXiv.org Artificial Intelligence

2507.17779

Country: Europe (0.28)

Genre:

Research Report > Experimental Study (0.94)
Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Self-Supervised Inductive Logic Programming

Patsantzis, Stassa

arXiv.org Artificial IntelligenceJul-23-2025

Inductive Logic Programming (ILP) approaches like Meta \-/ Interpretive Learning (MIL) can learn, from few examples, recursive logic programs with invented predicates that generalise well to unseen instances. This ability relies on a background theory and negative examples, both carefully selected with expert knowledge of a learning problem and its solutions. But what if such a problem-specific background theory or negative examples are not available? We formalise this question as a new setting for Self-Supervised ILP and present a new MIL algorithm that learns in the new setting from some positive labelled, and zero or more unlabelled examples, and automatically generates, and labels, new positive and negative examples during learning. We implement this algorithm in Prolog in a new MIL system, called Poker. We compare Poker to state-of-the-art MIL system Louise on experiments learning grammars for Context-Free and L-System languages from labelled, positive example strings, no negative examples, and just the terminal vocabulary of a language, seen in examples, as a first-order background theory. We introduce a new approach for the principled selection of a second-order background theory as a Second Order Definite Normal Form (SONF), sufficiently general to learn all programs in a class, thus removing the need for a backgound theory tailored to a learning task. We find that Poker's performance improves with increasing numbers of automatically generated examples while Louise, bereft of negative examples, over-generalises.

background theory, logic & formal reasoning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2507.16405

Genre: Research Report (0.64)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Add feedback

Influence Functions for Preference Dataset Pruning

Fein, Daniel, Aranguiz-Dias, Gabriela

arXiv.org Artificial IntelligenceJul-22-2025

Language models are commonly fine-tuned via reinforcement learning to alter their behavior or elicit new capabilities. Datasets used for these purposes, and particularly human preference datasets, are often noisy. The relatively small size post-training datasets, combined with parameter-efficient fine-tuning methods, enable the use of influence functions approximations to detect and prune training examples that are harmful to performance on a validation set. In this work, we adapt the TL;DR dataset for reward model training to demonstrate how conjugate-gradient approximated influence functions can be used to filter datasets. In our experiments, influence function filtering yields a small retraining accuracy uplift of 1.5% after removing 10% of training examples. We also show that gradient similarity outperforms influence functions for detecting helpful training examples. This suggests that local curvature is important for detecting harmful training examples, but less so for identifying helpful examples.

artificial intelligence, machine learning, training example, (16 more...)

arXiv.org Artificial Intelligence

2507.14344

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

SemiOccam: A Robust Semi-Supervised Image Recognition Network Using Sparse Labels

Yann, Rui, Zhang, Tianshuo, Xing, Xianglei

arXiv.org Artificial IntelligenceJul-22-2025

We present SemiOccam, an image recognition network that leverages semi-supervised learning in a highly efficient manner. Existing works often rely on complex training techniques and architectures, requiring hundreds of GPU hours for training, while their generalization ability with extremely limited labeled data remains to be improved. To address these limitations, we construct a hierarchical mixture density classification mechanism by optimizing mutual information between feature representations and target classes, compressing redundant information while retaining crucial discriminative components. Experimental results demonstrate that our method achieves state-of-the-art performance on three commonly used datasets, with accuracy exceeding 95% on two of them using only 4 labeled samples per class, and its simple architecture keeps training time at the minute level. Notably, this paper reveals a long-overlooked data leakage issue in the STL-10 dataset for semi-supervised learning and removes duplicates to ensure reliable experimental results. We release the deduplicated CleanSTL-10 dataset to facilitate fair and reproducible research. Code available at https://github.com/Shu1L0n9/SemiOccam.

artificial intelligence, machine learning, pattern recognition, (20 more...)

arXiv.org Artificial Intelligence

2506.03582

Country: Asia > China (0.16)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.72)
(4 more...)

Add feedback

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys

Luo, Yufeng, Myers, Adam D., Drlica-Wagner, Alex, Dematties, Dario, Borchani, Salma, Valdes, Frank, Dey, Arjun, Schlegel, David, Zhou, Rongpu, Team, DESI Legacy Imaging Surveys

arXiv.org Artificial IntelligenceJul-18-2025

As the data volume of astronomical imaging surveys rapidly increases, traditional methods for image anomaly detection, such as visual inspection by human experts, are becoming impractical. We introduce a machine-learning-based approach to detect poor-quality exposures in large imaging surveys, with a focus on the DECam Legacy Survey (DECaLS) in regions of low extinction (i.e., $E(B-V)<0.04$). Our semi-supervised pipeline integrates a vision transformer (ViT), trained via self-supervised learning (SSL), with a k-Nearest Neighbor (kNN) classifier. We train and validate our pipeline using a small set of labeled exposures observed by surveys with the Dark Energy Camera (DECam). A clustering-space analysis of where our pipeline places images labeled in ``good'' and ``bad'' categories suggests that our approach can efficiently and accurately determine the quality of exposures. Applied to new imaging being reduced for DECaLS Data Release 11, our pipeline identifies 780 problematic exposures, which we subsequently verify through visual inspection. Being highly efficient and adaptable, our method offers a scalable solution for quality control in other large imaging surveys.

artificial intelligence, exposure, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2507.12784

Country:

Europe (1.00)
North America > United States > Illinois > Cook County (0.14)

Genre: Research Report (0.64)

Industry:

Energy (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.68)

Add feedback