AITopics | Diagnosis

Collaborating Authors

Diagnosis

News Overviews Instructional Materials AI-Alerts Classics

Feature Learning for Interpretable, Performant Decision Trees Supplementary Material 1 Experiment Specification

Neural Information Processing SystemsOct-9-2025, 08:10:59 GMT

Here we cover the full specification of the experiments. Some details were omitted from the main text. If there were separate training and test sets, they were combined before creating the random 10-fold split. All attributes are normalized to mean 0 and standard deviation 1. Additional details for each model type follow.

artificial intelligence, machine learning, time 0, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.50)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.40)

Add feedback

Syn-Diag: An LLM-based Synergistic Framework for Generalizable Few-shot Fault Diagnosis on the Edge

Jia, Zijun, Liang, Shuang, Yu, Jinsong

arXiv.org Artificial IntelligenceOct-8-2025

Industrial fault diagnosis faces the dual challenges of data scarcity and the difficulty of deploying large AI models in resource-constrained environments. This paper introduces Syn-Diag, a novel cloud-edge synergistic framework that leverages Large Language Models to overcome these limitations in few-shot fault diagnosis. Syn-Diag is built on a three-tiered mechanism: 1) Visual-Semantic Synergy, which aligns signal features with the LLM's semantic space through cross-modal pre-training; 2) Content-Aware Reasoning, which dynamically constructs contextual prompts to enhance diagnostic accuracy with limited samples; and 3) Cloud-Edge Synergy, which uses knowledge distillation to create a lightweight, efficient edge model capable of online updates via a shared decision space. Extensive experiments on six datasets covering different CWRU and SEU working conditions show that Syn-Diag significantly outperforms existing methods, especially in 1-shot and cross-condition scenarios. The edge model achieves performance comparable to the cloud version while reducing model size by 83% and latency by 50%, offering a practical, robust, and deployable paradigm for modern intelligent diagnostics.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2510.05733

Genre: Research Report (1.00)

Industry:

Education (0.94)
Health & Medicine > Diagnostic Medicine (0.48)
Health & Medicine > Consumer Health (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Ocular-Induced Abnormal Head Posture: Diagnosis and Missing Data Imputation

Al-Dabet, Saja, Turaev, Sherzod, Zaki, Nazar, Khan, Arif O., Eldweik, Luai

arXiv.org Artificial IntelligenceOct-8-2025

Ocular-induced abnormal head posture (AHP) is a compensatory mechanism that arises from ocular misalignment conditions, such as strabismus, enabling patients to reduce diplopia and preserve binocular vision. Early diagnosis minimizes morbidity and secondary complications such as facial asymmetry; however, current clinical assessments remain largely subjective and are further complicated by incomplete medical records. This study addresses both challenges through two complementary deep learning frameworks. First, AHP-CADNet is a multi-level attention fusion framework for automated diagnosis that integrates ocular landmarks, head pose features, and structured clinical attributes to generate interpretable predictions. Second, a curriculum learning-based imputation framework is designed to mitigate missing data by progressively leveraging structured variables and unstructured clinical notes to enhance diagnostic robustness under realistic data conditions. Evaluation on the PoseGaze-AHP dataset demonstrates robust diagnostic performance. AHP-CADNet achieves 96.9-99.0 percent accuracy across classification tasks and low prediction errors for continuous variables, with MAE ranging from 0.103 to 0.199 and R2 exceeding 0.93. The imputation framework maintains high accuracy across all clinical variables (93.46-99.78 percent with PubMedBERT), with clinical dependency modeling yielding significant improvements (p < 0.001). These findings confirm the effectiveness of both frameworks for automated diagnosis and recovery from missing data in clinical settings.

data mining, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2510.05649

Country:

North America > United States (0.28)
Asia > Middle East > UAE (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (1.00)
Health & Medicine > Health Care Technology > Medical Record (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Quality (1.00)
Information Technology > Data Science > Data Mining (1.00)
(5 more...)

Add feedback

A Trustworthy Industrial Fault Diagnosis Architecture Integrating Probabilistic Models and Large Language Models

wu, Yue

arXiv.org Artificial IntelligenceOct-7-2025

Abstract: Addressing the core problem of insufficient trustworthiness in industrial fault diagnosis, stemming from the limitations of existing methods -- both traditional and deep learning - based -- in terms of interpretability, generalization, and uncertainty quantification, this paper proposes a trustworthy industrial fault diagnosis architecture, the Hierarchical Cognitive Arbitration Architecture (HCAA), which integrates probabilistic models with Large Language Models (LLMs). The architecture conducts a preliminary analysis via a diagnostic engine based on a Bayesian network and features an LLM - driven cognitive arbitration module with multimodal input capabilities. This module performs expert - level arbitration on the initial diagnosis by analyzing structured features and diagnostic charts, holding the priority to make the final decision upon detecting conflicts. To ensure the reliability of the system's output, the architecture integrates a confidence calibration module based on Temperature Scaling and a risk assessment module, which objectively quantify system trustworthiness using metrics like Expected Calibration Error (ECE). Experimental results on a dataset containing multiple fault types demonstrate that the proposed framework improves diagnostic accuracy by over 28 percentage points compared to baseline models, while the post - calibration ECE is reduced by more than 75%. Case studies confirm that the HCAA effectively corrects misjudgments from traditional models caused by complex feature patterns or knowledge gaps, providing a novel and practical engineering solution for building high - trust, explainable AI diagnostic systems for industrial applications. Keywords: Industrial Fault Diagnosis; Large Language Model (LLM); Hierarchical Cognitive Arbitration; Probabilistic Model; Confidence Calibration; Trustworthy AI 1. Introduction With the deep development of Industry 4.0 and smart manufacturing concepts, modern industrial systems are evolving towards high levels of automation and intelligence. In this process, the reliability and safety of equipment have become key factors determining production efficiency and operational costs. Prognostics and Health Management (PHM), as a core technology, plays an indispensable role in improving equipment reliability, reducing unplanned downtime, and optimizing maintenance costs by monitoring equipment status in real - time, diagnosing potential faults, and predicting remaining useful life [1], [2].

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2510.03815

Genre: Research Report (1.00)

Industry:

Law (0.99)
Health & Medicine > Diagnostic Medicine (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
(3 more...)

Add feedback

Generalization of Graph Neural Network Models for Distribution Grid Fault Detection

Karabulut, Burak, Manna, Carlo, Develder, Chris

arXiv.org Artificial IntelligenceOct-7-2025

Fault detection in power distribution grids is critical for ensuring system reliability and preventing costly outages. Moreover, fault detection methodologies should remain robust to evolving grid topologies caused by factors such as reconfigurations, equipment failures, and Distributed Energy Resource (DER) integration. Current data-driven state-of-the-art methods use Recurrent Neural Networks (RNNs) for temporal modeling and Graph Neural Networks (GNNs) for spatial learning, in an RNN+GNN pipeline setting (RGNN in short). Specifically, for power system fault diagnosis, Graph Convolutional Networks (GCNs) have been adopted. Yet, various more advanced GNN architectures have been proposed and adopted in domains outside of power systems. In this paper, we set out to systematically and consistently benchmark various GNN architectures in an RNN+GNN pipeline model. Specifically, to the best of our knowledge, we are the first to (i) propose to use GraphSAGE and Graph Attention (GAT, GATv2) in an RGNN for fault diagnosis, and (ii) provide a comprehensive benchmark against earlier proposed RGNN solutions (RGCN) as well as pure RNN models (especially Gated Recurrent Unit (GRU)), particularly (iii) exploring their generalization potential for deployment in different settings than those used for training them. Our experimental results on the IEEE 123-node distribution network show that RGATv2 has superior generalization capabilities, maintaining high performance with an F1-score reduction of $\sim$12% across different topology settings. In contrast, pure RNN models largely fail, experiencing an F1-score reduction of up to $\sim$60%, while other RGNN variants also exhibit significant performance degradation, i.e., up to $\sim$25% lower F1-scores.

artificial intelligence, expert system, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2510.03571

Country:

Europe (0.68)
North America > United States (0.46)

Genre: Research Report (1.00)

Industry:

Energy > Power Industry (1.00)
Energy > Renewable > Solar (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Optimal Decision Tree with Noisy Outcomes

Su Jia, viswanath nagarajan, Fatemeh Navidi, R Ravi

Neural Information Processing SystemsOct-3-2025, 03:38:32 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, hypothesis, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Pennsylvania (0.28)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.52)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.42)

Add feedback

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing SystemsOct-3-2025, 03:37:43 GMT

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. The paper introduces an efficient feature transform of local decorrelation, which when combined with boosted (orthogonal) decision trees, considerably improves over the state-of-the-art on pedestrian detection. Overall, it is a clearly (and nicely) written paper with good analysis, enough details and solid experiments. Pros: - Very well written and executed paper - Attention to detail - Solid results - Straight forward and intuitive method Cons: - Incremental from Hariharan et al. (not major, see later) - If it claims ``Improved Detection'', as opposed to ``Improved Pedestrian Detection'', then I would have liked to see some more results on object detection or likewise. Going from global to local decorrelation, and doing the right analysis for design decisions set it apart.

decision tree, detection, feature transform, (13 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Industry:

Transportation > Ground > Road (0.58)
Automobiles & Trucks (0.58)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.38)

Add feedback

GRPO++: Enhancing Dermatological Reasoning under Low Resource Settings

Swapnil, Ismam Nur, Saha, Aranya, Khan, Tanvir Ahmed, Haque, Mohammad Ariful

arXiv.org Artificial IntelligenceOct-3-2025

Abstract-- Vision-Language Models (VLMs) show promise in medical image analysis, yet their capacity for structured reasoning in complex domains like dermatology is often limited by data scarcity and the high computational cost of advanced training techniques. To address these challenges, we introduce DermIQ-VLM, a VLM developed through a multi-stage, resource-efficient methodology designed to emulate a dermatologist's diagnostic process. Our primary contribution is a modified version of Grouped Relative Policy Optimization (GRPO), called GRPO++, which stabilizes the powerful but data-intensive GRPO framework. Our proposed training pipeline first employs GRPO++ for reasoning-oriented disease recognition, followed by supervised fine-tuning for conversational ability. To mitigate factual errors introduced during this step, we then align the model using Direct Preference Optimization (DPO), leveraging a Knowledge Graph-based system as a scalable proxy for expert preference. A preliminary evaluation on a curated dermatological dataset demonstrates that our proposed methodology yields notable performance gains over standard fine-tuning approaches.

arxiv preprint arxiv, large language model, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2510.01236

Genre: Research Report (0.50)

Industry:

Health & Medicine > Therapeutic Area > Dermatology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.68)

Add feedback

Survey of AI-Powered Approaches for Osteoporosis Diagnosis in Medical Imaging

Rahman, Abdul, Lee, Bumshik

arXiv.org Artificial IntelligenceOct-2-2025

Osteoporosis silently erodes skeletal integrity worldwide; however, early detection through imaging can prevent most fragility fractures. Artificial intelligence (AI) methods now mine routine Dual-energy X-ray Absorptiometry (DXA), X-ray, Computed Tomography (CT), and Magnetic Resonance Imaging (MRI) scans for subtle, clinically actionable markers, but the literature is fragmented. This survey unifies the field through a tri-axial framework that couples imaging modalities with clinical tasks and AI methodologies (classical machine learning, convolutional neural networks (CNNs), transformers, self-supervised learning, and explainable AI). Following a concise clinical and technical primer, we detail our Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA)-guided search strategy, introduce the taxonomy via a roadmap figure, and synthesize cross-study insights on data scarcity, external validation, and interpretability. By identifying emerging trends, open challenges, and actionable research directions, this review provides AI scientists, medical imaging researchers, and musculoskeletal clinicians with a clear compass to accelerate rigorous, patient-centered innovation in osteoporosis care. The project page of this survey can also be found on Github.

machine learning, natural language, osteoporosis, (19 more...)

arXiv.org Artificial Intelligence

2510.00061

Country:

Europe (0.46)
Asia (0.45)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)
Research Report > Strength Medium (0.92)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Add feedback

CEAID: Benchmark of Multilingual Machine-Generated Text Detection Methods for Central European Languages

Macko, Dominik, Kopal, Jakub

arXiv.org Artificial IntelligenceOct-1-2025

Machine-generated text detection, as an important task, is predominantly focused on English in research. This makes the existing detectors almost unusable for non-English languages, relying purely on cross-lingual transferability. There exist only a few works focused on any of Central European languages, leaving the transferability towards these languages rather unexplored. We fill this gap by providing the first benchmark of detection methods focused on this region, while also providing comparison of train-languages combinations to identify the best performing ones. We focus on multi-domain, multi-generator, and multilingual evaluation, pinpointing the differences of individual aspects, as well as adversarial robustness of detection methods. Supervised finetuned detectors in the Central European languages are found the most performant in these languages as well as the most resistant against obfuscation.

detector, large language model, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2509.26051

Country:

Europe > Austria (0.28)
North America > Mexico (0.28)
Asia > Middle East > UAE (0.28)

Genre: Research Report > Experimental Study (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.82)
(2 more...)

Add feedback