Goto

Collaborating Authors

 classification output


Ensemble Deep Learning and LLM-Assisted Reporting for Automated Skin Lesion Diagnosis

arXiv.org Artificial Intelligence

Cutaneous malignancies demand early detection for favorable outcomes, yet current diagnostics suffer from inter-observer variability and access disparities. While AI shows promise, existing dermatological systems are limited by homogeneous architectures, dataset biases across skin tones, and fragmented approaches that treat natural language processing as separate post-hoc explanations rather than integral to clinical decision-making. We introduce a unified framework that fundamentally reimagines AI integration for dermatological diagnostics through two synergistic innovations. First, a purposefully heterogeneous ensemble of architecturally diverse convolutional neural networks provides complementary diagnostic perspectives, with an intrinsic uncertainty mechanism flagging discordant cases for specialist review -- mimicking clinical best practices. Second, we embed large language model capabilities directly into the diagnostic workflow, transforming classification outputs into clinically meaningful assessments that simultaneously fulfill medical documentation requirements and deliver patient-centered education. This seamless integration generates structured reports featuring precise lesion characterization, accessible diagnostic reasoning, and actionable monitoring guidance -- empowering patients to recognize early warning signs between visits. By addressing both diagnostic reliability and communication barriers within a single cohesive system, our approach bridges the critical translational gap that has prevented previous AI implementations from achieving clinical impact. The framework represents a significant advancement toward deployable dermatological AI that enhances diagnostic precision while actively supporting the continuum of care from initial detection through patient education, ultimately improving early intervention rates for skin lesions.


ActiNet: Activity intensity classification of wrist-worn accelerometers using self-supervised deep learning

arXiv.org Artificial Intelligence

The use of reliable and accurate human activity recognition (HAR) models on passively collected wrist - accelerometer data is essential in large - scale epidemiological studies that investigate the association between physical activity and health outcomes . While the use of self - supervised learning has generated considerable e xcitement in improving HAR, it remains unknown to what extent th ese models, coupled with hidden Markov models (HMMs), would make a tangible improvement to classification performance and the effect this may have on the predicted daily activity intensity compositions . Us ing 151 CAPTURE - 24 participants' data, we trained the ActiNet model, a self - supervised, 18 - layer, modified ResNet - V2 model, followed by hidden Markov model (HMM) smoothing to classify labels of activity intensity . The performance of this model, evaluated using 5 - fold stratified group cross - validation, was then compared to a baseline random forest (RF) + HMM, established in existing literature . Differences in performance and classification outputs were compared with different subgroups of age and sex within the Capture - 24 population. The ActiNet model was able to distinguish labels of activity intensity with a mean macro F1 score of 0.82 and a mean Cohen's kappa score of 0.86 . This exceeded the performance of the RF + HMM, trained and validated on the same dataset, with mean scores of 0.77 and 0.81, respectively . These findings were consistent across subgroups of age and sex. These findings encourage the use of ActiNet for the extraction of activity intensity labels from wrist - accelerometer data in future epidemiological studies.


Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-Attention

arXiv.org Artificial Intelligence

Vision Transformer(ViT) is one of the most widely used models in the computer vision field with its great performance on various tasks. In order to fully utilize the ViT-based architecture in various applications, proper visualization methods with a decent localization performance are necessary, but these methods employed in CNN-based models are still not available in ViT due to its unique structure. In this work, we propose an attention-guided visualization method applied to ViT that provides a high-level semantic explanation for its decision. Our method selectively aggregates the gradients directly propagated from the classification output to each self-attention, collecting the contribution of image features extracted from each location of the input image. These gradients are additionally guided by the normalized self-attention scores, which are the pairwise patch correlation scores. They are used to supplement the gradients on the patch-level context information efficiently detected by the self-attention mechanism. This approach of our method provides elaborate high-level semantic explanations with great localization performance only with the class labels. As a result, our method outperforms the previous leading explainability methods of ViT in the weakly-supervised localization task and presents great capability in capturing the full instances of the target class object. Meanwhile, our method provides a visualization that faithfully explains the model, which is demonstrated in the perturbation comparison test.


Certifying the Fairness of KNN in the Presence of Dataset Bias

arXiv.org Artificial Intelligence

We propose a method for certifying the fairness of the classification result of a widely used supervised learning algorithm, the k-nearest neighbors (KNN), under the assumption that the training data may have historical bias caused by systematic mislabeling of samples from a protected minority group. To the best of our knowledge, this is the first certification method for KNN based on three variants of the fairness definition: individual fairness, $\epsilon$-fairness, and label-flipping fairness. We first define the fairness certification problem for KNN and then propose sound approximations of the complex arithmetic computations used in the state-of-the-art KNN algorithm. This is meant to lift the computation results from the concrete domain to an abstract domain, to reduce the computational cost. We show effectiveness of this abstract interpretation based technique through experimental evaluation on six datasets widely used in the fairness research literature. We also show that the method is accurate enough to obtain fairness certifications for a large number of test inputs, despite the presence of historical bias in the datasets.


Testing the Reliability of ChatGPT for Text Annotation and Classification: A Cautionary Remark

arXiv.org Artificial Intelligence

Recent studies have demonstrated promising potential of ChatGPT for various text annotation and classification tasks. However, ChatGPT is non-deterministic which means that, as with human coders, identical input can lead to different outputs. Given this, it seems appropriate to test the reliability of ChatGPT. Therefore, this study investigates the consistency of ChatGPT's zero-shot capabilities for text annotation and classification, focusing on different model parameters, prompt variations, and repetitions of identical inputs. Based on the real-world classification task of differentiating website texts into news and not news, results show that consistency in ChatGPT's classification output can fall short of scientific thresholds for reliability. For example, even minor wording alterations in prompts or repeating the identical input can lead to varying outputs. Although pooling outputs from multiple repetitions can improve reliability, this study advises caution when using ChatGPT for zero-shot text annotation and underscores the need for thorough validation, such as comparison against human-annotated data. The unsupervised application of ChatGPT for text annotation and classification is not recommended.


Backdoor Mitigation in Deep Neural Networks via Strategic Retraining

arXiv.org Artificial Intelligence

Deep Neural Networks (DNN) are becoming increasingly more important in assisted and automated driving. Using such entities which are obtained using machine learning is inevitable: tasks such as recognizing traffic signs cannot be developed reasonably using traditional software development methods. DNN however do have the problem that they are mostly black boxes and therefore hard to understand and debug. One particular problem is that they are prone to hidden backdoors. This means that the DNN misclassifies its input, because it considers properties that should not be decisive for the output. Backdoors may either be introduced by malicious attackers or by inappropriate training. In any case, detecting and removing them is important in the automotive area, as they might lead to safety violations with potentially severe consequences. In this paper, we introduce a novel method to remove backdoors. Our method works for both intentional as well as unintentional backdoors. We also do not require prior knowledge about the shape or distribution of backdoors. Experimental evidence shows that our method performs well on several medium-sized examples.


Adaptive Edge Offloading for Image Classification Under Rate Limit

arXiv.org Artificial Intelligence

This paper considers a setting where embedded devices are used to acquire and classify images. Because of limited computing capacity, embedded devices rely on a parsimonious classification model with uneven accuracy. When local classification is deemed inaccurate, devices can decide to offload the image to an edge server with a more accurate but resource-intensive model. Resource constraints, e.g., network bandwidth, however, require regulating such transmissions to avoid congestion and high latency. The paper investigates this offloading problem when transmissions regulation is through a token bucket, a mechanism commonly used for such purposes. The goal is to devise a lightweight, online offloading policy that optimizes an application-specific metric (e.g., classification accuracy) under the constraints of the token bucket. The paper develops a policy based on a Deep Q-Network (DQN), and demonstrates both its efficacy and the feasibility of its deployment on embedded devices. Of note is the fact that the policy can handle complex input patterns, including correlation in image arrivals and classification accuracy. The evaluation is carried out by performing image classification over a local testbed using synthetic traces generated from the ImageNet image classification benchmark. Implementation of this work is available at https://github.com/qiujiaming315/edgeml-dqn.


Analyzing Hindu Verses with NLP

#artificialintelligence

'Text Classification' is a Machine Learning technique which is used to analyse text and then organize or categorize them based on patterns or structure. Categorization of text has a lot of applications in the world of artificial intelligence such as news article analysis, hate speech identification, gender classification etc. In this article I use'Text Classification' with Natural Language Processing (NLP) using Python to analyze Hindu religious verses and categorize them. Before we delve deeper into the technical side of Python, let's quickly see what data we will be working with. The'Sahasranama' -- literally 1000 names (where'sahasra' means 1000 and'nama' means names)-- is a hymn of praise offered to God in Hinduism.


Analyzing Hindu Verses with NLP

#artificialintelligence

'Text Classification' is a Machine Learning technique which is used to analyse text and then organize or categorize them based on patterns or structure. Categorization of text has a lot of applications in the world of artificial intelligence such as news article analysis, hate speech identification, gender classification etc. In this article I use'Text Classification' with Natural Language Processing (NLP) using Python to analyze Hindu religious verses and categorize them. Before we delve deeper into the technical side of Python, let's quickly see what data we will be working with. The'Sahasranama' -- literally 1000 names (where'sahasra' means 1000 and'nama' means names)-- is a hymn of praise offered to God in Hinduism.


Detecting Early Onset of Depression from Social Media Text using Learned Confidence Scores

arXiv.org Machine Learning

Computational research on mental health disorders from written texts covers an interdisciplinary area between natural language processing and psychology. A crucial aspect of this problem is prevention and early diagnosis, as suicide resulted from depression being the second leading cause of death for young adults. In this work, we focus on methods for detecting the early onset of depression from social media texts, in particular from Reddit. To that end, we explore the eRisk 2018 dataset and achieve good results with regard to the state of the art by leveraging topic analysis and learned confidence scores to guide the decision process.