AITopics | Accuracy

Collaborating Authors

Accuracy

News Overviews Instructional Materials AI-Alerts Classics

CelebHair: A New Large-Scale Dataset for Hairstyle Recommendation based on CelebA

Chen, Yutao, Zhang, Yuxuan, Huang, Zhongrui, Luo, Zhenyao, Chen, Jinpeng

arXiv.org Artificial IntelligenceApr-14-2021

In this paper, we present a new large-scale dataset for hairstyle recommendation, CelebHair, based on the celebrity facial attributes dataset, CelebA. Our dataset inherited the majority of facial images along with some beauty-related facial attributes from CelebA. Additionally, we employed facial landmark detection techniques to extract extra features such as nose length and pupillary distance, and deep convolutional neural networks for face shape and hairstyle classification. Empirical comparison has demonstrated the superiority of our dataset to other existing hairstyle-related datasets regarding variety, veracity, and volume. Analysis and experiments have been conducted on the dataset in order to evaluate its robustness and usability.

dataset, face shape, hairstyle recommendation, (12 more...)

arXiv.org Artificial Intelligence

2104.06885

Country: Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

Image Manipulation Detection by Multi-View Multi-Scale Supervision

Chen, Xinru, Dong, Chengbo, Ji, Jiaqi, Cao, Juan, Li, Xirong

arXiv.org Artificial IntelligenceApr-14-2021

The key challenge of image manipulation detection is how to learn generalizable features that are sensitive to manipulations in novel data, whilst specific to prevent false alarms on authentic images. Current research emphasizes the sensitivity, with the specificity overlooked. In this paper we address both aspects by multi-view feature learning and multi-scale supervision. By exploiting noise distribution and boundary artifact surrounding tampered regions, the former aims to learn semantic-agnostic and thus more generalizable features. The latter allows us to learn from authentic images which are nontrivial to taken into account by current semantic segmentation network based methods. Our thoughts are realized by a new network which we term MVSS-Net. Extensive experiments on five benchmark sets justify the viability of MVSS-Net for both pixel-level and image-level manipulation detection.

detection, manipulation detection, mvss-net, (16 more...)

arXiv.org Artificial Intelligence

2104.06832

Country: Asia > China (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

Thresholded Graphical Lasso Adjusts for Latent Variables: Application to Functional Neural Connectivity

Wang, Minjie, Allen, Genevera I.

arXiv.org Machine LearningApr-13-2021

Emerging neuroscience technologies such as electrophysiology and calcium imaging can record from tens-of-thousands of neurons in the live animal brain while the animal is responding to stimuli and behaving freely. Scientists often seek to understand how neurons are communicating during certain stimuli or activities, something termed functional neural connectivity. To learn functional connections from large-scale neuroscience data, many have proposed using probabilistic graphical models (Yatsenko et al. 2015; Narayan et al. 2015; Chang et al. 2019), where each edge denotes conditional dependencies between nodes. Yet, applying such models in neuroscience poses a major challenge as only a small subset of neurons in the animal brain can be recorded at once, leading to abundant latent variables. Chandrasekaran et al. (2012) termed this the latent variable graphical model problem and proposed a convex program to solve this. While conceptually attractive, this approach poses several statistical, computational and practical challenges, discussed subsequently, for the task of learning functional neural connectivity from large-scale neuroscience data. Because of this, we are motivated to consider an incredibly simple solution to the latent variable graphical model problem: apply a hard thresholding operator to existing graph selection estimators. In this paper, we study this approach showing that thresholding has more desirable theoretical properties as well as superior empirical performance.

graphical lasso, graphical model, latent variable, (13 more...)

arXiv.org Machine Learning

2104.06389

Country: North America > United States > Texas > Harris County > Houston (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

Add feedback

A Semi-Supervised Classification Method of Apicomplexan Parasites and Host Cell Using Contrastive Learning Strategy

Ren, Yanni, Deng, Hangyu, Jiang, Hao, Hu, Jinglu

arXiv.org Artificial IntelligenceApr-13-2021

A common shortfall of supervised learning for medical imaging is the greedy need for human annotations, which is often expensive and time-consuming to obtain. This paper proposes a semi-supervised classification method for three kinds of apicomplexan parasites and non-infected host cells microscopic images, which uses a small number of labeled data and a large number of unlabeled data for training. There are two challenges in microscopic image recognition. The first is that salient structures of the microscopic images are more fuzzy and intricate than natural images' on a real-world scale. The second is that insignificant textures, like background staining, lightness, and contrast level, vary a lot in samples from different clinical scenarios. To address these challenges, we aim to learn a distinguishable and appearance-invariant representation by contrastive learning strategy. On one hand, macroscopic images, which share similar shape characteristics in morphology, are introduced to contrast for structure enhancement. On the other hand, different appearance transformations, including color distortion and flittering, are utilized to contrast for texture elimination. In the case where only 1% of microscopic images are labeled, the proposed method reaches an accuracy of 94.90% in a generalized testing set.

macro data, micro data, representation, (12 more...)

arXiv.org Artificial Intelligence

2104.06593

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > Hawaii (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.40)

Industry:

Health & Medicine > Therapeutic Area (0.49)
Health & Medicine > Diagnostic Medicine > Imaging (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes

Song, Jie, He, Yeye

arXiv.org Artificial IntelligenceApr-13-2021

Complex data pipelines are increasingly common in diverse applications such as BI reporting and ML modeling. These pipelines often recur regularly (e.g., daily or weekly), as BI reports need to be refreshed, and ML models need to be retrained. However, it is widely reported that in complex production pipelines, upstream data feeds can change in unexpected ways, causing downstream applications to break silently that are expensive to resolve. Data validation has thus become an important topic, as evidenced by notable recent efforts from Google and Amazon, where the objective is to catch data quality issues early as they arise in the pipelines. Our experience on production data suggests, however, that on string-valued data, these existing approaches yield high false-positive rates and frequently require human intervention. In this work, we develop a corpus-driven approach to auto-validate \emph{machine-generated data} by inferring suitable data-validation "patterns" that accurately describe the underlying data domain, which minimizes false positives while maximizing data quality issues caught. Evaluations using production data from real data lakes suggest that Auto-Validate is substantially more effective than existing methods. Part of this technology ships as an Auto-Tag feature in Microsoft Azure Purview.

digit, proceedings, query column, (14 more...)

arXiv.org Artificial Intelligence

2104.04659

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Michigan (0.04)
Asia > China (0.04)

Genre: Research Report (1.00)

Industry:

Information Technology (0.48)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Data Science > Data Quality (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Data-driven Design of Context-aware Monitors for Hazard Prediction in Artificial Pancreas Systems

Zhou, Xugui, Ahmed, Bulbul, Aylor, James H., Asare, Philip, Alemzadeh, Homa

arXiv.org Artificial IntelligenceApr-13-2021

Medical Cyber-physical Systems (MCPS) are vulnerable to accidental or malicious faults that can target their controllers and cause safety hazards and harm to patients. This paper proposes a combined model and data-driven approach for designing context-aware monitors that can detect early signs of hazards and mitigate them in MCPS. We present a framework for formal specification of unsafe system context using Signal Temporal Logic (STL) combined with an optimization method for patient-specific refinement of STL formulas based on real or simulated faulty data from the closed-loop system for the generation of monitor logic. We evaluate our approach in simulation using two state-of-the-art closed-loop Artificial Pancreas Systems (APS). The results show the context-aware monitor achieves up to 1.4 times increase in average hazard prediction accuracy (F1-score) over several baseline monitors, reduces false-positive and false-negative rates, and enables hazard mitigation with a 54% success rate while decreasing the average risk for patients.

cawt monitor, controller, hazard, (16 more...)

arXiv.org Artificial Intelligence

2104.02545

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Virginia > Albemarle County > Charlottesville (0.14)
North America > United States > Florida > Alachua County > Gainesville (0.14)
(5 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

A Conceptual Framework for Establishing Trust in Real World Intelligent Systems

Guckert, Michael, Gumpfer, Nils, Hannig, Jennifer, Keller, Till, Urquhart, Neil

arXiv.org Artificial IntelligenceApr-12-2021

Abstract: Intelligent information systems that contain emergent elements often encounter trust problems because results do not get sufficiently explained and the procedure itself can not be fully retraced. This is caused by a control flow depending either on stochastic elements or on the structure and relevance of the input data. Trust in such algorithms can be established by letting users interact with the system so that they can explore results and find patterns that can be compared with their expected solution. Reflecting features and patterns of human understanding of a domain against algorithmic results can create awareness of such patterns and may increase the trust that a user has in the solution. If expectations are not met, close inspection can be used to decide whether a solution conforms to the expectations or whether it goes beyond the expected. By either accepting or rejecting a solution, the user's set of expectations evolves and a learning process for the users is established. In this paper we present a conceptual framework that reflects and supports this process. The framework is the result of an analysis of two exemplary case studies from two different disciplines with information systems that assist experts in their complex tasks. Keywords: Intelligent Systems, AI, Trust, Explainable AI, Knowledge Management, Knowledge Patterns 1. INTRODUCTION uncommon and have been constructed in uncommon ways. Such techniques, a class to which systems that we now Human expertise in many aspects is largely based on call intelligent systems belong to, produce results of high prior knowledge and familiar patterns, which have either complexity (e.g.

algorithm, explanation, knowledge, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.cogsys.2021.04.001

2104.05432

Country:

Europe > Germany (0.28)
Oceania > Australia > New South Wales > Sydney (0.14)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Transportation (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Information Technology (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.68)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
(3 more...)

Add feedback

Semi-Supervised Learning of Classifiers from a Statistical Perspective: A Brief Review

Ahfock, Daniel, McLachlan, Geoffrey J.

arXiv.org Machine LearningApr-12-2021

Due to the scarcity and often high acquisition cost of labelled data, machine learning methods that make effective use of large quantities of unlabelled data are being increasingly used. One such method is semi-supervised learning (SSL) where, in addition to labelled data, possibly large numbers of unlabelled observations are available at the time of the construction of the classification rule (classifier) to be used. Not surprisingly, semisupervised learning approaches have been gaining much attention in both the application oriented and the theoretical machine learning communities. However, theoretical analysis of SSL has so far been scarce. But last year, Ahfock and McLachlan (2020) provided an asymptotic basis on how to increase in certain situations the accuracy of the commonly used linear discriminant function formed from a partially classified sample as in SSL (Ahfock and McLachlan, 2020). The increase in accuracy can be of sufficient magnitude for this SSL-based classifier to have smaller error rate than that if it were formed from a completely classified sample.

entropy, mclachlan, probability, (15 more...)

arXiv.org Machine Learning

2104.04046

Country:

North America > United States > New Jersey > Hudson County > Hoboken (0.14)
North America > United States > New York (0.04)
Asia > Middle East > Jordan (0.04)
(4 more...)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.95)

Add feedback

Artificial Intelligence Methods Based Hierarchical Classification of Frontotemporal Dementia to Improve Diagnostic Predictability

Poonam, Km, Guha, Rajlakshmi, Chakrabarti, Partha P

arXiv.org Artificial IntelligenceApr-12-2021

Patients with Frontotemporal Dementia (FTD) have impaired cognitive abilities, executive and behavioral traits, loss of language ability, and decreased memory capabilities. Based on the distinct patterns of cortical atrophy and symptoms, the FTD spectrum primarily includes three variants: behavioral variant FTD (bvFTD), non-fluent variant primary progressive aphasia (nfvPPA), and semantic variant primary progressive aphasia (svPPA). The purpose of this study is to classify MRI images of every single subject into one of the spectrums of the FTD in a hierarchical order by applying data-driven techniques of Artificial Intelligence (AI) on cortical thickness data. This data is computed by FreeSurfer software. We used the Smallest Univalue Segment Assimilating Nucleus (SUSAN) technique to minimize the noise in cortical thickness data. Specifically, we took 204 subjects from the frontotemporal lobar degeneration neuroimaging initiative (NIFTD) database to validate this approach, and each subject was diagnosed in one of the diagnostic categories (bvFTD, svPPA, nfvPPA and cognitively normal). Our proposed automated classification model yielded classification accuracy of 86.5, 76, and 72.7 with support vector machine (SVM), linear discriminant analysis (LDA), and Naive Bayes methods, respectively, in 10-fold cross-validation analysis, which is a significant improvement on a traditional single multi-class model with an accuracy of 82.7, 73.4, and 69.2.

accuracy, dementia, thickness, (13 more...)

arXiv.org Artificial Intelligence

2104.05235

Country:

North America > United States > District of Columbia > Washington (0.04)
Asia > India > West Bengal > Kharagpur (0.04)

Genre: Research Report (0.83)

Industry: Health & Medicine > Therapeutic Area > Neurology > Dementia (0.76)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.50)

Add feedback

Parallel integrative learning for large-scale multi-response regression with incomplete outcomes

Dong, Ruipeng, Li, Daoji, Zheng, Zemin

arXiv.org Machine LearningApr-11-2021

Multi-task learning is increasingly used to investigate the association structure between multiple responses and a single set of predictor variables in many applications. In the era of big data, the coexistence of incomplete outcomes, large number of responses, and high dimensionality in predictors poses unprecedented challenges in estimation, prediction, and computation. In this paper, we propose a scalable and computationally efficient procedure, called PEER, for large-scale multi-response regression with incomplete outcomes, where both the numbers of responses and predictors can be high-dimensional. Motivated by sparse factor regression, we convert the multi-response regression into a set of univariate-response regressions, which can be efficiently implemented in parallel. Under some mild regularity conditions, we show that PEER enjoys nice sampling properties including consistency in estimation, prediction, and variable selection. Extensive simulation studies show that our proposal compares favorably with several existing methods in estimation accuracy, variable selection, and computation efficiency.

condition 3, incomplete outcome, regression, (16 more...)

arXiv.org Machine Learning

doi: 10.1016/j.csda.2021.107243

2104.05076

Country:

Asia > China > Anhui Province > Hefei (0.04)
North America > United States > California > Orange County > Fullerton (0.04)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

Add feedback