AITopics

2006.12285

Country:

Europe > Germany > Hesse > Darmstadt Region > Frankfurt (0.05)
North America > United States > Virginia (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)

#artificialintelligenceJul-15-2020, 08:11:10 GMT

Modelling Credit Card Fraud Detection

Credit card frauds are a "still growing" problem in the world. Losses in frauds were estimated in more than US$27 billion in 2018 and are still projected to grow significantly for the next years as this article shows. With more and more people using credit cards in their daily routine, also increased the interest of criminals in opportunities to make money from that. The development of new technologies puts both criminals and credit card companies in a constant race to improve their systems and techniques. With that amount of money at stake, Machine Learning is surely not a new word for credit card companies, which have been investing on that long before it was a trend, to create and optimize models of risk and fraud management.

artificial intelligence, fraud, machine learning, (15 more...)

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Information Technology (1.00)
Banking & Finance > Credit (1.00)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.31)

#artificialintelligenceJul-15-2020, 08:10:32 GMT

AI Learns from Lung CT Scans to Diagnose COVID-19

Although the initial wave of the SARS-CoV-2 pandemic has abated in many countries, healthcare providers are still looking to identify as many COVID-19 patients as possible and contain the disease. Fast and accurate diagnosis is especially important when unsuspecting patients with a coronavirus infection come to the hospital with health complaints but don't yet show symptoms of COVID-19. Nasal swab samples analyzed by RT-PCR are currently recommended for the diagnosis of COVID-19, however, supply shortages, a wait time of up to two days for results, and a false negative rate as high as 1 in 5 mean alternative, large-scale COVID-19 screening tools are still being sought. SARS-CoV-2 is known to damage lung tissue, and in a distinct way that doctors are now seeking to exploit for new diagnostic approaches. Many COVID-19 patients develop pneumonia, which can progress to respiratory failure and sometimes death.

artificial intelligence, machine learning, pneumonia, (18 more...)

Country:

North America > United States (0.16)
Asia > China (0.09)
Asia > Macao (0.06)
(3 more...)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.91)

Patel, Parth, Passi, Kalpdrum, Jain, Chakresh Kumar

Prediction of Cancer Microarray and DNA Methylation Data using Non-negative Matrix Factorization

arXiv.org Machine LearningJul-15-2020

Over the past few years, there has been a considerable spread of microarray technology in many biological patterns, particularly in those pertaining to cancer diseases like leukemia, prostate, colon cancer, etc. The primary bottleneck that one experiences in the proper understanding of such datasets lies in their dimensionality, and thus for an efficient and effective means of studying the same, a reduction in their dimension to a large extent is deemed necessary. This study is a bid to suggesting different algorithms and approaches for the reduction of dimensionality of such microarray datasets. This study exploits the matrix-like structure of such microarray data and uses a popular technique called Non-Negative Matrix Factorization (NMF) to reduce the dimensionality, primarily in the field of biological data. Classification accuracies are then compared for these algorithms. This technique gives an accuracy of 98%.

artificial intelligence, bioinformatics, machine learning, (13 more...)

doi: 10.5121/csit.2020.100906

2007.08652

Country:

Asia > India > NCT > Delhi (0.04)
Asia > India > Gujarat (0.04)
North America > Canada > Ontario > Thunder Bay District > Sudbury (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Leukemia (0.51)

Technology:

Information Technology > Biomedical Informatics > Translational Bioinformatics (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.47)

Pascual-Triana, José Daniel, Charte, David, Arroyo, Marta Andrés, Fernández, Alberto, Herrera, Francisco

Revisiting Data Complexity Metrics Based on Morphology for Overlap and Imbalance: Snapshot, New Overlap Number of Balls Metrics and Singular Problems Prospect

arXiv.org Machine LearningJul-15-2020

Data Science and Machine Learning have become fundamental assets for companies and research institutions alike. As one of its fields, supervised classification allows for class prediction of new samples, learning from given training data. However, some properties can cause datasets to be problematic to classify. In order to evaluate a dataset a priori, data complexity metrics have been used extensively. They provide information regarding different intrinsic characteristics of the data, which serve to evaluate classifier compatibility and a course of action that improves performance. However, most complexity metrics focus on just one characteristic of the data, which can be insufficient to properly evaluate the dataset towards the classifiers' performance. In fact, class overlap, a very detrimental feature for the classification process (especially when imbalance among class labels is also present) is hard to assess. This research work focuses on revisiting complexity metrics based on data morphology. In accordance to their nature, the premise is that they provide both good estimates for class overlap, and great correlations with the classification performance. For that purpose, a novel family of metrics have been developed. Being based on ball coverage by classes, they are named after Overlap Number of Balls. Finally, some prospects for the adaptation of the former family of metrics to singular (more complex) problems are discussed.

artificial intelligence, fuzzy logic, machine learning, (18 more...)

2007.07935

Country:

South America > Paraguay > Asunción > Asunción (0.04)
North America > United States > Wisconsin (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre:

Research Report (0.82)
Instructional Material > Course Syllabus & Notes (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.67)

#artificialintelligenceJul-14-2020, 13:05:20 GMT

Traceable raises $20 million for AI system that shields cloud app APIs from cyberattacks

Traceable, a startup developing an end-to-end cloud app security solution, today emerged from stealth with $20 million in venture equity financing. Newly flush with capital, CEO Jyoti Bansal intends to focus on acquiring customers globally while growing Traceable's team and accelerating R&D. Cloud-native apps are often built with hundreds or even thousands of API microservices (i.e., loosely coupled services), making them difficult to protect at scale. Gartner predicts that by 2022, API abuses will be the most frequent attack vector, which isn't surprising considering API calls represented 83% of web traffic as of 2018. Traceable ostensibly protects these APIs with machine learning algorithms that analyze app activity from the user and the session all the way down to the code.

artificial intelligence, machine learning, traceable, (15 more...)

Country: North America > United States > Colorado > Denver County > Denver (0.05)

Industry:

Information Technology > Security & Privacy (0.67)
Government > Military > Cyberwarfare (0.52)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.35)

Petrides, George, Verbeke, Wouter

Misclassification cost-sensitive ensemble learning: A unifying framework

arXiv.org Machine LearningJul-14-2020

The task of supervised machine learning is given a set of recorded observations and their outcomes to predict the outcome of new observations. Standard classification techniques aim for the highest overall accuracy or, equivalently, for the smallest total error, and include among others support vector machines, Bayesian classifiers, logistic regression, decision tree classifiers such as CART [6] and C4.5 [38], and ensemble methods which build several classifiers and aggregate their predictions such as Bagging [4], AdaBoost [16] and Random Forests [5]. Of particular interest in certain domains are binary classifiers which deal with cases where only two classes of outcomes are considered, such as fraudulent and legitimate credit card transactions, responders and non-responders to a marketing campaign, patients with and without cancer, intrusive and authorised network access, and defaulting and repaying debtors to name a few. In most of these cases, one of the classes is a small minority and consequently traditional classifiers might classify all of its members as belonging to the majority class without any significant overall accuracy loss. The severity of this class imbalance becomes more noticeable when failing to correctly predict a minority class member is more costly than doing so with a member of the majority class, as the case often is. A remedy to the undesirable situation just described are classifiers which, instead of accuracy, take misclassification costs into account and are thus termed cost-sensitive. We illustrate this idea in the credit card fraud detection framework: accepting a fraudulent transaction as legitimate incurs a cost equal to its amount.

artificial intelligence, classifier, machine learning, (18 more...)

2007.07361

Country:

Europe > Austria > Vienna (0.14)
North America > United States > California (0.04)
Europe > Norway > Western Norway > Vestland > Bergen (0.04)
Europe > Belgium (0.04)

Genre:

Research Report > New Finding (0.34)
Research Report > Experimental Study (0.34)

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Banking & Finance > Credit (0.89)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

Garchery, Mathieu, Granitzer, Michael

ADSAGE: Anomaly Detection in Sequences of Attributed Graph Edges applied to insider threat detection at fine-grained level

arXiv.org Machine LearningJul-14-2020

Previous works on the CERT insider threat detection case have neglected graph and text features despite their relevance to describe user behavior. Additionally, existing systems heavily rely on feature engineering and audit data aggregation to detect malicious activities. This is time consuming, requires expert knowledge and prevents tracing back alerts to precise user actions. To address these issues we introduce ADSAGE to detect anomalies in audit log events modeled as graph edges. Our general method is the first to perform anomaly detection at edge level while supporting both edge sequences and attributes, which can be numeric, categorical or even text. We describe how ADSAGE can be used for fine-grained, event level insider threat detection in different audit logs from the CERT use case. Remarking that there is no standard benchmark for the CERT problem, we use a previously proposed evaluation setting based on realistic recall-based metrics. We evaluate ADSAGE on authentication, email traffic and web browsing logs from the CERT insider threat datasets, as well as on real-world authentication events. ADSAGE is effective to detect anomalies in authentications, modeled as user to computer interactions, and in email communications. Simple baselines give surprisingly strong results as well. We also report performance split by malicious scenarios present in the CERT datasets: interestingly, several detectors are complementary and could be combined to improve detection. Overall, our results show that graph features are informative to characterize malicious insider activities, and that detection at fine-grained level is possible.

data mining, detection, machine learning, (18 more...)

2007.06985

Country:

North America > United States > Hawaii (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report > New Finding (0.87)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Zhou, Weimin, Li, Hua, Anastasio, Mark A.

Approximating the Ideal Observer for joint signal detection and localization tasks by use of supervised learning methods

arXiv.org Machine LearningJul-14-2020

Medical imaging systems are commonly assessed and optimized by use of objective measures of image quality (IQ). The Ideal Observer (IO) performance has been advocated to provide a figure-of-merit for use in assessing and optimizing imaging systems because the IO sets an upper performance limit among all observers. When joint signal detection and localization tasks are considered, the IO that employs a modified generalized likelihood ratio test maximizes observer performance as characterized by the localization receiver operating characteristic (LROC) curve. Computations of likelihood ratios are analytically intractable in the majority of cases. Therefore, sampling-based methods that employ Markov-Chain Monte Carlo (MCMC) techniques have been developed to approximate the likelihood ratios. However, the applications of MCMC methods have been limited to relatively simple object models. Supervised learning-based methods that employ convolutional neural networks have been recently developed to approximate the IO for binary signal detection tasks. In this paper, the ability of supervised learning-based methods to approximate the IO for joint signal detection and localization tasks is explored. Both background-known-exactly and background-known-statistically signal detection and localization tasks are considered. The considered object models include a lumpy object model and a clustered lumpy model, and the considered measurement noise models include Laplacian noise, Gaussian noise, and mixed Poisson-Gaussian noise. The LROC curves produced by the supervised learning-based method are compared to those produced by the MCMC approach or analytical computation when feasible. The potential utility of the proposed method for computing objective measures of IQ for optimizing imaging system performance is explored.

artificial intelligence, detection-localization task, machine learning, (16 more...)

doi: 10.1109/TMI.2020.3009022

2006.00112

Country:

North America > United States > Illinois > Champaign County > Urbana (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Missouri > St. Louis County > St. Louis (0.04)

Genre: Research Report (0.50)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

#artificialintelligenceJul-13-2020, 19:09:02 GMT

Developing Machine Learning Pipelines

Even the most experienced Data Scientists are not always familiar with the best practices involved with developing a Machine Learning pipeline. There is a lot of confusion about what steps should be involved, what should be their sequence and, in general, how to ensure that the insights you create are accurate and valuable. There is also a very limited number of good resources describing a practical and correct approach. However, after many data science projects, you begin to realise the approach to building a pipeline always remains the same. Machine Learning pipelines are modular, and, depending on the situation, some steps can be added or skipped.

artificial intelligence, information, machine learning, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)