AITopics | Accuracy

Collaborating Authors

Accuracy

News Overviews Instructional Materials AI-Alerts Classics

Can we Estimate Truck Accident Risk from Telemetric Data using Machine Learning?

Hébert, Antoine, Marineau, Ian, Gervais, Gilles, Glatard, Tristan, Jaumard, Brigitte

arXiv.org Machine LearningJul-17-2020

Road accidents have a high societal cost that could be reduced through improved risk predictions using machine learning. This study investigates whether telemetric data collected on long-distance trucks can be used to predict the risk of accidents associated with a driver. We use a dataset provided by a truck transportation company containing the driving data of 1,141 drivers for 18 months. We evaluate two different machine learning approaches to perform this task. In the first approach, features are extracted from the time series data using the FRESH algorithm and then used to estimate the risk using Random Forests. In the second approach, we use a convolutional neural network to directly estimate the risk from the time-series data. We find that neither approach is able to successfully estimate the risk of accidents on this dataset, in spite of many methodological attempts. We discuss the difficulties of using telemetric data for the estimation of the risk of accidents that could explain this negative result.

accident, artificial intelligence, machine learning, (18 more...)

arXiv.org Machine Learning

2007.09167

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > United States > New York (0.04)

Genre: Research Report > New Finding (0.68)

Industry:

Transportation > Ground > Road (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Dealing with Nuisance Parameters using Machine Learning in High Energy Physics: a Review

Dorigo, Tommaso, de Castro, Pablo

arXiv.org Machine LearningJul-17-2020

Of these, probably the most common is the use of supervised classification to construct low-dimensional event summaries, which are informative to carry out statistical inference for a given set of parameters of interest. The learned summary statistics -functions of the data that are informative on their relevant properties-can efficiently combine high-dimensional information from each event into one or a few variables which can be used as the basis of statistical inference. The informational source for this compression are simulated observations produced by a complex generative model; the latter reproduces the chain of physical processes occurring in subatomic collisions and the subsequent interaction of the produced final state particles with the detection elements.

artificial intelligence, machine learning, nuisance parameter, (18 more...)

arXiv.org Machine Learning

2007.09121

Country: Europe > Italy (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Multi-Classifier selection-fusion framework: application to NDT of complex metallic parts

Yaghoubi, Vahid, Cheng, Liangliang, Van Paepegem, Wim, Kersemans, Mathias

arXiv.org Machine LearningJul-17-2020

Recent advances in computational methods, material science, and manufacturing technologies reveal promising potentials for using geometrically complex parts to optimize the performance of structural systems. However, this potential has not yet been activated partly due to the immaturity of nondestructive testing (NDT) of such complex parts. Process compensated resonance testing (PCRT) is one of the methods that are in the focus of researchers for this purpose. The key to success for the PCRT approach is to use high-frequency vibration data in conjunction with statistical pattern recognition methods for supervised classification of parts in terms of their structural quality. In this paper, a multi classifier selection-fusion framework based on the Dempster-Shafer theory is proposed. Two new weighting approaches are introduced to enhance the fusion performance, and as such the classification performance. The effectiveness of the proposed framework is validated by its application to six UCI machine learning datasets and one experimental dataset collected from polycrystalline Nickel alloy first-stage turbine blades with a variety of damage features. Comparison with four state-of-the-art fusion techniques shows the good performance of the introduced classifier selection-fusion framework.

artificial intelligence, classifier, machine learning, (17 more...)

arXiv.org Machine Learning

2007.08789

Country:

Europe > Belgium > Flanders (0.04)
North America > United States > Wisconsin (0.04)
North America > United States > Ohio > Franklin County > Columbus (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Add feedback

Technologies for Trustworthy Machine Learning: A Survey in a Socio-Technical Context

Toreini, Ehsan, Aitken, Mhairi, Coopamootoo, Kovila P. L., Elliott, Karen, Zelaya, Vladimiro Gonzalez, Missier, Paolo, Ng, Magdalene, van Moorsel, Aad

arXiv.org Artificial IntelligenceJul-17-2020

Concerns about the societal impact of AI-based services and systems has encouraged governments and other organisations around the world to propose AI policy frameworks to address fairness, accountability, transparency and related topics. To achieve the objectives of these frameworks, the data and software engineers who build machine-learning systems require knowledge about a variety of relevant supporting tools and techniques. In this paper we provide an overview of technologies that support building trustworthy machine learning systems, i.e., systems whose properties justify that people place trust in them. We argue that four categories of system properties are instrumental in achieving the policy objectives, namely fairness, explainability, auditability and safety & security (FEAS). We discuss how these properties need to be considered across all stages of the machine learning life cycle, from data collection through run-time model inference. As a consequence, we survey in this paper the main technologies with respect to all four of the FEAS properties, for data-centric as well as model-centric stages of the machine learning system life cycle. We conclude with an identification of open research problems, with a particular focus on the connection between trustworthy machine learning technologies and their implications for individuals and society.

artificial intelligence, machine learning, proceedings, (14 more...)

arXiv.org Artificial Intelligence

2007.08911

Country:

North America > United States (1.00)
Asia (1.00)
Europe > United Kingdom (0.93)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
(2 more...)

Add feedback

Human-Expert-Level Brain Tumor Detection Using Deep Learning with Data Distillation and Augmentation

Lu, Diyuan, Polomac, Nenad, Gacheva, Iskra, Hattingen, Elke, Triesch, Jochen

arXiv.org Machine LearningJul-16-2020

The application of Deep Learning (DL) for medical diagnosis is often hampered by two problems. First, the amount of training data may be scarce, as it is limited by the number of patients who have acquired the condition to be diagnosed. Second, the training data may be corrupted by various types of noise. Here, we study the problem of brain tumor detection from magnetic resonance spectroscopy (MRS) data, where both types of problems are prominent. To overcome these challenges, we propose a new method for training a deep neural network that distills particularly representative training examples and augments the training data by mixing these samples from one class with those from the same and other classes to create additional training samples. We demonstrate that this technique substantially improves performance, allowing our method to reach human-expert-level accuracy with just a few thousand training examples. Interestingly, the network learns to rely on features of the data that are usually ignored by human experts, suggesting new directions for future research.

artificial intelligence, inductive learning, machine learning, (18 more...)

arXiv.org Machine Learning

2006.12285

Country:

Europe > Germany > Hesse > Darmstadt Region > Frankfurt (0.05)
North America > United States > Virginia (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)

Add feedback

Modelling Credit Card Fraud Detection

#artificialintelligenceJul-15-2020, 08:11:10 GMT

Credit card frauds are a "still growing" problem in the world. Losses in frauds were estimated in more than US$27 billion in 2018 and are still projected to grow significantly for the next years as this article shows. With more and more people using credit cards in their daily routine, also increased the interest of criminals in opportunities to make money from that. The development of new technologies puts both criminals and credit card companies in a constant race to improve their systems and techniques. With that amount of money at stake, Machine Learning is surely not a new word for credit card companies, which have been investing on that long before it was a trend, to create and optimize models of risk and fraud management.

artificial intelligence, fraud, machine learning, (15 more...)

#artificialintelligence

Industry:

Law Enforcement & Public Safety > Fraud (1.00)
Information Technology (1.00)
Banking & Finance > Credit (1.00)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.31)

Add feedback

AI Learns from Lung CT Scans to Diagnose COVID-19

#artificialintelligenceJul-15-2020, 08:10:32 GMT

Although the initial wave of the SARS-CoV-2 pandemic has abated in many countries, healthcare providers are still looking to identify as many COVID-19 patients as possible and contain the disease. Fast and accurate diagnosis is especially important when unsuspecting patients with a coronavirus infection come to the hospital with health complaints but don't yet show symptoms of COVID-19. Nasal swab samples analyzed by RT-PCR are currently recommended for the diagnosis of COVID-19, however, supply shortages, a wait time of up to two days for results, and a false negative rate as high as 1 in 5 mean alternative, large-scale COVID-19 screening tools are still being sought. SARS-CoV-2 is known to damage lung tissue, and in a distinct way that doctors are now seeking to exploit for new diagnostic approaches. Many COVID-19 patients develop pneumonia, which can progress to respiratory failure and sometimes death.

artificial intelligence, machine learning, pneumonia, (18 more...)

#artificialintelligence

Country:

North America > United States (0.16)
Asia > China (0.09)
Asia > Macao (0.06)
(3 more...)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.91)

Add feedback

Prediction of Cancer Microarray and DNA Methylation Data using Non-negative Matrix Factorization

Patel, Parth, Passi, Kalpdrum, Jain, Chakresh Kumar

arXiv.org Machine LearningJul-15-2020

Over the past few years, there has been a considerable spread of microarray technology in many biological patterns, particularly in those pertaining to cancer diseases like leukemia, prostate, colon cancer, etc. The primary bottleneck that one experiences in the proper understanding of such datasets lies in their dimensionality, and thus for an efficient and effective means of studying the same, a reduction in their dimension to a large extent is deemed necessary. This study is a bid to suggesting different algorithms and approaches for the reduction of dimensionality of such microarray datasets. This study exploits the matrix-like structure of such microarray data and uses a popular technique called Non-Negative Matrix Factorization (NMF) to reduce the dimensionality, primarily in the field of biological data. Classification accuracies are then compared for these algorithms. This technique gives an accuracy of 98%.

artificial intelligence, bioinformatics, machine learning, (13 more...)

arXiv.org Machine Learning

doi: 10.5121/csit.2020.100906

2007.08652

Country:

Asia > India > NCT > Delhi (0.04)
Asia > India > Gujarat (0.04)
North America > Canada > Ontario > Thunder Bay District > Sudbury (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Leukemia (0.51)

Technology:

Information Technology > Biomedical Informatics > Translational Bioinformatics (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.47)

Add feedback

Revisiting Data Complexity Metrics Based on Morphology for Overlap and Imbalance: Snapshot, New Overlap Number of Balls Metrics and Singular Problems Prospect

Pascual-Triana, José Daniel, Charte, David, Arroyo, Marta Andrés, Fernández, Alberto, Herrera, Francisco

arXiv.org Machine LearningJul-15-2020

Data Science and Machine Learning have become fundamental assets for companies and research institutions alike. As one of its fields, supervised classification allows for class prediction of new samples, learning from given training data. However, some properties can cause datasets to be problematic to classify. In order to evaluate a dataset a priori, data complexity metrics have been used extensively. They provide information regarding different intrinsic characteristics of the data, which serve to evaluate classifier compatibility and a course of action that improves performance. However, most complexity metrics focus on just one characteristic of the data, which can be insufficient to properly evaluate the dataset towards the classifiers' performance. In fact, class overlap, a very detrimental feature for the classification process (especially when imbalance among class labels is also present) is hard to assess. This research work focuses on revisiting complexity metrics based on data morphology. In accordance to their nature, the premise is that they provide both good estimates for class overlap, and great correlations with the classification performance. For that purpose, a novel family of metrics have been developed. Being based on ball coverage by classes, they are named after Overlap Number of Balls. Finally, some prospects for the adaptation of the former family of metrics to singular (more complex) problems are discussed.

artificial intelligence, fuzzy logic, machine learning, (18 more...)

arXiv.org Machine Learning

2007.07935

Country:

South America > Paraguay > Asunción > Asunción (0.04)
North America > United States > Wisconsin (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre:

Research Report (0.82)
Instructional Material > Course Syllabus & Notes (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.67)

Add feedback

Traceable raises $20 million for AI system that shields cloud app APIs from cyberattacks

#artificialintelligenceJul-14-2020, 13:05:20 GMT

Traceable, a startup developing an end-to-end cloud app security solution, today emerged from stealth with $20 million in venture equity financing. Newly flush with capital, CEO Jyoti Bansal intends to focus on acquiring customers globally while growing Traceable's team and accelerating R&D. Cloud-native apps are often built with hundreds or even thousands of API microservices (i.e., loosely coupled services), making them difficult to protect at scale. Gartner predicts that by 2022, API abuses will be the most frequent attack vector, which isn't surprising considering API calls represented 83% of web traffic as of 2018. Traceable ostensibly protects these APIs with machine learning algorithms that analyze app activity from the user and the session all the way down to the code.

artificial intelligence, machine learning, traceable, (15 more...)

#artificialintelligence

Country: North America > United States > Colorado > Denver County > Denver (0.05)

Industry:

Information Technology > Security & Privacy (0.67)
Government > Military > Cyberwarfare (0.52)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.35)

Add feedback