Goto

Collaborating Authors

 Performance Analysis


Artificial intelligence software can spot child sexual abuse media online Latest News & Updates at Daily News & Analysis

#artificialintelligence

Artificial intelligence software can now help cops spot new or previously unknown child sexual abuse media and prosecute offenders. The toolkit, described in a paper published in Digital Investigation, automatically detects new child sexual abuse photos and videos in online peer-to-peer networks. The new approach combines automatic filename and media analysis techniques in an intelligent filtering module, which can identify new criminal media and distinguish it from other media being shared, such as adult pornography. Spotting newly produced media online can give law enforcement agencies the fresh evidence they need to find and prosecute offenders. "Identifying new child sexual abuse media is critical because it can indicate recent or ongoing child abuse," said lead study author Claudia Peersman from Lancaster University.


Predicting Flights Delay Using Supervised Learning, Logistic Regression

@machinelearnbot

In this post, we'll use a supervised machine learning technique called logistic regression to predict delayed flights. But before we proceed, I like to give condolences to the family of the the victims of the Germanwings tragedy.


Machines are learning to find concealed weapons in X-ray scans

#artificialintelligence

EVERY day more than 8,000 containers flow through the Port of Rotterdam. But only a fraction are selected to pass through a giant x-ray machine to check for illicit contents. The machine, made by Rapiscan, an American firm, can capture images as the containers move along a track at 15kph (9.3mph). But it takes time for a human to inspect each scan for anything suspicious--and in particular for small metallic objects that might be weapons. To increase this inspection rate would require a small army of people.


How to evaluate Data Science models ?

@machinelearnbot

Lift Charts & Gain Charts: These are widely used in campaign targeting problems, to determine which decile can we target customers for a specific campaign. Also, it tells you how much response you can expect from the new target base. ROC Curve: The ROC curve is the plot between false positive rate and True Positive rate. Gini coefficient: This is the ratio of area between the ROC curve and the diagonal line & the area of the above triangle Cross Validation: splitting the data into two parts, where one part is used for "training" your model, and the second part is used to make predictions. By this you can test the model on the data that was "not seen" by it previously, and check how it could possibly behave with external data.


How Should a Society Be?

#artificialintelligence

My academic background is in computer science and philosophy. My work has been about the relationship between those two fields. What do we learn about being human by thinking about the quest to create artificial intelligence? What do we learn about human decision making by thinking of human problems in computational terms? The questions that have interested me over the years have been, on the one hand, what defines human intelligence at a species level? And secondly, at an individual level, how do we approach decision making in our own lives, and what are the problems that the world throws at us? I find myself interested at the group level, the society level, and the civic level in a couple of different ways. I've been encouraged by what I've seen over the last few years in terms of the norms of the sciences changing. It used to be that people were scared to publish their models because that was the secret sauce; that was their advantage over other research groups.


A novel multiclassSVM based framework to classify lithology from well logs: a real-world application

arXiv.org Machine Learning

Support vector machines (SVMs) have been recognized as a potential tool for supervised classification analyses in different domains of research. In essence, SVM is a binary classifier. Therefore, in case of a multiclass problem, the problem is divided into a series of binary problems which are solved by binary classifiers, and finally the classification results are combined following either the one-against-one or one-against-all strategies. In this paper, an attempt has been made to classify lithology using a multiclass SVM based framework using well logs as predictor variables. Here, the lithology is classified into four classes such as sand, shaly sand, sandy shale and shale based on the relative values of sand and shale fractions as suggested by an expert geologist. The available dataset consisting well logs (gamma ray, neutron porosity, density, and P-sonic) and class information from four closely spaced wells from an onshore hydrocarbon field is divided into training and testing sets. We have used one-against-all strategy to combine the results of multiple binary classifiers. The reported results established the superiority of multiclass SVM compared to other classifiers in terms of classification accuracy. The selection of kernel function and associated parameters has also been investigated here. It can be envisaged from the results achieved in this study that the proposed framework based on multiclass SVM can further be used to solve classification problems. In future research endeavor, seismic attributes can be introduced in the framework to classify the lithology throughout a study area from seismic inputs.


A One class Classifier based Framework using SVDD : Application to an Imbalanced Geological Dataset

arXiv.org Machine Learning

Evaluation of hydrocarbon reservoir requires classification of petrophysical properties from available dataset. However, characterization of reservoir attributes is difficult due to the nonlinear and heterogeneous nature of the subsurface physical properties. In this context, present study proposes a generalized one class classification framework based on Support Vector Data Description (SVDD) to classify a reservoir characteristic water saturation into two classes (Class high and Class low) from four logs namely gamma ray, neutron porosity, bulk density, and P sonic using an imbalanced dataset. A comparison is carried out among proposed framework and different supervised classification algorithms in terms of g metric means and execution time. Experimental results show that proposed framework has outperformed other classifiers in terms of these performance evaluators. It is envisaged that the classification analysis performed in this study will be useful in further reservoir modeling.


How Do You Keep People Safe When Peanut Butter Looks The Same As Explosives To An X-Ray?

Forbes - Tech

This has not been shown in public. And no, it does not make coffee. Deep in the heart of the 17,000 attendees at Slush – the insanely busy, loud and dark conference in Helsinki – I found Karsa.fi a small stand with a big claim; they keep bombs off planes. Plane travel – while safer than it has ever been is still a concern for those in the planes as those on the ground. Terrorism – or rather fear of it – is at record levels (despite figures that suggest a decline in major incidents) so Karsa.fi's


Machine Learning for Threat Analytics: A Boost or a Bust?

#artificialintelligence

Trying to discern drug smugglers passing through customs presents exactly the same problem as trying to discern security threats passing through our networks. Machine learning has been applied to both with varying degrees of success, but ultimately the technology reaches the same limitations. Machine learning has two basic elements: feature vectors and classification exemplars -- the data that is gathered and the corresponding classification examples. In the case of drug smugglers, we might observe number of travelers, point of origin, point of destination, number of bags, length of stay and weight of the bags. We might also flag any traveler or pair of travelers with two or more bags whose combined weight is greater than 150 pounds, whose stay is less than a week and who originated from a climate conducive to poppies.


Implementing your own k-nearest neighbour algorithm using Python

#artificialintelligence

In machine learning, you may often wish to build predictors that allows to classify things into categories based on some set of associated values. For example, it is possible to provide a diagnosis to a patient based on data from previous patients. Many algorithms have been developed for automated classification, and common ones include random forests, support vector machines, Naïve Bayes classifiers, and many types of neural networks. To get a feel for how classification works, we take a simple example of a classification algorithm – k-Nearest Neighbours (kNN) – and build it from scratch in Python 2. You can use a mostly imperative style of coding, rather than a declarative/functional one with lambda functions and list comprehensions to keep things simple if you are starting with Python. Here, we will provide an introduction to the latter approach.