AITopics

1810.05526

Country: Europe > Netherlands (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)

Steorts, Rebecca C., Shrivastava, Anshumali

Probabilistic Blocking with An Application to the Syrian Conflict

arXiv.org Machine LearningOct-10-2018

Entity resolution seeks to merge databases as to remove duplicate entries where unique identifiers are typically unknown. We review modern blocking approaches for entity resolution, focusing on those based upon locality sensitive hashing (LSH). First, we introduce $k$-means locality sensitive hashing (KLSH), which is based upon the information retrieval literature and clusters similar records into blocks using a vector-space representation and projections. Second, we introduce a subquadratic variant of LSH to the literature, known as Densified One Permutation Hashing (DOPH). Third, we propose a weighted variant of DOPH. We illustrate each method on an application to a subset of the ongoing Syrian conflict, giving a discussion of each method.

data mining, information retrieval, machine learning, (18 more...)

1810.05497

Country: Asia > Middle East > Syria (0.87)

Genre:

Overview (0.66)
Research Report (0.40)

Industry:

Government > Regional Government > Asia Government > Middle East Government > Syria Government (0.63)
Government > Military (0.63)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.77)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Slawinski, Michael A., Wortman, Andy

Applications of PageRank to Function Comparison and Malware Classification

arXiv.org Artificial IntelligenceOct-10-2018

We classify .NET files as either benign or malicious by examining certain directed graphs extracted from the files via decompilation. Each graph is viewed probabilistically as a Markov chain where each node heuristically represents the possible state of the running file, and by computing the PageRank vector (Perron vector with transport) we can assign a probability measure over the nodes of the given graph. We train a random forest with features derived from computing Lebesgue antiderivatives of functions defined over the vertex sets of the graphs listed above against the PageRank measure. The model was trained on 2.5 million samples of .NET and has an accuracy of 98.3\% on test data. The median time needed for decompilation and scoring was 24ms.

artificial intelligence, information management, machine learning, (16 more...)

arXiv.org Artificial Intelligence

1810.04789

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (0.51)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.35)

#artificialintelligenceOct-9-2018, 19:30:27 GMT

All about Naive Bayes – Towards Data Science

Naive Bayes is the most simple algorithm that you can apply to your data. As the name suggests, here this algorithm makes an assumption as all the variables in the dataset is "Naive" i.e not correlated to each other. Naive Bayes is a very popular classification algorithm that is mostly used to get the base accuracy of the dataset. Let's assume that you are walking on the playground. Now you see some red object in front of you.

artificial intelligence, data science, machine learning, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.87)

Bello, Ghalib A., Dawes, Timothy J. W., Duan, Jinming, Biffi, Carlo, de Marvao, Antonio, Howard, Luke S. G. E., Gibbs, J. Simon R., Wilkins, Martin R., Cook, Stuart A., Rueckert, Daniel, O'Regan, Declan P.

Deep learning cardiac motion analysis for human survival prediction

arXiv.org Machine LearningOct-8-2018

Motion analysis is used in computer vision to understand the behaviour of moving objects in sequences of images. Optimising the interpretation of dynamic biological systems requires accurate and precise motion tracking as well as efficient representations of high-dimensional motion trajectories so that these can be used for prediction tasks. Here we use image sequences of the heart, acquired using cardiac magnetic resonance imaging, to create time-resolved three-dimensional segmentations using a fully convolutional network trained on anatomical shape priors. This dense motion model formed the input to a supervised denoising autoencoder (4Dsurvival), which is a hybrid network consisting of an autoencoder that learns a task-specific latent code representation trained on observed outcome data, yielding a latent representation optimised for survival prediction. To handle right-censored survival outcomes, our network used a Cox partial likelihood loss function. In a study of 302 patients the predictive accuracy (quantified by Harrell's C-index) was significantly higher (p < .0001) for our model C=0.73 (95$\%$ CI: 0.68 - 0.78) than the human benchmark of C=0.59 (95$\%$ CI: 0.53 - 0.65). This work demonstrates how a complex computer vision task using high-dimensional medical image data can efficiently predict human survival.

artificial intelligence, machine learning, representation, (16 more...)

1810.03382

Country:

North America (0.28)
Europe > United Kingdom (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Liu, Luoluo, Chin, Sang Peter, Tran, Trac D.

JOBS: Joint-Sparse Optimization from Bootstrap Samples

arXiv.org Machine LearningOct-8-2018

Classical signal recovery based on $\ell_1$ minimization solves the least squares problem with all available measurements via sparsity-promoting regularization. In practice, it is often the case that not all measurements are available or required for recovery. Measurements might be corrupted/missing or they arrive sequentially in streaming fashion. In this paper, we propose a global sparse recovery strategy based on subsets of measurements, named JOBS, in which multiple measurements vectors are generated from the original pool of measurements via bootstrapping, and then a joint-sparse constraint is enforced to ensure support consistency among multiple predictors. The final estimate is obtained by averaging over the $K$ predictors. The performance limits associated with different choices of number of bootstrap samples $L$ and number of estimates $K$ is analyzed theoretically. Simulation results validate some of the theoretical analysis, and show that the proposed method yields state-of-the-art recovery performance, outperforming $\ell_1$ minimization and a few other existing bootstrap-based techniques in the challenging case of low levels of measurements and is preferable over other bagging-based methods in the streaming setting since it performs better with small $K$ and $L$ for data-sets with large sizes.

artificial intelligence, machine learning, minimization, (18 more...)

1810.03743

Country:

North America > United States (0.46)
Europe > Switzerland (0.28)

Genre: Research Report > New Finding (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.34)

arXiv.org Artificial IntelligenceOct-8-2018

A Unified Dynamic Approach to Sparse Model Selection

Huang, Chendi, Yao, Yuan

Sparse model selection is ubiquitous from linear regression to graphical models where regularization paths, as a family of estimators upon the regularization parameter varying, are computed when the regularization parameter is unknown or decided data-adaptively. Traditional computational methods rely on solving a set of optimization problems where the regularization parameters are fixed on a grid that might be inefficient. In this paper, we introduce a simple iterative regularization path, which follows the dynamics of a sparse Mirror Descent algorithm or a generalization of Linearized Bregman Iterations with nonlinear loss. Its performance is competitive to \texttt{glmnet} with a further bias reduction. A path consistency theory is presented that under the Restricted Strong Convexity (RSC) and the Irrepresentable Condition (IRR), the path will first evolve in a subspace with no false positives and reach an estimator that is sign-consistent or of minimax optimal $\ell_2$ error rate. Early stopping regularization is required to prevent overfitting. Application examples are given in sparse logistic regression and Ising models for NIPS coauthorship.

artificial intelligence, exp, machine learning, (17 more...)

arXiv.org Artificial Intelligence

1810.03608

Country:

Asia > China (0.28)
North America > United States (0.28)

Genre: Research Report (0.91)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.54)

#artificialintelligenceOct-6-2018, 19:37:24 GMT

How Alexa Is Learning to Converse More Naturally : Alexa Blogs

To handle more-natural spoken interactions, Alexa must track references through several rounds of conversation. If, for instance, a customer says, "How far is it to Redmond?" and after the answer follows up by saying, "Find good Indian restaurants there", Alexa should be able to infer that "there" refers to Redmond. We call the task of reference tracking "context carryover," and it's a capability that is currently being phased in to the Alexa experience. At this year's Interspeech, the largest conference on spoken-language understanding, my colleagues and I will present a paper titled "Contextual Slot Carryover for Disparate Schemas," which describes our solution to the problem of slot carryover, a crucial aspect of context carryover. "Domain" describes the type of application -- or "skill" -- that the utterance should invoke; for instance, mapping skills should answer questions about geographic distance.

artificial intelligence, machine learning, utterance, (14 more...)

Industry:

Retail > Online (0.40)
Consumer Products & Services > Restaurants (0.36)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.38)

#artificialintelligenceOct-5-2018, 17:28:23 GMT

Stanford AI detects even the smallest earthquakes from seismic data

Microearthquakes -- low-intensity earthquakes that register 2.0 or less magnitude on the moment magnitude scale -- rarely cause property damage. And as a result of background noise, small events, and false positives, they're not always picked up by seismic monitoring systems. A possible solution is described in a new paper from the Department of Geophysics at Stanford University, where scientists have developed an AI system -- dubbed Cnn-Rnn Earthquake Detector, or CRED -- that can isolate and identify a range of seismic signals from historical and continuous data. It builds on the work of Harvard and Google, which in August created an AI model capable of predicting the location of aftershocks up to one year after a major earthquake. The researchers' system consists of neural network layers -- interconnected processing nodes that loosely mimic the function of neurons in the brain -- of two types: convolutional neural networks and recurrent neural networks.

deep learning, earthquake, upstream oil & gas, (12 more...)

Country: North America > United States (0.17)

Industry: Energy > Oil & Gas > Upstream (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.57)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.54)

#artificialintelligenceOct-5-2018, 05:16:36 GMT

Fighting breast cancer with AI early detection Hack and Craft

Breast cancer awareness month is here and, with it, the latest statistics send a stark reminder of just how important early detection is in combating this brutal disease. With revolutionary strides forward in Artificial Intelligence (AI) all that looks set to change for the better. One of the leading causes of death for cancer patients is a late diagnosis, too often brought about by inferior testing facilities, human factors, such as fatigue and loss of concentration, or by the patients themselves, who put off seeing a specialist due to the fear of what they might discover. But now, thanks to nothing short of revolutionary strides forward in Artificial Intelligence (AI) all that looks set to change for the better. AI is capable of advanced learning using large complex datasets and has the potential to perform tasks such as image interpretation.

artificial intelligence, machine learning, neural network, (10 more...)

Country: Europe > Switzerland (0.16)

Industry: Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.76)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.50)