Results


Optimization tips and tricks on Azure SQL Server for Machine Learning Services

#artificialintelligence

By using memory-optimized tables, the resume features are stored in main memory, and disk I/O can be significantly reduced. If the database engine detects more than 8 physical cores per NUMA node or socket, it will automatically create soft-NUMA nodes that ideally contain 8 cores. We then created 4 SQL resource pools and 4 external resource pools [7] to specify CPU affinity so that each pair of pools uses the same set of CPUs within a node. We can create resource governance for R Services on SQL Server [8] by routing those scoring batches into different workload groups (Figure.
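As a rough sketch of the resource-pool step (not the article's exact script), the snippet below uses pyodbc to issue the T-SQL that creates one affinitized SQL resource pool, a matching external resource pool for R, and a workload group that routes scoring sessions to both; the pool names, CPU range, and connection string are assumptions.

```python
# Sketch: pair a SQL resource pool and an external resource pool on the same CPUs,
# then route scoring sessions to them through a workload group. Pool names, the
# CPU range, and the connection string are illustrative assumptions.
import pyodbc

conn = pyodbc.connect(
    "DRIVER={ODBC Driver 17 for SQL Server};SERVER=myserver;"
    "DATABASE=master;Trusted_Connection=yes",
    autocommit=True,
)
cur = conn.cursor()

statements = [
    # SQL resource pool affinitized to schedulers 0-7 (one soft-NUMA node).
    "CREATE RESOURCE POOL sql_pool_0 WITH (AFFINITY SCHEDULER = (0 TO 7));",
    # External resource pool for R, pinned to the same physical CPUs.
    "CREATE EXTERNAL RESOURCE POOL ext_pool_0 "
    "WITH (AFFINITY CPU = (0 TO 7), MAX_MEMORY_PERCENT = 25);",
    # Workload group that sends a session's SQL work and its R work to the pair.
    "CREATE WORKLOAD GROUP scoring_group_0 USING sql_pool_0, EXTERNAL ext_pool_0;",
    "ALTER RESOURCE GOVERNOR RECONFIGURE;",
]
for stmt in statements:
    cur.execute(stmt)

# A classifier function (not shown) would then assign each incoming scoring
# batch to scoring_group_0 or to one of the other groups.
```

Repeating this for the remaining CPU ranges yields the 4 SQL and 4 external pools described above.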


How AI will lead to self-healing mobile networks

#artificialintelligence

Today we are routinely awed by the promise of machine learning (ML) and artificial intelligence (AI). Our phones speak to us and our favorite apps can ID our friends and family in our photographs. We didn't get here overnight, of course. Enhancements to the network itself – deep, convolutional neural networks executing advanced computer science techniques – brought us to this point. Now one of the primary beneficiaries of our super-connected world will be the very networks we have come to rely on for information, communication, commerce, and entertainment.


Using Artificial Neural Networks to Predict the Quality and Performance of Oilfield Cements

AITopics Original Links

Inherent batch-to-batch variability, ageing and contamination are major factors contributing to variability in oilfield cement slurry performance. Of particular concern are problems encountered when a slurry is formulated with one cement sample and used with a batch having different properties. Such variability imposes a heavy burden on performance testing and is often a major factor in operational failure. We describe methods which allow the identification, characterisation and prediction of the variability of oilfield cements. Our approach involves predicting cement compositions, particle size distributions and thickening time curves from the diffuse reflectance infrared Fourier transform spectrum of neat cement powders.
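A minimal sketch of the kind of model described, assuming a spectral feature matrix and a single numeric property such as thickening time (the data here is synthetic and the architecture is not the authors'), might look like this:

```python
# Sketch: feed-forward ANN mapping DRIFT spectra of neat cement powders to a
# performance property. Shapes, the synthetic data, and the single "thickening
# time" target are illustrative assumptions, not the paper's dataset or network.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.random((200, 600))   # 200 cement samples x 600 spectral absorbance points
y = rng.random(200) * 300    # thickening time in minutes (synthetic placeholder)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = make_pipeline(
    StandardScaler(),
    MLPRegressor(hidden_layer_sizes=(32,), max_iter=2000, random_state=0),
)
model.fit(X_train, y_train)
print("R^2 on held-out samples:", model.score(X_test, y_test))
```

In practice one network per predicted quantity (composition, particle size distribution, thickening time curve), or a multi-output model, could be trained on measured spectra.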


Deep Learning How I Did It: Merck 1st place interview

AITopics Original Links

What was your background prior to entering this challenge? We are a team of computer science and statistics academics. Ruslan Salakhutdinov and Geoff Hinton are professors at the University of Toronto. George Dahl and Navdeep Jaitly are Ph.D. students working with Professor Hinton. Christopher "Gomez" Jordan-Squire is in the mathematics Ph.D. program at the University of Washington, studying (constrained) optimization applied to statistics and machine learning.


Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features

#artificialintelligence

Objective: Social media is becoming increasingly popular as a platform for sharing personal health-related information. This information can be utilized for public health monitoring tasks, particularly for pharmacovigilance, via the use of natural language processing (NLP) techniques. However, the language in social media is highly informal, and user-expressed medical concepts are often nontechnical, descriptive, and challenging to extract. There has been limited progress in addressing these challenges, and thus far, advanced machine learning-based NLP techniques have been underutilized. Our objective is to design a machine learning-based approach to extract mentions of adverse drug reactions (ADRs) from highly informal text in social media.
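A minimal sketch of the embedding-cluster feature idea, assuming toy word vectors and a made-up post (the paper's embeddings would come from a large health-related social-media corpus), is shown below; the feature dictionaries would then be paired with BIO labels and fed to a sequence labeler such as a CRF.

```python
# Sketch: k-means clusters over word embeddings supply a cluster-id feature per
# token for an ADR sequence labeler. The toy vocabulary, vectors, and example
# post are illustrative assumptions.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
vocab = ["seroquel", "made", "me", "so", "drowsy", "today"]
embeddings = {w: rng.random(50) for w in vocab}   # stand-in for pretrained vectors

kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(
    np.stack([embeddings[w] for w in vocab]))
cluster_of = dict(zip(vocab, kmeans.labels_))

def token_features(tokens, i):
    """Features for token i: surface form plus embedding-cluster ids for context."""
    w = tokens[i].lower()
    return {
        "word.lower": w,
        "emb.cluster": str(cluster_of.get(w, -1)),
        "prev.cluster": str(cluster_of.get(tokens[i - 1].lower(), -1)) if i > 0 else "BOS",
    }

post = ["Seroquel", "made", "me", "so", "drowsy", "today"]
X = [token_features(post, i) for i in range(len(post))]
# X would be paired with BIO tags such as ["O", "O", "O", "O", "B-ADR", "O"]
# and passed to a CRF or similar sequence labeler.
```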


Study identifies 10 Key Trends in AI Market Development - Which-50

#artificialintelligence

The report's authors say the majority of use cases they studied take existing processes like predictive maintenance, anomaly detection, algorithmic trading, customer service, search engine queries, or cybersecurity threat detection and apply machine learning or other AI techniques that can adapt rules and provide better results than previous static-rule techniques. "This will be a fundamental shift, with AI moving from being an automation engine that improves business processes to becoming a decision engine, providing real-time predictions and decision scenarios for companies that can choose the right strategy and outperform competition," say the authors. "Successful AI leaders will be those who are good at technology implementation as a starting point, but more importantly can build trust and an effective working relationship between AI systems and the human decision makers." In its report on the whitepaper, business analytics web site Statista notes: "With expected cumulative revenue of just over 8 billion U.S. dollars, 'static image recognition, classification and tagging' is forecast to lead the way, ahead of 'algorithmic trading strategy performance improvement' ($7.5 billion) and 'efficient, scalable processing of patient data' ($7.4 billion)."


Why Machines Still Can't Learn So Good

#artificialintelligence

"Most of the talk about machine learning is a bunch of hokum," said Douglas Greenig, a Ph.D. in mathematics who started quantitative hedge fund Florin Court Capital. Man AHL's scientists and engineers developed and tested the technology on historical data, often expecting it to flop, and only overcame doubts as the strategy produced solid returns in trials. It has returned an annualized 7.1 percent in three years through September, compared with a 3.2 percent gain for the average hedge fund, and an 11 percent rise for the S&P 500 Index. "It's not about applying machine learning techniques to historical data and being satisfied if we rediscover what we know already," Ledford said.


Laplace noising versus simulated out of sample methods (cross frames)

#artificialintelligence

Please read on for my discussion of some of the limitations of the technique, how we solve the problem for impact coding (also called "effects codes"), and a worked example in R. We define a nested model as any model where the results of a sub-model are used as inputs for a later model. And I now think such a theorem would actually have a fairly unsatisfying statement, as one possible "bad real-world data" situation violates the usual "no re-use" requirements of differential privacy: duplicated or related columns or variables break the Laplace noising technique. But library code needs to work in the limit (as you don't know ahead of time what users will throw at it), and there are a lot of mechanisms that do produce duplicate, near-duplicate, and related columns in data sources used for data science (one of the differences between data science and classical statistics is that data science tends to apply machine learning techniques to very under-curated data sets). The results on our artificial "each column five times" data set are below: notice that the Laplace noising technique's test performance is significantly degraded (performance on a held-out test set usually being a better simulation of future model performance than performance on the training set).
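For readers who have not seen it, here is a minimal sketch of the Laplace noising technique under discussion (the alternative being critiqued, not the cross-frame solution), in Python rather than R; the column names, noise scale, and toy data are assumptions.

```python
# Sketch: Laplace-noised impact coding of a categorical variable. The noise scale
# 1/(epsilon * count) and the toy data are illustrative assumptions.
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
df = pd.DataFrame({
    "zip": rng.choice(["98101", "10001", "60601"], size=1000),
    "y": rng.normal(size=1000),
})

epsilon = 1.0                      # noise/privacy parameter
global_mean = df["y"].mean()

def laplace_impact_code(train, col, target, eps):
    """Per-level deviation of the target from the global mean, plus Laplace noise."""
    stats = train.groupby(col)[target].agg(["mean", "count"])
    noise = rng.laplace(scale=1.0 / (eps * stats["count"]), size=len(stats))
    return (stats["mean"] - global_mean + noise).to_dict()

codes = laplace_impact_code(df, "zip", "y", epsilon)
df["zip_impact"] = df["zip"].map(codes)
# Duplicated or strongly related columns each receive independently noised codes,
# so a downstream model can effectively average the noise away across the copies:
# the failure mode described above.
```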


Machine Learning Already Changing the Entertainment Industry - Futurum

#artificialintelligence

The revenue forecast for enterprise AI varies depending on the source, but it is estimated at about $300-$350 million in 2016 and predicted to reach upwards of $30 billion by 2025. The technologies included in this focus are cognitive computing, natural language processing, image recognition, speech recognition, predictive APIs, deep learning, and machine learning. The process of creating a trailer for the new horror movie "Morgan" involved using machine learning techniques and experimental APIs through IBM's Watson platform. The software uses natural language processing (NLP) to analyze thousands of movie plot summaries correlated with box office performance. Machine learning is also helping entertainment providers recommend personalized content based on the user's previous viewing activity and behavior.
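IBM has not published the Watson trailer pipeline, but the general idea of relating plot-summary text to box-office numbers can be sketched as a simple text regression; the summaries, revenue figures, and model choice below are assumptions for illustration only.

```python
# Sketch: TF-IDF features over plot summaries fed to a linear regressor against
# box-office revenue. The data and the Ridge model are illustrative assumptions.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline

summaries = [
    "A scientist creates an artificial being that escapes the lab.",
    "Two friends road-trip across the country and learn about themselves.",
    "A haunted house terrorizes a young family in a remote town.",
]
box_office_musd = [95.0, 40.0, 120.0]   # synthetic revenue figures, in $M

model = make_pipeline(TfidfVectorizer(stop_words="english"), Ridge(alpha=1.0))
model.fit(summaries, box_office_musd)

print(model.predict(["An experimental AI turns on its creators in an isolated lab."]))
```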


Personalization advancement through machine learning

#artificialintelligence

Domains such as education, publishing, entertainment, and advertising mostly deal with granular digital assets (text, images, audio, video, multimedia, and so on) and are better prepared to enhance personalization even without creating new content from scratch. Computer vision: techniques that leverage neural network-based deep learning can detect the type and intensity of consumer emotion from captured images or video frames. Text analytics: technologies that successfully synthesize text into raw data allow the information to be personalized when presented in different contexts. Apart from using it in computer vision, scientists are exploring deep learning to address large-scale text mining and speech recognition problems.
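As a minimal sketch of the computer-vision piece, assuming 48x48 grayscale face crops and seven emotion classes (the architecture and input format are not from the article), a small convolutional classifier might look like this:

```python
# Sketch: small CNN classifying consumer emotion from face crops; the predicted
# class probability can serve as a rough intensity proxy for personalization.
# Input size, architecture, and the seven classes are illustrative assumptions.
import tensorflow as tf
from tensorflow.keras import layers

NUM_EMOTIONS = 7  # e.g., angry, disgust, fear, happy, sad, surprise, neutral

model = tf.keras.Sequential([
    layers.Input(shape=(48, 48, 1)),
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(NUM_EMOTIONS, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(face_frames, emotion_labels, epochs=10) would be run on a labeled
# dataset of frames; the predicted class and its probability then feed the
# personalization logic described above.
```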