The relentless increase in computing power and the accumulation of big data over the years has sparked intense interest in machine learning and its associated techniques. The new SAS Visual Data Mining and Machine Learning software will feed this need for smarter analytics. Advanced analytics offer insight to businesses, but machine learning and deep learning algorithms take it deeper, revealing insights that were previously out of reach. For example, machine learning use can include facial recognition in security systems, speech recognition in customer service applications, accurate product recommendations in e-commerce, self-driving cars and medical diagnostics. "SAS Visual Data Mining and Machine Learning shatters barriers related to data volume and variety, limited analytical depth and computational bottlenecks.
Berkeley Lab researchers Vahe Tshitoyan, Anubhav Jain, Leigh Weston, and John Dagdelen used machine learning to analyze 3.3 million abstracts from materials science papers. Sure, computers can be used to play grandmaster-level chess, but can they make scientific discoveries? Researchers at the U.S. Department of Energy's Lawrence Berkeley National Laboratory have shown that an algorithm with no training in materials science can scan the text of millions of papers and uncover new scientific knowledge. A team led by Anubhav Jain, a scientist in Berkeley Lab's Energy Storage & Distributed Resources Division, collected 3.3 million abstracts of published materials science papers and fed them into an algorithm called Word2vec. By analyzing relationships between words the algorithm was able to predict discoveries of new thermoelectric materials years in advance and suggest as-yet unknown materials as candidates for thermoelectric materials.
The successful candidates will join the Data Mining & Machine Learning Group and contribute to a new research project, ROCSAFE (see below) funded by the European Union's Horizon 2020 Programme. The research is likely to involve one of: (1) advances in temporal Bayesian reasoning for decision support; (2) routing of autonomous vehicles for optimal collection of multi-resolution image and sensor data; (3) context-aware decision support driven by sensor data analytics.