Unsupervised word embeddings capture latent knowledge from materials science literature

#artificialintelligence 

Over the last 15 years there has been a surge in the use of machine learning to gain materials chemistry insights. These methods use existing data (largely computed with ab-initio methods) to train statistical models that can make useful predictions about whether chemical compounds will be stable, and the properties they are likely to exhibit. However, a large majority of the knowledge the scientific community has generated to date is recorded as "unstructured" text, and has therefore been largely inaccessible to machine-learning and statistical analysis. In recent years however, the Natural Language Processing (NLP) research community has made great progress on methods to computationally parse and learn from unstructured text. In our paper, we show how the application of an unsupervised NLP model can capture information from the materials chemistry literature in a way that also uncovers latent knowledge previously unknown to the research community.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found