Collaborating Authors

 Dunn, Alexander


Extracting Structured Seed-Mediated Gold Nanorod Growth Procedures from Literature with GPT-3

arXiv.org Artificial Intelligence

Abstract--Although gold nanorods have been the subject of much research, the pathways for controlling their shape and thereby their optical properties remain largely heuristically understood. Although it is apparent that the simultaneous presence of and interaction between various reagents during synthesis control these properties, computational and experimental approaches for exploring the synthesis space can be either intractable or too time-consuming in practice. This motivates an alternative approach leveraging the wealth of synthesis information already embedded in the body of scientific literature by developing tools to extract relevant structured data in an automated, high-throughput manner. To that end, we present an approach using the powerful GPT-3 language model to extract structured multi-step seed-mediated growth procedures and outcomes for gold nanorods from unstructured scientific text. GPT-3 prompt completions are fine-tuned to predict synthesis templates in the form of JSON documents from unstructured text input with an overall accuracy of 86%. The performance is notable, considering the model is performing simultaneous entity recognition and relation extraction. We present a dataset of 11,644 entities extracted from 1,137 papers, resulting in 268 papers with at least one complete seed-mediated gold nanorod growth procedure and outcome, for a total of 332 complete procedures.

In the last three decades, chemists have developed the ability to synthesize anisotropic metal nanoparticles in a controllable and reproducible manner, with applications in semiconductor technology,[11, 12] biomedicine,[13, 14] and cosmetics.[15] The suitability of a nanoparticle for a particular application depends on its morphology and size, which correspond to different plasmonic properties.[16,
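The "synthesis template" the fine-tuned model predicts can be pictured as a JSON document that pairs recognized entities (reagents, concentrations) with their relations (which solution and step they belong to, and the resulting morphology). A minimal sketch in Python; the field names and values here are illustrative assumptions, not the paper's actual schema:

```python
import json

# Hypothetical structured synthesis template of the kind a fine-tuned
# model might emit for one seed-mediated growth procedure. All field
# names and reagent values are invented for illustration.
template = {
    "seed_solution": {
        "reagents": [{"name": "HAuCl4", "concentration": "0.25 mM"}],
    },
    "growth_solution": {
        "reagents": [
            {"name": "CTAB", "concentration": "0.1 M"},
            {"name": "AgNO3", "concentration": "4 mM"},
        ],
    },
    "outcome": {"morphology": "nanorod", "aspect_ratio": 3.5},
}

# Serializing gives the completion text the model is trained to produce;
# parsing it back recovers the same structured record, which is what
# makes the output machine-queryable at scale.
completion = json.dumps(template)
parsed = json.loads(completion)
print(parsed["outcome"]["morphology"])  # prints: nanorod
```

Because the completion is valid JSON, entity recognition and relation extraction are answered simultaneously: each entity appears once, already attached to its role in the procedure.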


Structured information extraction from complex scientific text with fine-tuned large language models

arXiv.org Artificial Intelligence

Large language models (LLMs) such as GPT-3 [12], PaLM [25], Megatron [26], OPT [27], Gopher [28], and FLAN [29] have been shown to have a remarkable ability to leverage semantic information between tokens in natural language sequences of varying length. They are particularly adept at sequence-to-sequence (seq2seq) tasks, where a text input is used to seed a text response from the model. This completion can be formatted as either English sentences or a more structured schema, such as a list of JSON documents. To use this method, one only has to define the desired output structure--for example, a list of JSON objects with a predefined set of keys--and annotate 100-500 text passages using this format. GPT-3 is then fine-tuned on these examples, and the resulting model is able to accurately extract the desired information from text and output it in the same structured representation, as shown in Figure 1. In this paper
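The annotation-then-fine-tune workflow described above amounts to building prompt/completion pairs: each annotated passage becomes a prompt, and its JSON annotation becomes the target completion, typically stored one record per line (JSONL). A minimal sketch; the separator string and record layout are assumptions for illustration, not the papers' exact recipe:

```python
import json

def make_finetune_records(annotated, separator="\n\nJSON:\n"):
    """Turn (passage, structured_annotation) pairs into prompt/completion
    records suitable for fine-tuning a completion-style model.
    The separator marking the end of the prompt is an assumed convention."""
    records = []
    for passage, annotation in annotated:
        records.append({
            "prompt": passage + separator,
            # The completion is the annotation serialized as JSON text.
            "completion": json.dumps(annotation),
        })
    return records

# One toy annotated passage; the entities and keys are invented.
annotated = [(
    "Gold nanorods were grown by adding 0.1 M CTAB to the seed solution.",
    [{"reagent": "CTAB", "concentration": "0.1 M", "step": "growth"}],
)]

records = make_finetune_records(annotated)
# JSONL: one serialized record per line, the usual fine-tuning file format.
jsonl = "\n".join(json.dumps(r) for r in records)
```

After fine-tuning on a few hundred such records, inference is just sending a new passage plus the same separator and parsing the returned completion with `json.loads`.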