AITopics | Szolovits, Peter

Collaborating Authors

Szolovits, Peter

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Transfer Learning for Named-Entity Recognition with Neural Networks

Lee, Ji Young, Dernoncourt, Franck, Szolovits, Peter

arXiv.org Machine LearningMay-17-2017

Recent approaches based on artificial neural networks (ANNs) have shown promising results for named-entity recognition (NER). In order to achieve high performances, ANNs need to be trained on a large labeled dataset. However, labels might be difficult to obtain for the dataset on which the user wants to perform NER: label scarcity is particularly pronounced for patient note de-identification, which is an instance of NER. In this work, we analyze to what extent transfer learning may address this issue. In particular, we demonstrate that transferring an ANN model trained on a large labeled dataset to another dataset with a limited number of labels improves upon the state-of-the-art results on two different datasets for patient note de-identification.

dataset, deep learning, neural network, (20 more...)

arXiv.org Machine Learning

1705.06273

Country: North America > United States (0.70)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Health Care Technology > Medical Record (0.49)
Health & Medicine > Therapeutic Area (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.79)

Add feedback

NeuroNER: an easy-to-use program for named-entity recognition based on neural networks

Dernoncourt, Franck, Lee, Ji Young, Szolovits, Peter

arXiv.org Machine LearningMay-15-2017

Named-entity recognition (NER) aims at identifying entities of interest in a text. Artificial neural networks (ANNs) have recently been shown to outperform existing NER systems. However, ANNs remain challenging to use for non-expert users. In this paper, we present NeuroNER, an easy-to-use named-entity recognition tool based on ANNs. Users can annotate entities using a graphical web-based user interface (BRAT): the annotations are then used to train an ANN, which in turn predict entities' locations and categories in new texts. NeuroNER makes this annotation-training-prediction flow smooth and accessible to anyone.

deep learning, neural network, neuroner, (21 more...)

arXiv.org Machine Learning

1705.05487

Country:

Asia (0.29)
North America > United States > Michigan (0.14)

Genre: Research Report (0.50)

Industry: Health & Medicine (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

MIT at SemEval-2017 Task 10: Relation Extraction with Convolutional Neural Networks

Lee, Ji Young, Dernoncourt, Franck, Szolovits, Peter

arXiv.org Machine LearningApr-5-2017

Over 50 million scholarly articles have been published: they constitute a unique repository of knowledge. In particular, one may infer from them relations between scientific concepts, such as synonyms and hyponyms. Artificial neural networks have been recently explored for relation extraction. In this work, we continue this line of work and present a system based on a convolutional neural network to extract relations. Our model ranked first in the SemEval-2017 task 10 (ScienceIE) for relation extraction in scientific articles (subtask C).

deep learning, neural network, relation, (20 more...)

arXiv.org Machine Learning

1704.01523

Country: North America > Canada (0.28)

Genre: Research Report (0.82)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

Add feedback

Neural Networks for Joint Sentence Classification in Medical Paper Abstracts

Dernoncourt, Franck, Lee, Ji Young, Szolovits, Peter

arXiv.org Machine LearningDec-15-2016

Existing models based on artificial neural networks (ANNs) for sentence classification often do not incorporate the context in which sentences appear, and classify sentences individually. However, traditional sentence classification approaches have been shown to greatly benefit from jointly classifying subsequent sentences, such as with conditional random fields. In this work, we present an ANN architecture that combines the effectiveness of typical ANN models to classify sentences in isolation, with the strength of structured prediction. Our model achieves state-of-the-art results on two different datasets for sequential sentence classification in medical abstracts.

classification, deep learning, neural network, (17 more...)

arXiv.org Machine Learning

1612.05251

Genre: Research Report > Experimental Study (0.46)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

Add feedback

Feature-Augmented Neural Networks for Patient Note De-identification

Lee, Ji Young, Dernoncourt, Franck, Uzuner, Ozlem, Szolovits, Peter

arXiv.org Machine LearningOct-30-2016

Patient notes contain a wealth of information of potentially great interest to medical investigators. However, to protect patients' privacy, Protected Health Information (PHI) must be removed from the patient notes before they can be legally released, a process known as patient note de-identification. The main objective for a de-identification system is to have the highest possible recall. Recently, the first neural-network-based de-identification system has been proposed, yielding state-of-the-art results. Unlike other systems, it does not rely on human-engineered features, which allows it to be quickly deployed, but does not leverage knowledge from human experts or from electronic health records (EHRs). In this work, we explore a method to incorporate human-engineered features as well as features derived from EHRs to a neural-network-based de-identification system. Our results show that the addition of features, especially the EHR-derived features, further improves the state-of-the-art in patient note de-identification, including for some of the most sensitive PHI types such as patient names. Since in a real-life setting patient notes typically come with EHRs, we recommend developers of de-identification systems to leverage the information EHRs contain.

deep learning, neural network, patient note, (19 more...)

arXiv.org Machine Learning

1610.09704

Country: North America > United States (0.95)

Genre: Research Report > New Finding (0.68)

Industry:

Law (1.00)
Health & Medicine > Health Care Technology > Medical Record (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)

Add feedback

De-identification of Patient Notes with Recurrent Neural Networks

Dernoncourt, Franck, Lee, Ji Young, Uzuner, Ozlem, Szolovits, Peter

arXiv.org Machine LearningJun-10-2016

Objective: Patient notes in electronic health records (EHRs) may contain critical information for medical investigations. However, the vast majority of medical investigators can only access de-identified notes, in order to protect the confidentiality of patients. In the United States, the Health Insurance Portability and Accountability Act (HIPAA) defines 18 types of protected health information (PHI) that needs to be removed to de-identify patient notes. Manual de-identification is impractical given the size of EHR databases, the limited number of researchers with access to the non-de-identified notes, and the frequent mistakes of human annotators. A reliable automated de-identification system would consequently be of high value. Materials and Methods: We introduce the first de-identification system based on artificial neural networks (ANNs), which requires no handcrafted features or rules, unlike existing systems. We compare the performance of the system with state-of-the-art systems on two datasets: the i2b2 2014 de-identification challenge dataset, which is the largest publicly available de-identification dataset, and the MIMIC de-identification dataset, which we assembled and is twice as large as the i2b2 2014 dataset. Results: Our ANN model outperforms the state-of-the-art systems. It yields an F1-score of 97.85 on the i2b2 2014 dataset, with a recall 97.38 and a precision of 97.32, and an F1-score of 99.23 on the MIMIC de-identification dataset, with a recall 99.25 and a precision of 99.06. Conclusion: Our findings support the use of ANNs for de-identification of patient notes, as they show better performance than previously published systems while requiring no feature engineering.

dataset, deep learning, neural network, (24 more...)

arXiv.org Machine Learning

1606.03475

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.48)

Industry:

Health & Medicine > Health Care Technology > Medical Record (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Government Relations & Public Policy (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Multivariate Timeseries Modeling Approach to Severity of Illness Assessment and Forecasting in ICU with Sparse, Heterogeneous Clinical Data

Ghassemi, Marzyeh (Massachusetts Institute of Technology) | Pimentel, Marco A.F. (University of Oxford) | Naumann, Tristan (Massachusetts Institute of Technology) | Brennan, Thomas (Massachusetts Institute of Technology) | Clifton, David A. (University of Oxford) | Szolovits, Peter (Massachusetts Institute of Technology) | Feng, Mengling (Massachusetts Institute of Technology)

AAAI ConferencesMar-6-2015

The ability to determine patient acuity (or severity of illness) has immediate practical use for clinicians. We evaluate the use of multivariate timeseries modeling with the multi-task Gaussian process (GP) models using noisy, incomplete, sparse, heterogeneous and unevenly-sampled clinical data, including both physiological signals and clinical notes. The learned multi-task GP (MTGP) hyperparameters are then used to assess and forecast patient acuity. Experiments were conducted with two real clinical data sets acquired from ICU patients: firstly, estimating cerebrovascular pressure reactivity, an important indicator of secondary damage for traumatic brain injury patients, by learning the interactions between intracranial pressure and mean arterial blood pressure signals, and secondly, mortality prediction using clinical progress notes. In both cases, MTGPs provided improved results: an MTGP model provided better results than single-task GP models for signal interpolation and forecasting (0.91 vs 0.69 RMSE), and the use of MTGP hyperparameters obtained improved results when used as additional classification features (0.812 vs 0.788 AUC).

cardiology, timesery, vascular disease, (24 more...)

AAAI Conferences

Twenty-Ninth AAAI Conference on Artificial Intelligence

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.15)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.93)

Technology:

Information Technology > Data Science (0.93)
Information Technology > Biomedical Informatics > Clinical Informatics (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

PI-in-a-Box: A Knowledge-Based System for Space Science Experimentation

Franier, Richard, Groleau, Nicholas, Hazelton, Lyman, Colombano, Silvano, Compton, Michael, Statler, Irving, Szolovits, Peter, Young, Laurence

AI MagazineMar-15-1994

The principal investigator (PI)-IN-A-BOX knowledge based system helps astronauts perform science experiments in space. This environment suggests the use of advanced techniques for data collection, analysis, and decision making to maximize the value of the research performed. PI-IN-A-BOX aids astronauts with quick-look data collection, reduction, and analysis as well as equipment diagnosis and troubleshooting, procedural reminders, and suggestions for high-value departures from the preplanned experiment protocol. The system is in use on the ground for mission training and was used in flight during the October 1993 space life sciences 2 (SLS-2) shuttle mission.

artificial intelligence, knowledge engineering, space science experimentation, (7 more...)

AI Magazine

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)

Add feedback

PI-in-a-Box: A Knowledge-Based System for Space Science Experimentation

Franier, Richard, Groleau, Nicholas, Hazelton, Lyman, Colombano, Silvano, Compton, Michael, Statler, Irving, Szolovits, Peter, Young, Laurence

AI MagazineMar-15-1994

The principal investigator (PI)-IN-A-BOX knowledge based system helps astronauts perform science experiments in space. These experiments are typically costly to devise and build and often are difficult to perform. Further, the space laboratory environment is unique; ever changing; hectic; and, therefore, stressful. The environment requires quick, correct reactions to events over a wide range of experiments and disciplines, including ones distant from an astronaut's main science specialty. This environment suggests the use of advanced techniques for data collection, analysis, and decision making to maximize the value of the research performed. PI-IN-A-BOX aids astronauts with quick-look data collection, reduction, and analysis as well as equipment diagnosis and troubleshooting, procedural reminders, and suggestions for high-value departures from the preplanned experiment protocol. The astronauts have direct access to the system, which is hosted on a portable computer in the Space Lab module. The system is in use on the ground for mission training and was used in flight during the October 1993 space life sciences 2 (SLS-2) shuttle mission.

air transportation, experiment, us government, (23 more...)

AI Magazine

Country: North America > United States > Texas > Harris County > Houston (0.14)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Space Agency (0.95)
Transportation > Air (0.88)
Health & Medicine > Therapeutic Area (0.68)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)

Add feedback

What Is a Knowledge Representation?

Davis, Randall, Shrobe, Howard, Szolovits, Peter

AI MagazineMar-15-1993

Although knowledge representation is one of the central and, in some ways, most familiar concepts in AI, the most fundamental question about it -- What is it? Numerous papers have lobbied for one or another variety of representation, other papers have argued for various properties a representation should have, and still others have focused on properties that are important to the notion of representation in general. In this article, we go back to basics to address the question directly. We believe that the answer can best be understood in terms of five important and distinctly different roles that a representation plays, each of which places different and, at times, conflicting demands on the properties a representation should have.

artificial intelligence, PROBLEM SOLVING, representation, (2 more...)

AI Magazine

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.66)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.66)

Add feedback