Collaborating Authors: Ray, Soumya


Optimizing Drug Design by Merging Generative AI With Active Learning Frameworks

arXiv.org Artificial Intelligence

Traditional drug discovery programs are being transformed by the advent of machine learning methods. Among these, Generative AI methods (GM) have gained attention due to their ability to design new molecules and enhance specific properties of existing ones. However, current GM methods have limitations, such as low affinity towards the target, unknown ADME/PK properties, or a lack of synthetic tractability. To improve the applicability domain of GM methods, we have developed a workflow based on a variational autoencoder coupled with active learning steps. Our GM workflow iteratively learns from molecular metrics, including drug-likeness, synthesizability, similarity, and docking scores. In addition, we included a hierarchical set of criteria based on advanced molecular modeling simulations during a final selection step. We tested our GM workflow on two model systems, CDK2 and KRAS. In both cases, our model generated chemically viable molecules with high predicted affinity toward the targets. In particular, the proportion of high-affinity molecules generated by our GM workflow was significantly greater than that in the training data. Notably, we also uncovered novel scaffolds significantly dissimilar to those known for each target. These results highlight the potential of our GM workflow to explore novel chemical space for specific targets, thereby opening up new possibilities for drug discovery endeavors.
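
The iterative generate-score-select loop described above can be sketched in a few lines. The snippet below is a minimal schematic only, assuming placeholder stand-ins for the VAE sampler, the four scoring metrics (drug-likeness, synthesizability, similarity, docking), and the fine-tuning step, none of which are specified here; in the actual workflow, a final selection step would additionally apply the hierarchical molecular-modeling criteria.

```python
# Minimal schematic of the generate-score-select-retrain loop; all components
# are random placeholder stand-ins, not the actual VAE or scoring functions.
import random

random.seed(0)

def generate_candidates(state, n=64):
    """Stand-in for sampling candidate SMILES strings from the VAE decoder."""
    return [f"candidate_{state['round']}_{i}" for i in range(n)]

def score_molecule(smiles):
    """Stand-in composite of drug-likeness, synthesizability, similarity to
    known actives, and docking score, each pretended to lie in [0, 1]."""
    return sum(random.random() for _ in range(4)) / 4.0

def fine_tune(state, selected):
    """Stand-in for fine-tuning the generator on the highest-scoring designs."""
    state["round"] += 1
    state["selected"].extend(selected)
    return state

state = {"round": 0, "selected": []}
for _ in range(5):                                  # active-learning iterations
    pool = generate_candidates(state)
    ranked = sorted(pool, key=score_molecule, reverse=True)
    state = fine_tune(state, ranked[:8])            # acquisition: keep the top designs

print(f"{len(state['selected'])} molecules selected over {state['round']} rounds")
```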


Beyond Fair Pay: Ethical Implications of NLP Crowdsourcing

arXiv.org Artificial Intelligence

The use of crowdworkers in NLP research is growing rapidly, in tandem with the exponential increase in research production in machine learning and AI. Ethical discussion regarding the use of crowdworkers within the NLP research community is typically confined in scope to issues related to labor conditions such as fair pay. We draw attention to the lack of ethical considerations related to the various tasks performed by workers, including labeling, evaluation, and production. We find that the Final Rule, the common ethical framework used by researchers, did not anticipate the use of online crowdsourcing platforms for data collection, resulting in gaps between the spirit and practice of human-subjects ethics in NLP research. We enumerate common scenarios where crowdworkers performing NLP tasks are at risk of harm. We thus recommend that researchers evaluate these risks by considering the three ethical principles set out in the Belmont Report. We also clarify some common misconceptions regarding the Institutional Review Board (IRB) application process. We hope this paper will serve to reopen the discussion within our community regarding the ethical use of crowdworkers.


Reactive Supervision: A New Method for Collecting Sarcasm Data

arXiv.org Artificial Intelligence

Sarcasm detection is an important task in affective computing, requiring large amounts of labeled data. We introduce reactive supervision, a novel data collection method that utilizes the dynamics of online conversations to overcome the limitations of existing data collection techniques. We use the new method to create and release a first-of-its-kind large dataset of tweets with sarcasm perspective labels and new contextual features. The dataset is expected to advance sarcasm detection research. Our method can be adapted to other affective computing domains, thus opening up new research opportunities.


Gesture Annotation With a Visual Search Engine for Multimodal Communication Research

AAAI Conferences

Human communication is multimodal and includes elements such as gesture and facial expression along with spoken language. Modern technology makes it feasible to capture all such aspects of communication in natural settings. As a result, similar to fields such as genetics, astronomy and neuroscience, scholars in areas such as linguistics and communication studies are on the verge of a data-driven revolution in their fields. These new approaches require analytical support from machine learning and artificial intelligence to develop tools to help process the vast data repositories. The Distributed Little Red Hen Lab project is an international team of interdisciplinary researchers building a large-scale infrastructure for data-driven multimodal communications research. In this paper, we describe a machine learning system developed to automatically annotate a large database of television program videos as part of this project. The annotations mark regions where people or speakers are on screen along with body part motions including head, hand and shoulder motion. We also annotate a specific class of gestures known as timeline gestures. An existing gesture annotation tool, ELAN, can be used with these annotations to quickly locate gestures of interest. Finally, we provide an update mechanism for the system based on human feedback. We empirically evaluate the accuracy of the system as well as present data from pilot human studies to show its effectiveness at aiding gesture scholars in their work.


Automated Volumetric Intravascular Plaque Classification Using Optical Coherence Tomography

AI Magazine

An estimated 17.5 million people died from a cardiovascular disease in 2012, representing 31 percent of all global deaths. Most acute coronary events result from rupture of the protective fibrous cap overlying an atherosclerotic plaque. The task of early identification of plaque types that can potentially rupture is, therefore, of great importance. The state-of-the-art approach to imaging blood vessels is intravascular optical coherence tomography (IVOCT). However, currently, this is an offline approach where the images are first collected and then manually analyzed one image at a time to identify regions at risk of thrombosis. This process is extremely laborious, time-consuming, and prone to human error. We are building a system that, when complete, will provide interactive 3D visualization of a blood vessel as an IVOCT scan is in progress. The visualization will highlight different plaque types and enable quick identification of regions at risk for thrombosis. In this paper, we describe our approach, focusing on machine learning methods that are a key enabling technology. Our empirical results using real OCT data show that our approach can identify different plaque types efficiently with high accuracy across multiple patients.


Automated Volumetric Intravascular Plaque Classification Using Optical Coherence Tomography (OCT)

AAAI Conferences

An estimated 17.5 million people died from a cardiovascular disease in 2012, representing 31% of all global deaths. Most acute coronary events result from rupture of the protective fibrous cap overlying an atherosclerotic plaque. The task of early identification of plaque types that can potentially rupture is, therefore, of great importance. The state-of-the-art approach to imaging blood vessels is intravascular optical coherence tomography (IVOCT). However, currently, this is an offline approach where the images are first collected and then manually analyzed one frame at a time to identify regions at risk of thrombosis. This process is extremely laborious, time-consuming, and prone to human error. We are building a system that, when complete, will provide interactive 3D visualization of a blood vessel as an IVOCT scan is in progress. The visualization will highlight different plaque types and enable quick identification of regions at risk for thrombosis. In this paper, we describe our approach, focusing on machine learning methods that are a key enabling technology. Our empirical results using real OCT data show that our approach can identify different plaque types efficiently with high accuracy across multiple patients.
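
As a rough illustration of the cross-patient evaluation implied by "high accuracy across multiple patients", the sketch below runs a generic multiclass classifier with leave-one-patient-out splits. The per-frame features, the three plaque classes, and the random-forest classifier are illustrative assumptions, not the system's actual IVOCT feature extraction or model.

```python
# Illustrative leave-one-patient-out evaluation for multiclass plaque-type
# classification; features, labels, and patient groups are synthetic stand-ins.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import LeaveOneGroupOut, cross_val_score

rng = np.random.default_rng(0)
n_frames, n_features, n_patients = 600, 16, 6
X = rng.normal(size=(n_frames, n_features))
y = rng.integers(0, 3, size=n_frames)        # assumed classes, e.g. fibrous / lipid / calcified
groups = rng.integers(0, n_patients, size=n_frames)
X += y[:, None] * 0.75                       # shift class means so the toy task is learnable

clf = RandomForestClassifier(n_estimators=100, random_state=0)
scores = cross_val_score(clf, X, y, groups=groups, cv=LeaveOneGroupOut())
print("held-out accuracy per patient:", np.round(scores, 2))
```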


Learning Instance Concepts from Multiple-Instance Data with Bags as Distributions

AAAI Conferences

We analyze and evaluate a generative process for multiple-instance learning (MIL) in which bags are distributions over instances. We show that our generative process contains as special cases generative models explored in prior work, while excluding scenarios known to be hard for MIL. Further, under the mild assumption that every negative instance is observed with nonzero probability in some negative bag, we show that it is possible to learn concepts that accurately label instances from MI data in this setting. Finally, we show that standard supervised approaches can learn concepts with low area-under-ROC error from MI data in this setting. We validate this surprising result with experiments using several synthetic and real-world MI datasets that have been annotated with instance labels.
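
A rough way to see the claim about standard supervised approaches is the baseline sketched below: every instance inherits its bag's label, an off-the-shelf classifier is trained on these (noisy) labels, and instance-level AUC is measured against the true instance labels. The synthetic bag generator and the logistic-regression learner are illustrative assumptions, not the paper's experimental setup.

```python
# Baseline sketched above: give every instance its bag's label, train an
# ordinary supervised classifier, and measure instance-level AUC against the
# true (hidden) instance labels. All data here is synthetic and illustrative.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)

def sample_bag(positive, n_inst=10):
    """Bags as distributions over instances: positive bags mix both kinds."""
    if positive:
        k = int(rng.integers(1, n_inst))                  # at least one positive instance
        pos = rng.normal(loc=+2.0, size=(k, 2))
        neg = rng.normal(loc=-2.0, size=(n_inst - k, 2))
        return np.vstack([pos, neg]), np.array([1] * k + [0] * (n_inst - k))
    return rng.normal(loc=-2.0, size=(n_inst, 2)), np.zeros(n_inst, dtype=int)

X_parts, bag_label_parts, true_label_parts = [], [], []
for b in range(100):
    positive = b % 2 == 0
    Xb, yb = sample_bag(positive)
    X_parts.append(Xb)
    bag_label_parts.append(np.full(len(Xb), int(positive)))   # inherited, noisy labels
    true_label_parts.append(yb)

X = np.vstack(X_parts)
y_bag = np.concatenate(bag_label_parts)
y_true = np.concatenate(true_label_parts)

clf = LogisticRegression().fit(X, y_bag)                      # standard supervised learner
print("instance-level AUC:", round(roc_auc_score(y_true, clf.predict_proba(X)[:, 1]), 3))
```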


Detection and Prediction of Adverse and Anomalous Events in Medical Robots

AAAI Conferences

Adverse and anomalous (A&A) events are a serious concern in medical robots. We describe a system that can rapidly detect such events and predict their occurrence. As part of this system, we describe the simulation, data collection, and user interface tools we built for a robot used for small animal biopsies. The data we collect consists of both the hardware state of the robot and variables in the software controller. We use this data to train dynamic Bayesian network models of the joint hardware-software state-space dynamics of the robot. Our empirical evaluation shows that (i) our models can accurately model normal behavior of the robot, (ii) they can rapidly detect anomalous behavior once it starts, (iii) they can accurately predict a future A&A event within a time window before it starts, and (iv) the use of additional software variables beyond the hardware state of the robot is important for detecting and predicting certain kinds of events.
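
The monitoring idea can be illustrated with a much simpler stand-in for the dynamic Bayesian network models: fit a one-step linear dynamics model to normal joint hardware/software state trajectories, then flag time steps whose prediction residual exceeds a threshold. The synthetic trajectories, the injected fault, and the threshold rule below are all illustrative assumptions.

```python
# Simplified stand-in for the DBN monitor: fit a one-step linear dynamics model
# to normal joint hardware/software state trajectories, then raise an alarm when
# the prediction residual exceeds a threshold. Everything below is synthetic.
import numpy as np

rng = np.random.default_rng(1)
A_true = np.array([[0.90, 0.05, 0.00],
                   [0.00, 0.95, 0.02],
                   [0.00, 0.00, 0.90]])

def simulate(T, inject_fault=False):
    """Simulate a 3-dimensional state trajectory; optionally inject a fault."""
    x, traj = np.zeros(3), []
    for t in range(T):
        x = A_true @ x + rng.normal(scale=0.05, size=3)
        if inject_fault and t > T // 2:
            x = x + np.array([0.5, 0.0, 0.0])       # abrupt offset in one channel
        traj.append(x.copy())
    return np.array(traj)

normal = simulate(500)
A_hat, *_ = np.linalg.lstsq(normal[:-1], normal[1:], rcond=None)   # x_{t+1} ~= x_t @ A_hat
resid = np.linalg.norm(normal[1:] - normal[:-1] @ A_hat, axis=1)
threshold = resid.mean() + 4 * resid.std()

test = simulate(200, inject_fault=True)
test_resid = np.linalg.norm(test[1:] - test[:-1] @ A_hat, axis=1)
alarms = np.where(test_resid > threshold)[0] + 1
print("first alarm at step:", int(alarms[0]) if len(alarms) else "none")
```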


SMILe: Shuffled Multiple-Instance Learning

AAAI Conferences

Resampling techniques such as bagging are often used in supervised learning to produce more accurate classifiers. In this work, we show that multiple-instance learning admits a different form of resampling, which we call "shuffling." In shuffling, we resample instances in such a way that the resulting bags are likely to be correctly labeled. We show that resampling results in both a reduction of bag label noise and a propagation of additional informative constraints to a multiple-instance classifier. We empirically evaluate shuffling in the context of multiple-instance classification and multiple-instance active learning and show that the approach leads to significant improvements in accuracy.
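
The shuffling idea can be sketched on toy data: pool the instances of bags that share a label and draw new bags from that pool, so that resampled negative bags are always correctly labeled and resampled positive bags are correct whenever they happen to contain a positive instance. The toy bags and the uniform sampling below are illustrative; SMILe's actual resampling scheme and its noise analysis are more involved.

```python
# Toy sketch of shuffling: pool the instances of same-label bags and draw new
# bags from the pool. Bag contents are just tags; 'P' marks a positive instance.
import random

random.seed(0)

def shuffle_bags(bags, n_new, bag_size):
    """Pool the instances of the given bags and resample new bags of bag_size."""
    pool = [inst for bag in bags for inst in bag]
    return [random.sample(pool, bag_size) for _ in range(n_new)]

positive_bags = [["P", "n1", "n2"], ["n3", "P", "n4"], ["P", "P", "n5"]]
negative_bags = [["n6", "n7", "n8"], ["n9", "n10", "n11"]]

new_pos = shuffle_bags(positive_bags, n_new=5, bag_size=3)   # labeled positive
new_neg = shuffle_bags(negative_bags, n_new=5, bag_size=3)   # always correctly negative

correct = sum(any(inst == "P" for inst in bag) for bag in new_pos)
print(f"{correct}/5 shuffled positive bags actually contain a positive instance")
```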


SEPIA: A Scalable Game Environment for Artificial Intelligence Teaching and Research

AAAI Conferences

We describe a game environment we have developed that we call the Strategy Engine for Programming Intelligent Agents (SEPIA). SEPIA is based on real-time strategy games, but modified extensively to preferentially support the development of artificial agents rather than human play. Through flexible configuration options, SEPIA is designed to be pedagogically scalable: suitable for use at the undergraduate and graduate levels, and also as a research testbed. We also describe assignments and our experiences with this environment in undergraduate and graduate classes.