AITopics | Sriram, Ram D.

Collaborating Authors

Sriram, Ram D.

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

LineConGraphs: Line Conversation Graphs for Effective Emotion Recognition using Graph Neural Networks

Krishnan, Gokul S, Padi, Sarala, Greenberg, Craig S., Ravindran, Balaraman, Manoch, Dinesh, Sriram, Ram D.

arXiv.org Artificial IntelligenceDec-4-2023

Emotion Recognition in Conversations (ERC) is a critical aspect of affective computing, and it has many practical applications in healthcare, education, chatbots, and social media platforms. Earlier approaches for ERC analysis involved modeling both speaker and long-term contextual information using graph neural network architectures. However, it is ideal to deploy speaker-independent models for real-world applications. Additionally, long context windows can potentially create confusion in recognizing the emotion of an utterance in a conversation. To overcome these limitations, we propose novel line conversation graph convolutional network (LineConGCN) and graph attention (LineConGAT) models for ERC analysis. These models are speaker-independent and built using a graph construction strategy for conversations -- line conversation graphs (LineConGraphs). The conversational context in LineConGraphs is short-term -- limited to one previous and future utterance, and speaker information is not part of the graph. We evaluate the performance of our proposed models on two benchmark datasets, IEMOCAP and MELD, and show that our LineConGAT model outperforms the state-of-the-art methods with an F1-score of 64.58% and 76.50%. Moreover, we demonstrate that embedding sentiment shift information into line conversation graphs further enhances the ERC performance in the case of GCN models.

machine learning, natural language, utterance, (21 more...)

arXiv.org Artificial Intelligence

2312.03756

Country: North America > United States > Maryland > Prince George's County > College Park (0.14)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.48)
Research Report > New Finding (0.46)

Industry: Health & Medicine (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation

Padi, Sarala, Sadjadi, Seyed Omid, Manocha, Dinesh, Sriram, Ram D.

arXiv.org Artificial IntelligenceAug-16-2021

Automatic speech emotion recognition (SER) is a challenging task that plays a crucial role in natural human-computer interaction. One of the main challenges in SER is data scarcity, i.e., insufficient amounts of carefully labeled data to build and fully explore complex deep learning models for emotion classification. This paper aims to address this challenge using a transfer learning strategy combined with spectrogram augmentation. Specifically, we propose a transfer learning approach that leverages a pre-trained residual network (ResNet) model including a statistics pooling layer from speaker recognition trained using large amounts of speaker-labeled data. The statistics pooling layer enables the model to efficiently process variable-length input, thereby eliminating the need for sequence truncation which is commonly used in SER systems. In addition, we adopt a spectrogram augmentation technique to generate additional training data samples by applying random time-frequency masks to log-mel spectrograms to mitigate overfitting and improve the generalization of emotion recognition models. We evaluate the effectiveness of our proposed approach on the interactive emotional dyadic motion capture (IEMOCAP) dataset. Experimental results indicate that the transfer learning and spectrogram augmentation approaches improve the SER performance, and when combined achieve state-of-the-art results.

deep learning, emotion recognition, neural network, (16 more...)

arXiv.org Artificial Intelligence

2108.0251

Country: North America > United States > Maryland (0.28)

Genre: Research Report > New Finding (0.47)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Emotion (1.00)

Add feedback

Multi-Window Data Augmentation Approach for Speech Emotion Recognition

Padi, Sarala, Manocha, Dinesh, Sriram, Ram D.

arXiv.org Artificial IntelligenceOct-27-2020

We present a novel, Multi-Window Data Augmentation (MWA-SER) approach for speech emotion recognition. MWA-SER is a unimodal approach that focuses on two key concepts; designing the speech augmentation method to generate additional data samples and building the deep learning models to recognize the underlying emotion of an audio signal. The multi-window augmentation method extracts more audio features from the speech signal by employing multiple window sizes in the audio feature extraction process. We show that our proposed augmentation method, combined with a deep learning model, improves the speech emotion recognition performance. We evaluate the performance of our MWA-SER approach on the IEMOCAP corpus and show that our proposed method achieves state-of-the-art results. Furthermore, the proposed system demonstrated 70% and 88% accuracy while recognizing the emotions for the SAVEE and RAVDESS datasets, respectively.

deep learning, emotion, neural network, (18 more...)

arXiv.org Artificial Intelligence

2010.09895

Country: North America > United States > Maryland (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

Compositional Models for the Internet of Everything

Breiner, Spencer (NIST) | Sriram, Ram D. (NIST) | Subrahmanian, Eswaran (Carnegie Mellon University)

AAAI ConferencesMar-21-2018

In this note we identify four fundamental characteristics of the IoE which are vexing to handle in practice: heterogeneity, composition, perspective and joint cognition. We discuss the way that each of these introduces a new dimension of complexity for the application of machine learning and artificial intelligence in the IoE. Finally, we introduce some mathematical methods from category theory which we believe can help to address these obstacles.

compositional model, internet

AAAI Conferences

2018 AAAI Spring Symposium Series

Technology:

Information Technology > Artificial Intelligence (0.53)
Information Technology > Communications > Networks (0.40)

Add feedback