AITopics | Manuvinakurike, Ramesh

Manuvinakurike, Ramesh

ACE, Action and Control via Explanations: A Proposal for LLMs to Provide Human-Centered Explainability for Multimodal AI Assistants

Watkins, Elizabeth Anne, Moss, Emanuel, Manuvinakurike, Ramesh, Shi, Meng, Beckwith, Richard, Raffa, Giuseppe

arXiv.org Artificial Intelligence, Feb-27-2025

In this short paper we address issues related to building multimodal AI systems for human performance support in manufacturing domains. We make two contributions: first, we identify challenges of participatory design and training of such systems; second, to address these challenges, we propose the ACE paradigm: "Action and Control via Explanations". Specifically, we suggest that LLMs can be used to produce explanations in the form of human-interpretable "semantic frames", which in turn enable end users to provide the data the AI system needs to align its multimodal models and representations, including computer vision, automatic speech recognition, and document inputs. By using LLMs to "explain" via semantic frames, ACE will help the human and the AI system collaborate, together building a more accurate model of human activities and behaviors, and ultimately producing more accurate predictive outputs for better task support and better outcomes for human users performing manual tasks.

artificial intelligence, large language model, natural language, (16 more...)

arXiv:2503.16466

Country: North America > United States > California (0.15)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.84)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.75)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.55)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.51)

QPM: Discrete Optimization for Globally Interpretable Image Classification

Norrenbrock, Thomas, Kaiser, Timo, Biswas, Sovan, Manuvinakurike, Ramesh, Rosenhahn, Bodo

arXiv.org Artificial Intelligence, Feb-27-2025

Understanding the classifications of deep neural networks, e.g., those used in safety-critical situations, is becoming increasingly important. While recent models can locally explain a single decision, providing a faithful global explanation of an accurate model's general behavior is a more challenging open task. Toward that goal, we introduce the Quadratic Programming Enhanced Model (QPM), which learns globally interpretable class representations. QPM represents every class with a binary assignment of very few features, typically 5, that are also assigned to other classes, ensuring easily comparable contrastive class representations. This compact binary assignment is found using discrete optimization based on predefined similarity measures and interpretability constraints. The resulting optimal assignment is used to fine-tune the diverse features, so that each of them becomes the shared general concept between the assigned classes. Extensive evaluations show that QPM delivers unprecedented global interpretability across small and large-scale datasets while setting the state of the art for the accuracy of interpretable models.

artificial intelligence, data mining, machine learning, (21 more...)

arXiv:2502.2013

Country: Europe > Germany (0.28)

Genre: Research Report (0.50)

Industry: Government (0.67)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(2 more...)
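
As a rough illustration of the kind of discrete feature-to-class assignment the QPM abstract above describes, the following Python sketch brute-forces a tiny instance. It is not the QPM quadratic program: the linear objective, the similarity matrix `toy_sim`, the function name `assign_features`, and the two constraints (no two classes get an identical feature set, and every selected feature is used by at least two classes) are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations, product


def assign_features(sim, k):
    """Exhaustively pick k features per class (toy sizes only).

    sim[c][f] is a hypothetical class-feature similarity score.
    Assumed constraints, chosen for illustration:
      * no two classes receive the same feature set, and
      * every selected feature is shared by at least two classes.
    Returns a list with one tuple of feature indices per class,
    or None if no assignment satisfies the constraints.
    """
    n_features = len(sim[0])
    per_class = list(combinations(range(n_features), k))
    best, best_score = None, float("-inf")
    for choice in product(per_class, repeat=len(sim)):
        if len(set(choice)) < len(choice):
            continue  # two classes would be indistinguishable
        usage = Counter(f for feats in choice for f in feats)
        if any(count < 2 for count in usage.values()):
            continue  # some feature is used by a single class only
        score = sum(sim[c][f] for c, feats in enumerate(choice) for f in feats)
        if score > best_score:
            best, best_score = list(choice), score
    return best


# Hypothetical similarities for 3 classes and 4 features.
toy_sim = [
    [0.9, 0.8, 0.1, 0.2],
    [0.9, 0.1, 0.8, 0.2],
    [0.1, 0.8, 0.9, 0.2],
]
print(assign_features(toy_sim, 2))  # [(0, 1), (0, 2), (1, 2)]
```

Exhaustive search is only feasible at this toy scale; it is used here because it makes the constraints easy to read, not because it reflects how the paper solves the problem.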

QA-TOOLBOX: Conversational Question-Answering for process task guidance in manufacturing

Manuvinakurike, Ramesh, Watkins, Elizabeth, Savur, Celal, Rhodes, Anthony, Biswas, Sovan, Mejia, Gesem Gudino, Beckwith, Richard, Sahay, Saurav, Raffa, Giuseppe, Nachman, Lama

arXiv.org Artificial Intelligence, Dec-3-2024

In this work we explore using LLMs for data augmentation in a manufacturing task guidance system. The dataset consists of representative samples of interactions with technicians working in an advanced manufacturing setting. The purpose of this work is to explore the task, data augmentation for the supported tasks, and the performance of existing LLMs. We observe that the task is complex, requiring understanding of procedure specification documents and of actions and objects sequenced temporally. The dataset consists of 200,000+ question/answer pairs that refer to the spec document and are grounded in narrations and/or video demonstrations. We compared the performance of several popular open-source LLMs by developing a "baseline" using each LLM, then compared the responses in a reference-free setting using LLM-as-a-judge, and compared the ratings with those of crowd-workers while validating the ratings with experts.

large language model, machine learning, natural language, (17 more...)

arXiv:2412.02638

Country: North America > United States (0.14)

Genre: Research Report (0.82)

Industry:

Education (0.68)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Zero-shot Conversational Summarization Evaluations with small Large Language Models

Manuvinakurike, Ramesh, Sahay, Saurav, Manepalli, Sangeeta, Nachman, Lama

arXiv.org Artificial Intelligence, Nov-29-2023

The capabilities of large language models (LLMs) on conversational summarization remain underexplored. In this work we evaluate LLMs ( 10 billion parameters) on conversational summarization and showcase their performance on various prompts. We show that the summaries generated by the models depend on the instructions, and that the performance of LLMs varies with different instructions, sometimes resulting in a steep drop in ROUGE scores if prompts are not selected carefully. We also evaluate the models with human evaluations and discuss the limitations of the models on conversational summarization.

large language model, machine learning, natural language, (19 more...)

arXiv:2311.18041

Country: North America > United States (0.67)

Genre: Research Report (1.00)

Industry:

Health & Medicine (1.00)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
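
The abstract above reports ROUGE scores without saying which variant was used. For readers unfamiliar with the metric, here is a minimal Python sketch of ROUGE-1 F1 (unigram overlap). The function name `rouge1_f1` is hypothetical, and the whitespace tokenization without stemming is a simplifying assumption, so its numbers will not match the official ROUGE toolkit exactly.

```python
from collections import Counter


def rouge1_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1 between a candidate summary and a reference.

    Simplified: lowercased whitespace tokens, no stemming or stopword
    handling (an assumption; real ROUGE implementations differ).
    """
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Counter intersection keeps the minimum count per token (clipped matches).
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Two of six reference tokens are matched: precision 1.0, recall 1/3.
score = rouge1_f1("the cat", "the cat sat on the mat")
print(round(score, 3))  # 0.5
```

Because the score depends only on token overlap, two prompts that yield differently worded but equally good summaries can receive quite different values, which is one reason prompt choice can move ROUGE sharply.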

Sample Efficient Multimodal Semantic Augmentation for Incremental Summarization

Bhattacharyya, Sumanta, Manuvinakurike, Ramesh, Mazumder, Sahisnu, Sahay, Saurav

arXiv.org Artificial Intelligence, Mar-7-2023

Summarization is the consolidated format for a large document and has been widely used for many applications i.e., understanding a long meeting/event, story summarization etc. Abstractive summarization is challenging in the Natural Language Generation (NLG) domain as it requires an understanding of all the salient information in the input document and rewriting logically in a condensed manner rather than selection (extractive).

Our work utilizes learning of these semantic concepts as an intermediate step from the videos. These semantic concepts along with the transcriptions (semantic augmentation) as input to a pre-trained summarizer model enrich the performance. In this work, we address the problem of (i) generating semantically relevant annotations of a video (semantic concepts) using a fixed number of sampled frames from each video

artificial intelligence, natural language, text processing, (15 more...)

arXiv:2303.04361

Country: Europe > Netherlands (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Position Matters! Empirical Study of Order Effect in Knowledge-grounded Dialogue

Su, Hsuan, Kumar, Shachi H, Mazumder, Sahisnu, Chen, Wenda, Manuvinakurike, Ramesh, Okur, Eda, Sahay, Saurav, Nachman, Lama, Chen, Shang-Tse, Lee, Hung-yi

arXiv.org Artificial Intelligence, Feb-12-2023

With the power of large pretrained language models, various research works have integrated knowledge into dialogue systems. The traditional techniques treat knowledge as part of the input sequence for the dialogue system, prepending a set of knowledge statements in front of dialogue history. However, such a mechanism forces knowledge sets to be concatenated in an ordered manner, making models implicitly pay imbalanced attention to the sets during training. In this paper, we first investigate how the order of the knowledge set can influence autoregressive dialogue systems' responses. We conduct experiments on two commonly used dialogue datasets with two types of transformer-based models and find that models view the input knowledge unequally. To this end, we propose a simple and novel technique to alleviate the order effect by modifying the position embeddings of knowledge input in these models. With the proposed position embedding method, the experimental results show that each knowledge statement is uniformly considered to generate responses.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv:2302.05888

Country:

Europe > Italy (0.28)
Asia (0.28)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.51)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.38)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.34)
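
The abstract above says the order effect is reduced by modifying the position embeddings of the knowledge input, but does not spell out the scheme. One plausible reading, sketched below in Python, is to restart the position index for every knowledge statement so that no statement is "earlier" than another. The function name `shared_position_ids` and the choice to start the dialogue history after the longest statement are assumptions, not necessarily what the paper does.

```python
def shared_position_ids(knowledge_lens, history_len):
    """Build position ids for [knowledge_1 ... knowledge_n, history].

    Assumed scheme (illustrative only): each knowledge statement is
    numbered from 0, so all statements occupy the same position range;
    the dialogue history continues after the longest statement.
    """
    ids = []
    for n_tokens in knowledge_lens:
        ids.extend(range(n_tokens))  # restart at 0 for each statement
    offset = max(knowledge_lens, default=0)
    ids.extend(range(offset, offset + history_len))
    return ids


# Two knowledge statements (3 and 2 tokens) followed by 4 history tokens.
print(shared_position_ids([3, 2], 4))  # [0, 1, 2, 0, 1, 3, 4, 5, 6]
```

With standard sequential ids the same input would be numbered 0 through 8, which is what ties each statement to its place in the concatenation.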

Human in the loop approaches in multi-modal conversational task guidance system development

Manuvinakurike, Ramesh, Biswas, Sovan, Raffa, Giuseppe, Beckwith, Richard, Rhodes, Anthony, Shi, Meng, Mejia, Gesem Gudino, Sahay, Saurav, Nachman, Lama

arXiv.org Artificial Intelligence, Nov-3-2022

Development of task guidance systems for aiding humans in a situated task remains a challenging problem. The role of search (information retrieval) and conversational systems for task guidance has immense potential to help the task performers achieve various goals. However, there are several technical challenges that need to be addressed to deliver such conversational systems, where common supervised approaches fail to deliver the expected results in terms of overall performance, user experience and adaptation to realistic conditions. In this preliminary work we first highlight some of the challenges involved during the development of such systems. We then provide an overview of existing datasets available and highlight their limitations. We finally develop a model-in-the-loop wizard-of-oz based data collection tool and perform a pilot experiment.

artificial intelligence, natural language, text processing, (20 more...)

arXiv:2211.01824

Genre:

Overview (0.54)
Research Report (0.50)

Industry: Education (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
