
Collaborating Authors

 Krishnaswamy, Nikhil


TRACE: Real-Time Multimodal Common Ground Tracking in Situated Collaborative Dialogues

arXiv.org Artificial Intelligence

In situations involving hybrid human-AI teams, although there is an increasing desire for AIs that act as collaborators with humans, modern AI systems struggle to account for mental states in their human interlocutors (Sap et al., 2022; Ullman, 2023) that might expose shared or conflicting beliefs, and thus predict and explain in-context behavior (Premack and Woodruff, 1978). Additionally, in realistic scenarios such as collaborative problem solving (Nelson, 2013), beliefs are communicated not just through language, but through multimodal signals including gestures, tone of voice, and interaction with the physical environment (VanderHoeven et al., 2024b). Since one of the critical capabilities that makes human-human collaboration so successful is the human ability to interpret multiple coordinated signals, this work presents the following novel and unique contributions in a single system: real-time tracking of participant speech, actions, gesture, and gaze when engaging in a shared task; on-the-fly interpretation and integration of multimodal signals to provide a complete scene representation for inference; simultaneous detection of asserted propositional content and epistemic positioning to infer task-relevant information for which evidence has been raised, or which the group has agreed is factual; and a modular, extensible architecture adaptable to new tasks and scenarios.
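As a concrete illustration of the modular design described above, here is a minimal Python sketch of a common-ground tracker: per-modality processors emit timestamped events, and the tracker sorts asserted propositions into "evidenced" versus "accepted" sets. All class and field names are invented for the example; the actual TRACE system is far richer than this.

```python
# Hypothetical sketch, not TRACE's actual code: independent per-modality
# processors emit timestamped events, and a tracker folds asserted
# propositions into a common-ground state.
from dataclasses import dataclass, field

@dataclass
class Event:
    time: float          # seconds into the interaction
    modality: str        # "speech" | "gesture" | "gaze" | "action"
    proposition: str     # e.g. "red block weighs 10g"
    epistemic: str       # "statement" | "accept" | "doubt"

@dataclass
class CommonGround:
    evidenced: set = field(default_factory=set)  # raised, not yet agreed
    accepted: set = field(default_factory=set)   # group has agreed is factual

    def update(self, ev: Event) -> None:
        if ev.epistemic == "statement":
            self.evidenced.add(ev.proposition)
        elif ev.epistemic == "accept" and ev.proposition in self.evidenced:
            self.evidenced.discard(ev.proposition)
            self.accepted.add(ev.proposition)
        elif ev.epistemic == "doubt":
            self.accepted.discard(ev.proposition)
            self.evidenced.add(ev.proposition)

cg = CommonGround()
for ev in [Event(1.2, "speech", "red block weighs 10g", "statement"),
           Event(3.4, "gesture", "red block weighs 10g", "accept")]:
    cg.update(ev)
print(cg.accepted)  # {'red block weighs 10g'}
```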


Speech Is Not Enough: Interpreting Nonverbal Indicators of Common Knowledge and Engagement

arXiv.org Artificial Intelligence

Our goal is to develop an AI Partner that can provide support for group problem solving and social dynamics. In multi-party working group environments, multimodal analytics is crucial for identifying nonverbal interactions of group members. In conjunction with their verbal participation, this creates a holistic understanding of collaboration and engagement that provides necessary context for the AI Partner. In this demo, we illustrate our present capabilities for detecting and tracking nonverbal behavior in student task-oriented interactions in the classroom, and the implications for tracking common ground and engagement.


Any Other Thoughts, Hedgehog? Linking Deliberation Chains in Collaborative Dialogues

arXiv.org Artificial Intelligence

Recent breakthroughs in generative AI have raised the possibility of systems that follow and interact with multiparty dialogue. Inherent in group dialogues are utterance sequences that deliberate on the same information. Modeling these is particularly challenging; while such utterances have a linear order and overlapping information, they may be distantly separated in time and the same information may be expressed very differently. In this paper, we construct deliberation chains in dialogue: turn sequences that surface pieces of evidence or questions under discussion that culminate in a "probing utterance," or explicit elicitation of input that does not introduce new information. This work contributes: a novel task of automatically constructing "deliberation chains" of probing questions in a dialogue, linking them with their causal utterances; a formal graphical framework for deliberation chains derived from formal semantics of situated conversation (Hunter et al., 2018); a unique adaptation of methods from coreference resolution to this new task; and baseline evaluation on two challenging collaborative dialogue datasets--DeliData and the Weights Task Dataset--and a novel method of jointly modeling probing and causal interventions.
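The coreference-style recipe the paper adapts can be sketched as pairwise scoring followed by clustering: score every utterance pair for "same deliberation," then take connected components as chains. The toy below uses lexical overlap as a stand-in for a trained pairwise encoder, and union-find in place of the paper's graphical framework; the threshold and all names are invented.

```python
# Toy illustration only: Jaccard word overlap substitutes for a learned
# pairwise scorer; connected components substitute for the paper's model.
from itertools import combinations

def tokens(u: str) -> set[str]:
    return {w.strip("?.,!") for w in u.lower().split()}

def pair_score(u1: str, u2: str) -> float:
    a, b = tokens(u1), tokens(u2)
    return len(a & b) / max(len(a | b), 1)

def chains(utterances: list[str], threshold: float = 0.25) -> list[set[int]]:
    parent = list(range(len(utterances)))
    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]   # path halving
            i = parent[i]
        return i
    for i, j in combinations(range(len(utterances)), 2):
        if pair_score(utterances[i], utterances[j]) >= threshold:
            parent[find(i)] = find(j)
    groups: dict[int, set[int]] = {}
    for i in range(len(utterances)):
        groups.setdefault(find(i), set()).add(i)
    return [g for g in groups.values() if len(g) > 1]

turns = ["the red block is ten grams",
         "maybe the red block is twenty",
         "any other thoughts on the red block?",   # probing utterance
         "let's weigh the blue one"]
print(chains(turns))  # [{0, 1, 2}]
```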


Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both

arXiv.org Artificial Intelligence

Reward modeling of human preferences is one of the cornerstones of building usable generative large language models (LLMs). While traditional RLHF-based alignment methods explicitly maximize the expected rewards from a separate reward model, more recent supervised alignment methods like Direct Preference Optimization (DPO) circumvent this phase to avoid problems including model drift and reward overfitting. Although popular due to their simplicity, DPO and similar direct alignment methods can still lead to degenerate policies and rely heavily on the Bradley-Terry-based preference formulation to model reward differences between pairs of candidate outputs. This formulation is challenged by non-deterministic or noisy preference labels, for example when human scoring of two candidate outputs is of low confidence. In this paper, we introduce DRDO (Direct Reward Distillation and policy-Optimization), a supervised knowledge distillation-based preference alignment method that simultaneously models rewards and preferences to avoid such degeneracy. DRDO directly mimics rewards assigned by an oracle while learning human preferences from a novel preference likelihood formulation. Our experimental results on the Ultrafeedback and TL;DR datasets demonstrate that policies trained using DRDO surpass previous methods such as DPO and e-DPO in terms of expected rewards and are more robust, on average, to noisy preference signals as well as out-of-distribution (OOD) settings.
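A rough schematic of the stated idea, combining regression toward an oracle's reward margin with a Bradley-Terry preference term on DPO-style implicit rewards. This is not the paper's exact DRDO objective; `beta`, `alpha`, and all argument names are invented for the example.

```python
# Schematic only -- NOT the published DRDO loss. A regression target keeps
# graded signal when preference labels are noisy or near-tied, exactly
# where a hard Bradley-Terry label is least reliable.
import torch
import torch.nn.functional as F

def drdo_style_loss(policy_logp_w, policy_logp_l,   # log p(y|x), policy
                    ref_logp_w, ref_logp_l,         # log p(y|x), reference
                    oracle_r_w, oracle_r_l,         # oracle reward scores
                    beta=0.1, alpha=1.0):
    # Implicit rewards, as in DPO: beta * log(pi_theta / pi_ref)
    r_w = beta * (policy_logp_w - ref_logp_w)
    r_l = beta * (policy_logp_l - ref_logp_l)
    # Distillation: match the oracle's reward margin.
    distill = ((oracle_r_w - oracle_r_l) - (r_w - r_l)) ** 2
    # Preference term: Bradley-Terry negative log-likelihood.
    pref = -F.logsigmoid(r_w - r_l)
    return (distill + alpha * pref).mean()

batch = [torch.randn(4) for _ in range(6)]   # toy per-example values
print(float(drdo_style_loss(*batch)))
```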


Combating Spatial Disorientation in a Dynamic Self-Stabilization Task Using AI Assistants

arXiv.org Artificial Intelligence

Spatial disorientation is a leading cause of fatal aircraft accidents. This paper explores the potential of AI agents to aid pilots in maintaining balance and preventing unrecoverable losses of control by offering cues and corrective measures that ameliorate spatial disorientation. A multi-axis rotation system (MARS) was used to gather data from human subjects self-balancing in a spaceflight analog condition. We trained models on this data to create "digital twins" that exemplified performance characteristics of humans with different proficiency levels. We then trained various reinforcement learning and deep learning models to offer corrective cues when loss of control is predicted. Digital twins and assistant models then co-performed a virtual inverted pendulum (VIP) task programmed with identical physics. From these simulations, we picked the 5 best-performing assistants based on task metrics such as crash frequency and mean distance from the direction of balance. These were used in a co-performance study with 20 new human subjects performing a version of the VIP task with degraded spatial information. We show that certain AI assistants were able to improve human performance, and that reinforcement learning-based assistants were objectively more effective but were rated by humans as less trusted and less preferable.
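To make the co-performance loop concrete, here is a toy sketch with invented dynamics, gains, and thresholds: a virtual inverted pendulum, a proxy "human" controller that is unstable on its own, and an assistant that cues a correction when its short-horizon projection predicts loss of control.

```python
# Invented toy dynamics, not the study's simulator: the proxy controller
# alone diverges; the assistant's projected-angle cue keeps it bounded.
import math

def step(theta, omega, u, dt=0.02, g=9.8, l=1.0):
    """One Euler step of a pendulum balanced at theta = 0."""
    omega += (g / l) * math.sin(theta) * dt + u * dt
    return theta + omega * dt, omega

def assistant_cue(theta, omega, horizon=0.5):
    """Cue a correction if the projected angle drifts past a bound."""
    projected = theta + omega * horizon
    return -2.0 * projected if abs(projected) > 0.2 else 0.0

theta, omega = 0.05, 0.0
for t in range(500):                       # 10 seconds at dt = 0.02
    u = -8.0 * theta - 2.0 * omega         # imperfect "human" control
    u += assistant_cue(theta, omega)       # corrective cue
    theta, omega = step(theta, omega, u)
    if abs(theta) > math.pi / 2:
        print("crashed at step", t)
        break
else:
    print("balanced for 10 seconds; final angle", round(theta, 4))
```

Removing the `assistant_cue` line makes the simulated controller diverge, which is the qualitative effect the study measures with crash frequency and mean distance from the direction of balance.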


Metacognitive AI: Framework and the Case for a Neurosymbolic Approach

arXiv.org Artificial Intelligence

Metacognition is the concept of reasoning about an agent's own internal processes, and was originally introduced in the field of developmental psychology. In this position paper, we examine the concept of applying metacognition to artificial intelligence. We introduce a framework for understanding metacognitive artificial intelligence (AI) that we call TRAP: transparency, reasoning, adaptation, and perception. We discuss each of these aspects in turn and explore how neurosymbolic AI (NSAI) can be leveraged to address the challenges of metacognition.
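One way to read the TRAP components operationally is a wrapper that monitors a neural component's confidence (perception), reports it (transparency), decides whether to trust it (reasoning), and defers to a symbolic rule when unsure (adaptation). Everything below is an invented illustration, not a system from the paper.

```python
# Invented illustration of a metacognitive, neurosymbolic wrapper.
def neural_sentiment(text: str) -> tuple[str, float]:
    """Stand-in for a neural classifier returning (label, confidence)."""
    positive = sum(w in text.lower() for w in ("good", "great", "love"))
    negative = sum(w in text.lower() for w in ("bad", "awful", "hate"))
    total = positive + negative
    if total == 0:
        return "neutral", 0.34            # no evidence: low confidence
    label = "positive" if positive >= negative else "negative"
    return label, max(positive, negative) / total

def symbolic_fallback(text: str) -> str:
    """Deterministic rule used when the neural component is unsure."""
    return "negative" if "not" in text.lower().split() else "neutral"

def metacognitive_classify(text: str, threshold: float = 0.6) -> dict:
    label, conf = neural_sentiment(text)     # perception of own state
    defer = conf < threshold                 # reasoning about trust
    return {
        "label": symbolic_fallback(text) if defer else label,  # adaptation
        "confidence": conf,                                    # transparency
        "deferred_to_symbolic": defer,
    }

print(metacognitive_classify("the movie was not what I hoped"))
```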


Computational Thought Experiments for a More Rigorous Philosophy and Science of the Mind

arXiv.org Artificial Intelligence

We offer philosophical motivations for a method we call Virtual World Cognitive Science (VW CogSci), in which researchers use virtual embodied agents that are embedded in virtual worlds to explore questions in the field of Cognitive Science. We focus on questions about mental and linguistic representation and the ways that such computational modeling can add rigor to philosophical thought experiments, as well as the terminology used in the scientific study of such representations. We find that this method forces researchers to take a god's-eye view when describing dynamical relationships between entities in minds and entities in an environment in a way that eliminates the need for problematic talk of belief and concept types, such as the belief that cats are silly, and the concept CAT, while preserving belief and concept tokens in individual cognizers' minds. We conclude with some further key advantages of VW CogSci for the scientific study of mental and linguistic representation and for Cognitive Science more broadly.


Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles

arXiv.org Artificial Intelligence

Event coreference resolution (ECR) is the task of determining whether distinct mentions of events within a multi-document corpus are actually linked to the same underlying occurrence. Images of the events can help facilitate resolution when language is ambiguous. Here, we propose a multimodal cross-document event coreference resolution method that integrates visual and textual cues with a simple linear map between vision and language models. As existing ECR benchmark datasets rarely provide images for all event mentions, we augment the popular ECB+ dataset with event-centric images scraped from the internet and generated using image diffusion models. We establish three methods that incorporate images and text for coreference: 1) a standard fused model with finetuning, 2) a novel linear mapping method without finetuning, and 3) an ensembling approach based on splitting mention pairs by semantic and discourse-level difficulty. We evaluate on two datasets: the augmented ECB+ and AIDA Phase 1. Our ensemble systems using cross-modal linear mapping establish an upper limit (91.9 CoNLL F1) on ECB+ ECR performance given the preprocessing assumptions used, and establish a novel baseline on AIDA Phase 1. Our results demonstrate the utility of multimodal information in ECR for certain challenging coreference problems, and highlight a need for more multimodal resources in the coreference resolution space.
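The linear mapping component is essentially a least-squares fit between frozen embedding spaces. A self-contained sketch with synthetic data follows; the dimensions, data, and noise level are invented, and real features would come from frozen vision and language encoders.

```python
# Sketch of the linear-mapping idea: fit W minimizing ||V W - T||_F on
# paired (image, caption) embeddings, with no finetuning of either model.
import numpy as np

rng = np.random.default_rng(0)
d_vis, d_txt, n_pairs = 512, 768, 1000
V = rng.normal(size=(n_pairs, d_vis))                 # vision embeddings
W_true = rng.normal(size=(d_vis, d_txt)) / np.sqrt(d_vis)
T = V @ W_true + 0.01 * rng.normal(size=(n_pairs, d_txt))  # text embeddings

# Closed-form least squares: argmin_W ||V W - T||_F
W, *_ = np.linalg.lstsq(V, T, rcond=None)

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# A mapped image embedding should now lie near its caption's embedding,
# so mapped visual features can be compared directly with textual ones.
print(cosine(V[0] @ W, T[0]))   # close to 1.0
```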


Okay, Let's Do This! Modeling Event Coreference with Generated Rationales and Knowledge Distillation

arXiv.org Artificial Intelligence

In NLP, Event Coreference Resolution (ECR) is the task of connecting event clusters that refer to the same underlying real-life event, usually via neural systems. In this work, we investigate using abductive free-text rationales (FTRs) generated by modern autoregressive LLMs as distant supervision for smaller student models in cross-document coreference (CDCR) of events. We implement novel rationale-oriented event clustering and knowledge distillation methods for event coreference scoring that leverage enriched information from the FTRs for improved CDCR without additional annotation or expensive document clustering. Our model using coreference-specific knowledge distillation achieves SOTA B3 F1 on the ECB+ and GVC corpora, and we establish a new baseline on the AIDA Phase 1 corpus. Our code can be found at https://github.com/csu-signal/llama_cdcr
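Schematically, the distillation step can be read as soft-label supervision of a small pairwise scorer by teacher probabilities derived from the LLM's rationale and verdict. The sketch below is illustrative rather than the paper's recipe; the shapes, names, and temperature are assumptions.

```python
# Schematic KD for pairwise coreference scoring: a teacher distribution
# over {coreferent, not} supervises a tiny student via KL divergence.
import torch
import torch.nn as nn
import torch.nn.functional as F

class PairwiseStudent(nn.Module):
    """Tiny stand-in for a cross-encoder over an event-mention pair."""
    def __init__(self, dim: int = 768):
        super().__init__()
        self.scorer = nn.Sequential(nn.Linear(2 * dim, 256), nn.ReLU(),
                                    nn.Linear(256, 2))

    def forward(self, emb_a, emb_b):
        return self.scorer(torch.cat([emb_a, emb_b], dim=-1))  # logits

def kd_loss(student_logits, teacher_probs, temperature=2.0):
    log_p = F.log_softmax(student_logits / temperature, dim=-1)
    return F.kl_div(log_p, teacher_probs, reduction="batchmean")

student = PairwiseStudent()
emb_a, emb_b = torch.randn(8, 768), torch.randn(8, 768)
teacher = torch.tensor([[0.9, 0.1]] * 8)   # soft labels from rationales
loss = kd_loss(student(emb_a, emb_b), teacher)
loss.backward()
print(float(loss))
```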


Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets

arXiv.org Artificial Intelligence

Multilingual Language Models (MLLMs) exhibit robust cross-lingual transfer capabilities, or the ability to leverage information acquired in a source language and apply it to a target language. These capabilities find practical applications in well-established Natural Language Processing (NLP) tasks such as Named Entity Recognition (NER). This study aims to investigate the effectiveness of a source language when applied to a target language, particularly in the context of perturbing the input test set. We evaluate on 13 pairs of languages, each including one high-resource language (HRL) and one low-resource language (LRL) with a geographic, genetic, or borrowing relationship. We evaluate two well-known MLLMs--MBERT and XLM-R--on these pairs, in native LRL and cross-lingual transfer settings, in two tasks, under a set of different perturbations. Our findings indicate that NER cross-lingual transfer depends largely on the overlap of entity chunks. If a source and target language have more entities in common, the transfer ability is stronger. Models using cross-lingual transfer also appear to be somewhat more robust to certain perturbations of the input, perhaps indicating an ability to leverage stronger representations derived from the HRL. Our research provides valuable insights into cross-lingual transfer and its implications for NLP applications, and underscores the need to consider linguistic nuances and potential limitations when employing MLLMs across distinct languages.
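Two quantities in this study lend themselves to simple illustration: perturbing the test input and measuring entity-chunk overlap between source and target data. Both functions below are simplified stand-ins for the paper's protocol, with invented rates and examples.

```python
# Simplified stand-ins: an adjacent-character-swap perturbation of the
# test tokens, and the fraction of target entity chunks seen in source.
import random

def perturb(tokens: list[str], rate: float = 0.3, seed: int = 0) -> list[str]:
    """Swap two adjacent characters in a fraction of tokens."""
    rng = random.Random(seed)
    out = []
    for tok in tokens:
        if len(tok) > 3 and rng.random() < rate:
            i = rng.randrange(len(tok) - 1)
            tok = tok[:i] + tok[i + 1] + tok[i] + tok[i + 2:]
        out.append(tok)
    return out

def entity_overlap(src_entities: set[str], tgt_entities: set[str]) -> float:
    """Fraction of target entity chunks also present in the source data."""
    return len(src_entities & tgt_entities) / max(len(tgt_entities), 1)

print(perturb(["President", "Obama", "visited", "Nairobi"], rate=0.5))
print(entity_overlap({"Obama", "Nairobi", "Kenya"}, {"Obama", "Mombasa"}))  # 0.5
```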