krishnaswamy
Dude, where's my utterance? Evaluating the effects of automatic segmentation and transcription on CPS detection
Venkatesha, Videep, Bradford, Mariah, Blanchard, Nathaniel
Collaborative Problem-Solving (CPS) markers capture key aspects of effective teamwork, such as staying on task, avoiding interruptions, and generating constructive ideas. An AI system that reliably detects these markers could help teachers identify when a group is struggling or demonstrating productive collaboration. Such a system requires an automated pipeline composed of multiple components. In this work, we evaluate how CPS detection is impacted by automating two critical components: transcription and speech segmentation. On the public Weights Task Dataset (WTD), we find CPS detection performance with automated transcription and segmentation methods is comparable to human-segmented and manually transcribed data; however, we find the automated segmentation methods reduces the number of utterances by 26.5%, impacting the the granularity of the data. We discuss the implications for developing AI-driven tools that support collaborative learning in classrooms.
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- North America > United States > Colorado > Larimer County > Fort Collins (0.04)
- Europe > Spain > Galicia > Madrid (0.04)
- (2 more...)
- Education (0.94)
- Government > Regional Government > North America Government > United States Government (0.69)
InfoGain Wavelets: Furthering the Design of Diffusion Wavelets for Graph-Structured Data
Johnson, David R., Krishnaswamy, Smita, Perlmutter, Michael
Diffusion wavelets extract information from graph signals at different scales of resolution by utilizing graph diffusion operators raised to various powers, known as diffusion scales. Traditionally, the diffusion scales are chosen to be dyadic integers, $\mathbf{2^j}$. Here, we propose a novel, unsupervised method for selecting the diffusion scales based on ideas from information theory. We then show that our method can be incorporated into wavelet-based GNNs via graph classification experiments.
- North America > United States > Idaho > Ada County > Boise (0.04)
- North America > United States > Connecticut > New Haven County > New Haven (0.04)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Health & Medicine > Therapeutic Area > Oncology (0.93)
Speech Is Not Enough: Interpreting Nonverbal Indicators of Common Knowledge and Engagement
Palmer, Derek, Zhu, Yifan, Lai, Kenneth, VanderHoeven, Hannah, Bradford, Mariah, Khebour, Ibrahim, Mabrey, Carlos, Fitzgerald, Jack, Krishnaswamy, Nikhil, Palmer, Martha, Pustejovsky, James
Our goal is to develop an AI Partner that can provide support for group problem solving and social dynamics. In multi-party working group environments, multimodal analytics is crucial for identifying non-verbal interactions of group members. In conjunction with their verbal participation, this creates an holistic understanding of collaboration and engagement that provides necessary context for the AI Partner. In this demo, we illustrate our present capabilities at detecting and tracking nonverbal behavior in student task-oriented interactions in the classroom, and the implications for tracking common ground and engagement.
- North America > United States > Colorado > Boulder County > Boulder (0.04)
- Europe > Norway > Central Norway > Trøndelag > Trondheim (0.04)
- Europe > France (0.04)
Computational Thought Experiments for a More Rigorous Philosophy and Science of the Mind
Oved, Iris, Krishnaswamy, Nikhil, Pustejovsky, James, Hartshorne, Joshua
We offer philosophical motivations for a method we call Virtual World Cognitive Science (VW CogSci), in which researchers use virtual embodied agents that are embedded in virtual worlds to explore questions in the field of Cognitive Science. We focus on questions about mental and linguistic representation and the ways that such computational modeling can add rigor to philosophical thought experiments, as well as the terminology used in the scientific study of such representations. We find that this method forces researchers to take a god's-eye view when describing dynamical relationships between entities in minds and entities in an environment in a way that eliminates the need for problematic talk of belief and concept types, such as the belief that cats are silly, and the concept CAT, while preserving belief and concept tokens in individual cognizers' minds. We conclude with some further key advantages of VW CogSci for the scientific study of mental and linguistic representation and for Cognitive Science more broadly.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- (8 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Cognitive Science > Cognitive Architectures (0.76)
Exploring Failure Cases in Multimodal Reasoning About Physical Dynamics
Ghaffari, Sadaf, Krishnaswamy, Nikhil
In this paper, we present an exploration of LLMs' abilities to problem solve with physical reasoning in situated environments. We construct a simple simulated environment and demonstrate examples of where, in a zero-shot setting, both text and multimodal LLMs display atomic world knowledge about various objects but fail to compose this knowledge in correct solutions for an object manipulation and placement task. We also use BLIP, a vision-language model trained with more sophisticated cross-modal attention, to identify cases relevant to object physical properties that that model fails to ground. Finally, we present a procedure for discovering the relevant properties of objects in the environment and propose a method to distill this knowledge back into the LLM.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > United States > Colorado > Larimer County > Fort Collins (0.04)
Manifold Filter-Combine Networks
Chew, Joyce, De Brouwer, Edward, Krishnaswamy, Smita, Needell, Deanna, Perlmutter, Michael
We introduce a class of manifold neural networks (MNNs) that we call Manifold Filter-Combine Networks (MFCNs), which aims to further our understanding of MNNs, analogous to how the aggregate-combine framework helps with the understanding of graph neural networks (GNNs). This class includes a wide variety of subclasses that can be thought of as the manifold analog of various popular GNNs. We then consider a method, based on building a data-driven graph, for implementing such networks when one does not have global knowledge of the manifold, but merely has access to finitely many points sampled from some probability distribution on the manifold. We provide sufficient conditions for the network to provably converge to its continuum limit as the number of sample points tends to infinity. Unlike previous work (which focused on specific graph constructions and assumed that the data was drawn from the uniform distribution), our rate of convergence does not directly depend on the number of filters used. Moreover, it exhibits linear dependence on the depth of the network rather than the exponential dependence obtained previously. Additionally, we provide several examples of interesting subclasses of MFCNs and of the rates of convergence that are obtained under specific graph constructions.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > Idaho > Ada County > Boise (0.04)
- North America > United States > Connecticut > New Haven County > New Haven (0.04)
- Research Report (0.64)
- Overview (0.46)
Grounding and Distinguishing Conceptual Vocabulary Through Similarity Learning in Embodied Simulations
Ghaffari, Sadaf, Krishnaswamy, Nikhil
We present a novel method for using agent experiences gathered through an embodied simulation to ground contextualized word vectors to object representations. We use similarity learning to make comparisons between different object types based on their properties when interacted with, and to extract common features pertaining to the objects' behavior. We then use an affine transformation to calculate a projection matrix that transforms contextualized word vectors from different transformer-based language models into this learned space, and evaluate whether new test instances of transformed token vectors identify the correct concept in the object embedding space. Our results expose properties of the embedding spaces of four different transformer models and show that grounding object token vectors is usually more helpful to grounding verb and attribute token vectors than the reverse, which reflects earlier conclusions in the analogical reasoning and psycholinguistic literature.
- North America > United States > Colorado > Larimer County > Fort Collins (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
- Asia > China > Hong Kong (0.04)
- Research Report > Promising Solution (0.34)
- Research Report > New Finding (0.34)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.88)
Krishnaswamy
We describe a predictive modeling and decision support framework that integrates machine learning and optimization for personalized clinical decision support. We pilot the approach on data from a congestive heart failure patient cohort, and demonstrate the ability to predict and optimize readmission risk in a clinically meaningful manner.
Schools Look for Help From AI Teacher's Assistants
ProJo can also help students work together and assess their growth and weaknesses, in both robot form and on a computer screen. It is one of a variety of teaching aids in development, boosted by artificial intelligence, that scientists and educators say could support tomorrow's classrooms. Typically, AI education products serve one function, such as assessing a student's literacy, tailoring tools to individual learners or performing administrative functions such as grading. Next-generation tools may do all of this in a single platform, serving at times as a peer learning partner, a group facilitator and a monitor for educators--a sort of superpowered teacher's assistant personalized for each student. A look at how innovation and technology are transforming the way we live, work and play.
- North America > United States > Massachusetts (0.05)
- North America > United States > Colorado > Boulder County > Longmont (0.05)
Neurosymbolic AI for Situated Language Understanding
Krishnaswamy, Nikhil, Pustejovsky, James
In recent years, data-intensive AI, particularly the domain of natural language processing and understanding, has seen significant progress driven by the advent of large datasets and deep neural networks that have sidelined more classic AI approaches to the field. These systems can apparently demonstrate sophisticated linguistic understanding or generation capabilities, but often fail to transfer their skills to situations they have not encountered before. We argue that computational situated grounding provides a solution to some of these learning challenges by creating situational representations that both serve as a formal model of the salient phenomena, and contain rich amounts of exploitable, task-appropriate data for training new, flexible computational models. Our model reincorporates some ideas of classic AI into a framework of neurosymbolic intelligence, using multimodal contextual modeling of interactive situations, events, and object properties. We discuss how situated grounding provides diverse data and multiple levels of modeling for a variety of AI learning challenges, including learning how to interact with object affordances, learning semantics for novel structures and configurations, and transferring such learned knowledge to new objects and situations.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- North America > United States > Massachusetts > Middlesex County > Waltham (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- (3 more...)
- Government > Regional Government > North America Government > United States Government (0.68)
- Government > Military (0.68)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)