Chhaya, Niyati
Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs
Haq, Saiful, Chhaya, Niyati, Pandey, Piyush, Bhattacharyya, Pushpak
In this paper, we present an investigative study on how Mental Sets influence the reasoning capabilities of LLMs. LLMs have excelled in diverse natural language processing (NLP) tasks, driven by advancements in parameter-efficient fine-tuning (PEFT) and emergent capabilities like in-context learning (ICL). For complex reasoning tasks, selecting the right model for PEFT or ICL is critical, and this choice often relies on scores on benchmarks such as MMLU, MATH, and GSM8K. However, current evaluation methods, based on metrics like F1 score or reasoning-chain assessments by larger models, overlook a key dimension: adaptability to unfamiliar situations and the ability to overcome entrenched thinking patterns. In cognitive psychology, a Mental Set refers to the tendency to persist with previously successful strategies even when they become inefficient, a challenge for problem solving and reasoning. We compare the performance of LLMs such as Llama-3.1-8B-Instruct, Llama-3.1-70B-Instruct, and GPT-4o in the presence of mental sets. To the best of our knowledge, this is the first study to integrate cognitive psychology concepts into the evaluation of LLMs for complex reasoning tasks, providing deeper insights into their adaptability and problem-solving efficacy.
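The notion of a Mental Set can be illustrated with the classic Luchins water-jar setup: prime a solver with problems that all yield to one fixed recipe, then test on a problem that also admits a much simpler answer. The sketch below is a hypothetical illustration of that idea, not the paper's actual protocol; `generate` stands in for an unspecified LLM client.

```python
# Illustrative sketch (not the paper's protocol) of probing for a mental set.
# The priming examples all follow the fixed recipe B - A - 2*C; the test item
# is also solvable by the much simpler A - C. Persisting with the primed
# recipe is the mental-set effect being studied.
# `generate` is a placeholder for whatever LLM client is in use.
PRIMING_EXAMPLES = [
    "Jars A=21, B=127, C=3; target 100. Solution: B - A - 2*C = 100.",
    "Jars A=14, B=163, C=25; target 99. Solution: B - A - 2*C = 99.",
    "Jars A=18, B=43, C=10; target 5. Solution: B - A - 2*C = 5.",
]
# Solvable by the primed recipe (49 - 23 - 2*3 = 20), but A - C = 20 is simpler.
TEST_ITEM = "Jars A=23, B=49, C=3; target 20. Solution:"

def build_probe() -> str:
    return "\n".join(PRIMING_EXAMPLES + [TEST_ITEM])

def is_trapped(model_answer: str) -> bool:
    # Crude heuristic: did the model reuse the primed three-jar recipe
    # instead of the shorter A - C solution?
    return "B" in model_answer and "2*C" in model_answer

# response = generate(build_probe())   # placeholder LLM call
# print(is_trapped(response))
```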
"Let's not Quote out of Context": Unified Vision-Language Pretraining for Context Assisted Image Captioning
Kalarani, Abisek Rajakumar, Bhattacharyya, Pushpak, Chhaya, Niyati, Shekhar, Sumit
Well-formed, context-aware image captions and tags in enterprise content such as marketing material are critical to ensure brand presence and content recall. Manually creating and updating them is non-trivial given the scale and tedium of the task. We propose a new unified Vision-Language (VL) model based on the One For All (OFA) model, with a focus on context-assisted image captioning, where the caption is generated from both the image and its context. Our approach aims to overcome the context-independent nature (image and text are treated independently) of existing approaches. We exploit context by pretraining our model on datasets for three tasks: news image captioning, where the news article is the context; contextual visual entailment; and keyword extraction from the context. The second pretraining task is a new VL task, and we construct and release two datasets for it with 1.1M and 2.2K data instances. Our system achieves state-of-the-art results with an improvement of up to 8.34 CIDEr points on the benchmark news image captioning datasets. To the best of our knowledge, ours is the first effort to incorporate contextual information in pretraining models for VL tasks.
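The three context-based pretraining tasks lend themselves to a single text-to-text formulation in the spirit of OFA-style unified models. The sketch below is an illustrative guess at such preprocessing; the prompt templates and field names are assumptions, not the paper's actual pipeline.

```python
# Schematic sketch: cast the three context-based pretraining tasks into one
# unified sequence-to-sequence format. Prompts and field names are
# illustrative assumptions.
def to_seq2seq(example):
    task = example["task"]
    if task == "news_captioning":          # context = news article, target = caption
        src = f"caption the image given the article: {example['article']}"
        tgt = example["caption"]
    elif task == "contextual_entailment":  # does the context entail a statement about the image?
        src = (f"does the context entail the statement? "
               f"context: {example['context']} statement: {example['hypothesis']}")
        tgt = example["label"]             # e.g. "yes" / "no"
    elif task == "keyword_extraction":     # pull salient keywords from the context
        src = f"extract keywords from the context: {example['context']}"
        tgt = ", ".join(example["keywords"])
    else:
        raise ValueError(f"unknown task: {task}")
    return {"image": example.get("image"), "source": src, "target": tgt}
```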
Variational Fusion for Multimodal Sentiment Analysis
Majumder, Navonil, Poria, Soujanya, Krishnamurthy, Gangeshwar, Chhaya, Niyati, Mihalcea, Rada, Gelbukh, Alexander
This is important, as more and more enterprises tend to make business decisions based on the user sentiment behind their products as expressed through these videos. Multimodal fusion is considered a key step in multimodal sentiment analysis. Most recent work on multimodal fusion (Poria et al., 2017; Zadeh et al., 2018c) has focused on the strategy of obtaining a multimodal representation from the independent unimodal representations. Our approach takes this strategy one step further, by also requiring that the original unimodal representations be reconstructed from the unified multimodal representation. The motivation behind this is the intuition that different modalities are an expression of the state of the mind. Hence, if we assume that the fused representation is the mind-state/sentiment/emotion, then in our approach we are ensuring that the fused representation can be mapped back to the unimodal representations, which should improve the quality of the multimodal representation. In this paper, we empirically argue that this is the case by showing that this approach outperforms the state-of-the-art in multimodal fusion. We employ a variational autoencoder (VAE) (Kingma and Welling, 2014), where the encoder network generates a latent representation from the unimodal representations.
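The reconstruction constraint described here can be made concrete with a small sketch. The following minimal PyTorch module illustrates the general idea (fuse unimodal features into a VAE latent, reconstruct them, and predict sentiment from the fused code); the feature sizes, layer widths, and loss weighting are arbitrary assumptions rather than the paper's architecture.

```python
# Minimal PyTorch sketch of VAE-style multimodal fusion with a reconstruction
# objective. Dimensions and loss weighting are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class VariationalFusion(nn.Module):
    def __init__(self, dims=(300, 74, 35), latent_dim=64):
        super().__init__()
        total = sum(dims)                         # text + audio + visual feature sizes (assumed)
        self.enc = nn.Linear(total, 256)
        self.mu = nn.Linear(256, latent_dim)      # mean of q(z | x)
        self.logvar = nn.Linear(256, latent_dim)  # log-variance of q(z | x)
        self.dec = nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(),
                                 nn.Linear(256, total))
        self.clf = nn.Linear(latent_dim, 1)       # sentiment head on the fused code

    def forward(self, text, audio, visual):
        x = torch.cat([text, audio, visual], dim=-1)
        h = F.relu(self.enc(x))
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization
        recon = self.dec(z)                       # reconstruct the unimodal features
        kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
        recon_loss = F.mse_loss(recon, x)
        return self.clf(mu), recon_loss + kl      # prediction + auxiliary VAE loss
```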
Reports of the Workshops of the 32nd AAAI Conference on Artificial Intelligence
Bouchard, Bruno (Université du Québec à Chicoutimi) | Bouchard, Kevin (Université du Québec à Chicoutimi) | Brown, Noam (Carnegie Mellon University) | Chhaya, Niyati (Adobe Research, Bangalore) | Farchi, Eitan (IBM Research, Haifa) | Gaboury, Sebastien (Université du Québec à Chicoutimi) | Geib, Christopher (Smart Information Flow Technologies) | Gyrard, Amelie (Wright State University) | Jaidka, Kokil (University of Pennsylvania) | Keren, Sarah (Technion – Israel Institute of Technology) | Khardon, Roni (Tufts University) | Kordjamshidi, Parisa (Tulane University) | Martinez, David (MIT Lincoln Laboratory) | Mattei, Nicholas (IBM Research, TJ Watson) | Michalowski, Martin (University of Minnesota School of Nursing) | Mirsky, Reuth (Ben Gurion University) | Osborn, Joseph (Pomona College) | Sahin, Cem (MIT Lincoln Laboratory) | Shehory, Onn (Bar Ilan University) | Shaban-Nejad, Arash (University of Tennessee Health Science Center) | Sheth, Amit (Wright State University) | Shimshoni, Ilan (University of Haifa) | Shrobe, Howie (Massachusetts Institute of Technology) | Sinha, Arunesh (University of Southern California) | Sinha, Atanu R. (Adobe Research, Bangalore) | Srivastava, Biplav (IBM Research, Yorktown Heights) | Streilein, William (MIT Lincoln Laboratory) | Theocharous, Georgios (Adobe Research, San Jose) | Venable, K. Brent (Tulane University and IHMC) | Wagner, Neal (MIT Lincoln Laboratory) | Zamansky, Anna (University of Haifa)
The AAAI-18 workshop program included 15 workshops covering a wide range of topics in AI. Workshops were held Friday and Saturday, February 2–3, 2018, at the Hilton New Orleans Riverside in New Orleans, Louisiana, USA. This report contains summaries of the Affective Content Analysis workshop; the Artificial Intelligence Applied to Assistive Technologies and Smart Environments workshop; the AI and Marketing Science workshop; the Artificial Intelligence for Cyber Security workshop; the AI for Imperfect-Information Games workshop; the Declarative Learning Based Programming workshop; the Engineering Dependable and Secure Machine Learning Systems workshop; the Health Intelligence workshop; the Knowledge Extraction from Games workshop; the Plan, Activity, and Intent Recognition workshop; the Planning and Inference workshop; the Preference Handling workshop; the Reasoning and Learning for Human-Machine Dialogues workshop; and the AI Enhanced Internet of Things Data Processing for Intelligent Applications workshop.
Aff2Vec: Affect-Enriched Distributional Word Representations
Khosla, Sopan, Chhaya, Niyati, Chawla, Kushal
Human communication includes information, opinions, and reactions. Reactions are often captured by the affective messages in written as well as verbal communication. While there has been work on affect modeling and, to some extent, affective content generation, the area of affective word distributions is not well studied. Synsets and lexica capture semantic relationships across words; however, these models do not encode affective or emotional interpretations of words. Our proposed model, Aff2Vec, provides a method for enriched word embeddings that are representative of affective interpretations of words. Aff2Vec outperforms the state-of-the-art in intrinsic word-similarity tasks. Further, Aff2Vec representations outperform baseline embeddings in downstream natural language understanding tasks including sentiment analysis, personality detection, and frustration prediction.
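As a rough illustration of affect enrichment, the sketch below appends lexicon-based affect scores to pretrained word vectors. Aff2Vec itself explores richer enrichment strategies; this shows only the simplest concatenation variant, and the lexicon format is an assumption.

```python
# Minimal sketch: enrich word embeddings by appending affect lexicon scores
# (e.g. valence, arousal, dominance). Lexicon format is an assumption.
import numpy as np

def affect_enrich(embeddings, affect_lexicon, affect_dim=3):
    """embeddings: {word: np.ndarray}; affect_lexicon: {word: (valence, arousal, dominance)}."""
    neutral = np.full(affect_dim, 0.5)            # fallback for words missing from the lexicon
    enriched = {}
    for word, vec in embeddings.items():
        affect = np.asarray(affect_lexicon.get(word, neutral), dtype=vec.dtype)
        enriched[word] = np.concatenate([vec, affect])   # d -> d + affect_dim
    return enriched

# Example with toy vectors:
emb = {"joy": np.random.rand(300).astype(np.float32),
       "table": np.random.rand(300).astype(np.float32)}
lex = {"joy": (0.95, 0.70, 0.60)}
assert affect_enrich(emb, lex)["joy"].shape == (303,)
```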
Editorial for the AAAI-18 Workshop on Affective Content Analysis
Chhaya, Niyati (Big Data Experience Lab, Adobe Research) | Jaidka, Kokil (University of Pennsylvania) | Ungar, Lyle H. (University of Pennsylvania)
The first AAAI-18 Workshop on Affective Content Analysis was an interdisciplinary platform that focused on the analysis of emotions, sentiments, and attitudes in textual, visual, and multimodal content for applications in psychology, consumer behavior, language understanding, and computer vision. The program comprised interdisciplinary keynotes, original research presentations, a poster session, and short pitches for datasets and pre-published work.