AITopics | Ghosh, Shreya

Collaborating Authors

Ghosh, Shreya

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

TripCraft: A Benchmark for Spatio-Temporally Fine Grained Travel Planning

Chaudhuri, Soumyabrata, Purkar, Pranav, Raghav, Ritwik, Mallick, Shubhojit, Gupta, Manish, Jana, Abhik, Ghosh, Shreya

arXiv.org Artificial IntelligenceFeb-27-2025

Recent advancements in probing Large Language Models (LLMs) have explored their latent potential as personalized travel planning agents, yet existing benchmarks remain limited in real world applicability. Existing datasets, such as TravelPlanner and TravelPlanner+, suffer from semi synthetic data reliance, spatial inconsistencies, and a lack of key travel constraints, making them inadequate for practical itinerary generation. To address these gaps, we introduce TripCraft, a spatiotemporally coherent travel planning dataset that integrates real world constraints, including public transit schedules, event availability, diverse attraction categories, and user personas for enhanced personalization. To evaluate LLM generated plans beyond existing binary validation methods, we propose five continuous evaluation metrics, namely Temporal Meal Score, Temporal Attraction Score, Spatial Score, Ordering Score, and Persona Score which assess itinerary quality across multiple dimensions. Our parameter informed setting significantly enhances meal scheduling, improving the Temporal Meal Score from 61% to 80% in a 7 day scenario. TripCraft establishes a new benchmark for LLM driven personalized travel planning, offering a more realistic, constraint aware framework for itinerary generation. Dataset and Codebase will be made publicly available upon acceptance.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2502.20508

Country: North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (0.64)

Industry: Consumer Products & Services > Travel (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

A Racing Dataset and Baseline Model for Track Detection in Autonomous Racing

Ghosh, Shreya, Chen, Yi-Huan, Huang, Ching-Hsiang, Jameel, Abu Shafin Mohammad Mahdee, Ho, Chien Chou, Gamal, Aly El, Labi, Samuel

arXiv.org Artificial IntelligenceFeb-19-2025

--A significant challenge in racing-related research is the lack of publicly available datasets containing raw images with corresponding annotations for the downstream task. In this paper, we introduce RoRaTrack, a novel dataset that contains annotated multi-camera image data from racing scenarios for track detection. The data is collected on a Dallara A V-21 at a racing circuit in Indiana, in collaboration with the Indy Autonomous Challenge (IAC). RoRaTrack addresses common problems such as blurriness due to high speed, color inversion from the camera, and absence of lane markings on the track. Consequently, we propose RaceGAN, a baseline model based on a Generative Adversarial Network (GAN) that effectively addresses these challenges. The proposed model demonstrates superior performance compared to current state-of-the-art machine learning models in track detection. The dataset and code for this work are available at github.com/RaceGAN. Modern vehicles are increasingly equipped with a range of computer vision technologies to assist drivers and improve road safety. A critical application of these technologies, particularly for autonomous and self-driving vehicles, is lane detection, which ensures that vehicles remain within designated lanes [1]. Lane detection systems not only help maintain proper lane alignment, but also provide visual cues to drivers about lane boundaries. Similarly, autonomous technologies are being integrated into race cars, giving rise to the emerging field of autonomous racing. In this domain, vehicles operate entirely without human intervention, relying solely on artificial intelligence and computer vision algorithms [2].

artificial intelligence, detection, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2502.14068

Country: North America > United States > Indiana (0.34)

Genre: Research Report (0.84)

Industry: Leisure & Entertainment > Sports > Motorsports (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

UNITE-FND: Reframing Multimodal Fake News Detection through Unimodal Scene Translation

Mukherjee, Arka, Ghosh, Shreya

arXiv.org Artificial IntelligenceFeb-16-2025

Multimodal fake news detection typically demands complex architectures and substantial computational resources, posing deployment challenges in real-world settings. We introduce UNITE-FND, a novel framework that reframes multimodal fake news detection as a unimodal text classification task. We propose six specialized prompting strategies with Gemini 1.5 Pro, converting visual content into structured textual descriptions, and enabling efficient text-only models to preserve critical visual information. To benchmark our approach, we introduce Uni-Fakeddit-55k, a curated dataset family of 55,000 samples each, each processed through our multimodal-to-unimodal translation framework. Experimental results demonstrate that UNITE-FND achieves 92.52% accuracy in binary classification, surpassing prior multimodal models while reducing computational costs by over 10x (TinyBERT variant: 14.5M parameters vs. 250M+ in SOTA models). Additionally, we propose a comprehensive suite of five novel metrics to evaluate image-to-text conversion quality, ensuring optimal information preservation. Our results demonstrate that structured text-based representations can replace direct multimodal processing with minimal loss of accuracy, making UNITE-FND a practical and scalable alternative for resource-constrained environments.

data mining, large language model, machine learning, (24 more...)

arXiv.org Artificial Intelligence

2502.11132

Country:

North America > United States (0.46)
Europe (0.46)

Genre: Research Report > New Finding (0.88)

Industry: Media > News (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Add feedback

Exploring Language Model Generalization in Low-Resource Extractive QA

Sengupta, Saptarshi, Yin, Wenpeng, Nakov, Preslav, Ghosh, Shreya, Wang, Suhang

arXiv.org Artificial IntelligenceDec-12-2024

In this paper, we investigate Extractive Question Answering (EQA) with Large Language Models (LLMs) under domain drift, i.e., can LLMs generalize to domains that require specific knowledge such as medicine and law in a zero-shot fashion without additional in-domain training? To this end, we devise a series of experiments to explain the performance gap empirically. Our findings suggest that: (a) LLMs struggle with dataset demands of closed domains such as retrieving long answer spans; (b) Certain LLMs, despite showing strong overall performance, display weaknesses in meeting basic requirements as discriminating between domain-specific senses of words which we link to pre-processing decisions; (c) Scaling model parameters is not always effective for cross domain generalization; and (d) Closed-domain datasets are quantitatively much different than open-domain EQA datasets and current LLMs struggle to deal with them. Our findings point out important directions for improving existing LLMs.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2409.18446

Country:

Europe (1.00)
North America > United States > Texas > Harris County > Houston (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (1.00)
Education (1.00)
Leisure & Entertainment (0.94)
Media > Music (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Machine Learning to Detect Anxiety Disorders from Error-Related Negativity and EEG Signals

Chandrasekar, Ramya, Hasan, Md Rakibul, Ghosh, Shreya, Gedeon, Tom, Hossain, Md Zakir

arXiv.org Artificial IntelligenceSep-16-2024

Anxiety is endemic to every person, with an occurrence rate of approximately 20% [World Health Organization, 2017]. Between 2020 and 2022, over one in six people (17.2% or 3.4 million people) aged 16 to 85 years experienced an anxiety disorder [Australian Bureau of Statistics]. Anxiety is caused by changes in the situation, nervousness and common symptoms, including sweating, trembling and excessive worrying, which affect a person's daily life. Anxiety disorders encompass a range of conditions, such as generalised anxiety disorder (GAD), panic disorder (PD), social anxiety disorder (SAD), obsessive-compulsive disorder (OCD), various phobia-related disorders, physical pain related protective behaviour [Li et al., 2020, 2021] and depression [Ghosh and Anwar, 2021]. Current clinical approaches for diagnosing these disorders often suffer from limitations in accuracy and objectivity, relying heavily on self-reports, patient histories and clinical observations. These methods can be subjective and may not capture the nuanced neural and behavioural patterns associated with anxiety, leading to potential misdiagnoses. Recent research has shown promising results in using machine learning techniques to detect anxiety through physiological analysis [Abd-Alrazaq et al., 2023], such as respiration, electrocardiogram (ECG), photoplethysmography (PPG), electrodermal response (EDA) and electroencephalography (EEG), to identify patterns associated with anxiety states [Abd-Alrazaq et al., 2023].

artificial intelligence, disorder, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2410.00028

Country:

Asia (0.68)
Oceania > Australia (0.48)
Europe (0.46)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Anxiety Disorder (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

FunnelNet: An End-to-End Deep Learning Framework to Monitor Digital Heart Murmur in Real-Time

Jobayer, Md, Shawon, Md. Mehedi Hasan, Hasan, Md Rakibul, Ghosh, Shreya, Gedeon, Tom, Hossain, Md Zakir

arXiv.org Artificial IntelligenceMay-9-2024

Objective: Heart murmurs are abnormal sounds caused by turbulent blood flow within the heart. Several diagnostic methods are available to detect heart murmurs and their severity, such as cardiac auscultation, echocardiography, phonocardiogram (PCG), etc. However, these methods have limitations, including extensive training and experience among healthcare providers, cost and accessibility of echocardiography, as well as noise interference and PCG data processing. This study aims to develop a novel end-to-end real-time heart murmur detection approach using traditional and depthwise separable convolutional networks. Methods: Continuous wavelet transform (CWT) was applied to extract meaningful features from the PCG data. The proposed network has three parts: the Squeeze net, the Bottleneck, and the Expansion net. The Squeeze net generates a compressed data representation, whereas the Bottleneck layer reduces computational complexity using a depthwise-separable convolutional network. The Expansion net is responsible for up-sampling the compressed data to a higher dimension, capturing tiny details of the representative data. Results: For evaluation, we used four publicly available datasets and achieved state-of-the-art performance in all datasets. Furthermore, we tested our proposed network on two resource-constrained devices: a Raspberry PI and an Android device, stripping it down into a tiny machine learning model (TinyML), achieving a maximum of 99.70%. Conclusion: The proposed model offers a deep learning framework for real-time accurate heart murmur detection within limited resources. Significance: It will significantly result in more accessible and practical medical services and reduced diagnosis time to assist medical professionals. The code is publicly available at TBA.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2405.0957

Country:

Europe (0.93)
North America > United States > California (0.14)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Milestones in Bengali Sentiment Analysis leveraging Transformer-models: Fundamentals, Challenges and Future Directions

Sengupta, Saptarshi, Ghosh, Shreya, Mitra, Prasenjit, Tamiti, Tarikul Islam

arXiv.org Artificial IntelligenceJan-15-2024

Sentiment Analysis (SA) refers to the task of associating a view polarity (usually, positive, negative, or neutral; or even fine-grained such as slightly angry, sad, etc.) to a given text, essentially breaking it down to a supervised (since we have the view labels apriori) classification task. Although heavily studied in resource-rich languages such as English thus pushing the SOTA by leaps and bounds, owing to the arrival of the Transformer architecture, the same cannot be said for resource-poor languages such as Bengali (BN). For a language spoken by roughly 300 million people, the technology enabling them to run trials on their favored tongue is severely lacking. In this paper, we analyze the SOTA for SA in Bengali, particularly, Transformer-based models. We discuss available datasets, their drawbacks, the nuances associated with Bengali i.e. what makes this a challenging language to apply SA on, and finally provide insights for future direction to mitigate the limitations in the field.

computational linguistic, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2401.07847

Country:

Asia (0.93)
Europe (0.68)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Overview (1.00)

Industry: Information Technology (0.68)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
(2 more...)

Add feedback

Improving Transferability of Network Intrusion Detection in a Federated Learning Setup

Ghosh, Shreya, Jameel, Abu Shafin Mohammad Mahdee, Gamal, Aly El

arXiv.org Artificial IntelligenceJan-7-2024

Network Intrusion Detection Systems (IDS) aim to detect the presence of an intruder by analyzing network packets arriving at an internet connected device. Data-driven deep learning systems, popular due to their superior performance compared to traditional IDS, depend on availability of high quality training data for diverse intrusion classes. A way to overcome this limitation is through transferable learning, where training for one intrusion class can lead to detection of unseen intrusion classes after deployment. In this paper, we provide a detailed study on the transferability of intrusion detection. We investigate practical federated learning configurations to enhance the transferability of intrusion detection. We propose two techniques to significantly improve the transferability of a federated intrusion detection system. The code for this work can be found at https://github.com/ghosh64/transferability.

artificial intelligence, deep learning, machine learning, (12 more...)

arXiv.org Artificial Intelligence

2401.0356

Genre: Research Report (0.50)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Study on Transferability of Deep Learning Models for Network Intrusion Detection

Ghosh, Shreya, Jameel, Abu Shafin Mohammad Mahdee, Gamal, Aly El

arXiv.org Artificial IntelligenceDec-17-2023

In this paper, we explore transferability in learning between different attack classes in a network intrusion detection setup. We evaluate transferability of attack classes by training a deep learning model with a specific attack class and testing it on a separate attack class. We observe the effects of real and synthetically generated data augmentation techniques on transferability. We investigate the nature of observed transferability relationships, which can be either symmetric or asymmetric. We also examine explainability of the transferability relationships using the recursive feature elimination algorithm. We study data preprocessing techniques to boost model performance. The code for this work can be found at https://github.com/ghosh64/transferability.

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2312.1155

Genre: Research Report (1.00)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Empathy Detection Using Machine Learning on Text, Audiovisual, Audio or Physiological Signals

Hasan, Md Rakibul, Hossain, Md Zakir, Ghosh, Shreya, Soon, Susannah, Gedeon, Tom

arXiv.org Artificial IntelligenceOct-30-2023

Empathy is a social skill that indicates an individual's ability to understand others. Over the past few years, empathy has drawn attention from various disciplines, including but not limited to Affective Computing, Cognitive Science and Psychology. Empathy is a context-dependent term; thus, detecting or recognising empathy has potential applications in society, healthcare and education. Despite being a broad and overlapping topic, the avenue of empathy detection studies leveraging Machine Learning remains underexplored from a holistic literature perspective. To this end, we systematically collect and screen 801 papers from 10 well-known databases and analyse the selected 54 papers. We group the papers based on input modalities of empathy detection systems, i.e., text, audiovisual, audio and physiological signals. We examine modality-specific pre-processing and network architecture design protocols, popular dataset descriptions and availability details, and evaluation protocols. We further discuss the potential applications, deployment challenges and research gaps in the Affective Computing-based empathy domain, which can facilitate new avenues of exploration. We believe that our work is a stepping stone to developing a privacy-preserving and unbiased empathic system inclusive of culture, diversity and multilingualism that can be deployed in practice to enhance the overall well-being of human life.

artificial intelligence, empathy detection, physiological signal, (3 more...)

arXiv.org Artificial Intelligence

2311.00721

Genre: Research Report (0.40)

Industry: Health & Medicine (0.89)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (0.73)
Information Technology > Artificial Intelligence > Machine Learning (0.60)

Add feedback