AITopics | csa

Collaborating Authors

csa

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

In this Supplementary Material we first present details of the Shapley value sampling Appendix A)

Neural Information Processing SystemsAug-13-2025, 16:14:31 GMT

In this section, we introduce the details of the Shapley value sampling. We sample the Shapley value for models trained on CIFAR10, CIFAR100 and ImageNet. For CIFAR10 and CIFAR100, we employ ResNet-18 and train them ourselves. For ImageNet, we employ standard ResNet-50 provided in robustbench [11]. We demonstrate more Shapley V alue Quantification Results on ImageNet and TinyImageNet in Figure 1 and Figure 1.

artificial intelligence, machine learning, shapley value, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Game Theory (0.95)
Information Technology > Artificial Intelligence > Machine Learning (0.70)

Add feedback

FlashBack: Consistency Model-Accelerated Shared Autonomy

Sun, Luzhe, Ji, Jingtian, Tan, Xiangshan, Walter, Matthew R.

arXiv.org Artificial IntelligenceMay-28-2025

Shared autonomy is an enabling technology that provides users with control authority over robots that would otherwise be difficult if not impossible to directly control. Yet, standard methods make assumptions that limit their adoption in practice-for example, prior knowledge of the user's goals or the objective (i.e., reward) function that they wish to optimize, knowledge of the user's policy, or query-level access to the user during training. Diffusion-based approaches to shared autonomy do not make such assumptions and instead only require access to demonstrations of desired behaviors, while allowing the user to maintain control authority. However, these advantages have come at the expense of high computational complexity, which has made real-time shared autonomy all but impossible. To overcome this limitation, we propose Consistency Shared Autonomy (CSA), a shared autonomy framework that employs a consistency model-based formulation of diffusion. Key to CSA is that it employs the distilled probability flow of ordinary differential equations (PF ODE) to generate high-fidelity samples in a single step. This results in inference speeds significantly than what is possible with previous diffusion-based approaches to shared autonomy, enabling real-time assistance in complex domains with only a single function evaluation. Further, by intervening on flawed actions at intermediate states of the PF ODE, CSA enables varying levels of assistance. We evaluate CSA on a variety of challenging simulated and real-world robot control problems, demonstrating significant improvements over state-of-the-art methods both in terms of task performance and computational efficiency.

artificial intelligence, arxiv preprint arxiv, autonomy, (15 more...)

arXiv.org Artificial Intelligence

2505.16892

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents

Zeng, Jingying, Liu, Hui, Dai, Zhenwei, Tang, Xianfeng, Luo, Chen, Varshney, Samarth, Li, Zhen, He, Qi

arXiv.org Artificial IntelligenceMar-9-2025

With the advancement of conversational large language models (LLMs), several LLM-based Conversational Shopping Agents (CSA) have been developed to help customers answer questions and smooth their shopping journey in e-commerce domain. The primary objective in building a trustworthy CSA is to ensure the agent's responses are accurate and factually grounded, which is essential for building customer trust and encouraging continuous engagement. However, two challenges remain. First, LLMs produce hallucinated or unsupported claims. Such inaccuracies risk spreading misinformation and diminishing customer trust. Second, without providing knowledge source attribution in CSA response, customers struggle to verify LLM-generated information. To address these challenges, we present an easily productionized solution that enables a "citation experience" utilizing In-context Learning (ICL) and Multi-UX-Inference (MUI) to generate responses with citations to attribute its original sources without interfering other existing UX features. With proper UX design, these citation marks can be linked to the related product information and display the source to our customers. In this work, we also build auto-metrics and scalable benchmarks to holistically evaluate LLM's grounding and attribution capabilities. Our experiments demonstrate that incorporating this citation generation paradigm can substantially enhance the grounding of LLM responses by 13.83% on the real-world data. As such, our solution not only addresses the immediate challenges of LLM grounding issues but also adds transparency to conversational AI.

citation generation paradigm, information, llm, (13 more...)

arXiv.org Artificial Intelligence

2503.0483

Genre: Research Report (0.50)

Industry: Information Technology > Services > e-Commerce Services (0.71)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

Li, Po-han, Chinchali, Sandeep P., Topcu, Ufuk

arXiv.org Artificial IntelligenceNov-25-2024

Multimodal encoders like CLIP excel in tasks such as zero-shot image classification and cross-modal retrieval. However, they require excessive training data. We propose canonical similarity analysis (CSA), which uses two unimodal encoders to replicate multimodal encoders using limited data. CSA maps unimodal features into a multimodal space, using a new similarity score to retain only the multimodal information. CSA only involves the inference of unimodal encoders and a cubic-complexity matrix decomposition, eliminating the need for extensive GPU-based model training. Experiments show that CSA outperforms CLIP while requiring $300,000\times$ fewer multimodal data pairs and $6\times$ fewer unimodal data for ImageNet classification and misinformative news captions detection. CSA surpasses the state-of-the-art method to map unimodal features to multimodal features. We also demonstrate the ability of CSA with modalities beyond image and text, paving the way for future modality pairs with limited paired multimodal data but abundant unpaired unimodal data, such as lidar and text.

csa, encoder, unimodal encoder, (15 more...)

arXiv.org Artificial Intelligence

2410.0761

Country:

North America > United States > Texas > Bexar County > San Antonio (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)
Information Technology > Artificial Intelligence > Robots (0.93)
(3 more...)

Add feedback

CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks

Nikzad, Nick, Gao, Yongsheng, Zhou, Jun

arXiv.org Artificial IntelligenceMay-13-2024

In recent years, convolutional neural networks (CNNs) with channel-wise feature refining mechanisms have brought noticeable benefits to modelling channel dependencies. However, current attention paradigms fail to infer an optimal channel descriptor capable of simultaneously exploiting statistical and spatial relationships among feature maps. In this paper, to overcome this shortcoming, we present a novel channel-wise spatially autocorrelated (CSA) attention mechanism. Inspired by geographical analysis, the proposed CSA exploits the spatial relationships between channels of feature maps to produce an effective channel descriptor. To the best of our knowledge, this is the f irst time that the concept of geographical spatial analysis is utilized in deep CNNs. The proposed CSA imposes negligible learning parameters and light computational overhead to the deep model, making it a powerful yet efficient attention module of choice. We validate the effectiveness of the proposed CSA networks (CSA-Nets) through extensive experiments and analysis on ImageNet, and MS COCO benchmark datasets for image classification, object detection, and instance segmentation. The experimental results demonstrate that CSA-Nets are able to consistently achieve competitive performance and superior generalization than several state-of-the-art attention-based CNNs over different benchmark tasks and datasets.

attention module, autocorrelation, spatial autocorrelation, (14 more...)

arXiv.org Artificial Intelligence

2405.05755

Country: Oceania > Australia (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Details of the Shapley Value Sampling

Neural Information Processing SystemsFeb-18-2024, 05:20:30 GMT

Then we give more experimental results on CIFAR-100 and stability analysis of Shapley value (Appendix B). Finally, we add properties of the Shapley value and proof of decomposition of CNNs in frequency domain (Appendix D). In this section, we introduce the details of the Shapley value sampling. A.1 Details of the Model for the Shapley Value Sampling We sample the Shapley value for models trained on CIFAR10, CIFAR100 and ImageNet. For CIFAR10 and CIFAR100, we employ ResNet-18 and train them ourselves.

artificial intelligence, machine learning, shapley value, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.69)

Add feedback

Explainable Benchmarking for Iterative Optimization Heuristics

van Stein, Niki, Vermetten, Diederick, Kononova, Anna V., Bäck, Thomas

arXiv.org Artificial IntelligenceJan-31-2024

Traditional benchmarking methods are often used to evaluate algorithms in isolation, with a single algorithm configuration (hyper-parameter setting) or with a limited set of a few variations against a limited set of state-of-the-art algorithms, leading to limited insights into their comparative performance and practical applicability. This study addresses these challenges by employing modular optimization approaches and explainable AI techniques in order to derive insights into the algorithmic behaviour of a large set of algorithm components (modules) and their hyper-parameters. Modular optimization frameworks allow for the comparison of various modifications on a core algorithm, facilitating a deeper understanding of each component's influence on the algorithm's performance in different scenarios. There is already a wide variety of modular algorithm frameworks available, but their application for explicit explainability of the various algorithmic components and settings has been relatively unexplored. This paper aims to bridge this gap by providing a comprehensive framework for explainable benchmarking in iterative optimization heuristics and by providing a software library (IOH-Xplainer) to facilitate researchers to use the proposed framework.

algorithm, configuration, default, (14 more...)

arXiv.org Artificial Intelligence

2401.17842

Country:

Europe > Netherlands > South Holland > Leiden (0.05)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Virginia > Fairfax County > Fairfax (0.04)
(6 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Add feedback

Sluggish and Chemically-Biased Interstitial Diffusion in Concentrated Solid Solution Alloys: Mechanisms and Methods

Xu, Biao, Fu, Haijun, Huang, Shasha, Ma, Shihua, Xiong, Yaoxu, Zhang, Jun, Xiang, Xuepeng, Lu, Wenyu, Kai, Ji-Jung, Zhao, Shijun

arXiv.org Artificial IntelligenceNov-28-2023

Interstitial diffusion is a pivotal process that governs the phase stability and irradiation response of materials in non-equilibrium conditions. In this work, we study sluggish and chemically-biased interstitial diffusion in Fe-Ni concentrated solid solution alloys (CSAs) by combining machine learning (ML) and kinetic Monte Carlo (kMC), where ML is used to accurately and efficiently predict the migration energy barriers on-the-fly. The ML-kMC reproduces the diffusivity that was reported by molecular dynamics results at high temperatures. With this powerful tool, we find that the observed sluggish diffusion and the "Ni-Ni-Ni"-biased diffusion in Fe-Ni alloys are ascribed to a unique "Barrier Lock" mechanism, whereas the "Fe-Fe-Fe"-biased diffusion is influenced by a "Component Dominance" mechanism. Inspired by the mentioned mechanisms, a practical AvgS-kMC method is proposed for conveniently and swiftly determining interstitial-mediated diffusivity by only relying on the mean energy barriers of migration patterns. Combining the AvgS-kMC with the differential evolutionary algorithm, an inverse design strategy for optimizing sluggish diffusion properties is applied to emphasize the crucial role of favorable migration patterns.

artificial intelligence, evolutionary algorithm, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2311.16727

Country:

Asia > China > Hong Kong (0.05)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Causal-Based Supervision of Attention in Graph Neural Network: A Better and Simpler Choice towards Powerful Attention

Wang, Hongjun, Chen, Jiyuan, Du, Lun, Fu, Qiang, Han, Shi, Song, Xuan

arXiv.org Artificial IntelligenceJul-18-2023

Recent years have witnessed the great potential of attention mechanism in graph representation learning. However, while variants of attention-based GNNs are setting new benchmarks for numerous real-world datasets, recent works have pointed out that their induced attentions are less robust and generalizable against noisy graphs due to lack of direct supervision. In this paper, we present a new framework which utilizes the tool of causality to provide a powerful supervision signal for the learning process of attention functions. Specifically, we estimate the direct causal effect of attention to the final prediction, and then maximize such effect to guide attention attending to more meaningful neighbors. Our method can serve as a plug-and-play module for any canonical attention-based GNNs in an end-to-end fashion. Extensive experiments on a wide range of benchmark datasets illustrated that, by directly supervising attention functions, the model is able to converge faster with a clearer decision boundary, and thus yields better performances.

artificial intelligence, causal effect, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2305.13115

Country:

North America > United States > Texas (0.05)
North America > United States > Wisconsin (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(4 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Classifying Mental-Disorders through Clinicians Subjective Approach based on Three-way Decision

Wang, Huidong, Sourav, Md Sakib Ullah, Yang, Mengdi, Zhang, Jiaping

arXiv.org Artificial IntelligenceApr-7-2023

Prevalence can be seen as having a lack of motivation to live, losing interest in everything among common people. Hence, they are frequently thriving towards psychiatric diagnosis than in the past days. Therefore, improper diagnosis of mental health disorders may lead to even more vulnerable consequences in a greater sense from an individual to a social perspective [38]. The traditional form of psychiatric diagnosis is much pretentious nowadays as few recent studies demonstrate several shortcomings within the widely established systems used for classifying mental disorders, namely, bipolar disorder, anxiety disorders, phobias, substance use disorder, mood disorders, and many others [2,3]. More often these recognized tools, such as DSM-5 [7] and ICD-11 [8], fails to distinguish between the proper and correct disorder diagnosis of a complex phenomenon in individual cases.

artificial intelligence, disorder, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2301.03351

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Maryland > Montgomery County > Rockville (0.04)
Asia > China > Shanghai > Shanghai (0.04)
(3 more...)

Genre: Research Report > New Finding (0.66)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Addiction Disorder (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
Information Technology > Artificial Intelligence > Machine Learning (0.94)

Add feedback