csa
In this Supplementary Material we first present details of the Shapley value sampling Appendix A)
In this section, we introduce the details of the Shapley value sampling. We sample the Shapley value for models trained on CIFAR10, CIFAR100 and ImageNet. For CIFAR10 and CIFAR100, we employ ResNet-18 and train them ourselves. For ImageNet, we employ standard ResNet-50 provided in robustbench [11]. We demonstrate more Shapley V alue Quantification Results on ImageNet and TinyImageNet in Figure 1 and Figure 1.
FlashBack: Consistency Model-Accelerated Shared Autonomy
Sun, Luzhe, Ji, Jingtian, Tan, Xiangshan, Walter, Matthew R.
Shared autonomy is an enabling technology that provides users with control authority over robots that would otherwise be difficult if not impossible to directly control. Yet, standard methods make assumptions that limit their adoption in practice-for example, prior knowledge of the user's goals or the objective (i.e., reward) function that they wish to optimize, knowledge of the user's policy, or query-level access to the user during training. Diffusion-based approaches to shared autonomy do not make such assumptions and instead only require access to demonstrations of desired behaviors, while allowing the user to maintain control authority. However, these advantages have come at the expense of high computational complexity, which has made real-time shared autonomy all but impossible. To overcome this limitation, we propose Consistency Shared Autonomy (CSA), a shared autonomy framework that employs a consistency model-based formulation of diffusion. Key to CSA is that it employs the distilled probability flow of ordinary differential equations (PF ODE) to generate high-fidelity samples in a single step. This results in inference speeds significantly than what is possible with previous diffusion-based approaches to shared autonomy, enabling real-time assistance in complex domains with only a single function evaluation. Further, by intervening on flawed actions at intermediate states of the PF ODE, CSA enables varying levels of assistance. We evaluate CSA on a variety of challenging simulated and real-world robot control problems, demonstrating significant improvements over state-of-the-art methods both in terms of task performance and computational efficiency.
- North America > United States > Illinois > Cook County > Chicago (0.04)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents
Zeng, Jingying, Liu, Hui, Dai, Zhenwei, Tang, Xianfeng, Luo, Chen, Varshney, Samarth, Li, Zhen, He, Qi
With the advancement of conversational large language models (LLMs), several LLM-based Conversational Shopping Agents (CSA) have been developed to help customers answer questions and smooth their shopping journey in e-commerce domain. The primary objective in building a trustworthy CSA is to ensure the agent's responses are accurate and factually grounded, which is essential for building customer trust and encouraging continuous engagement. However, two challenges remain. First, LLMs produce hallucinated or unsupported claims. Such inaccuracies risk spreading misinformation and diminishing customer trust. Second, without providing knowledge source attribution in CSA response, customers struggle to verify LLM-generated information. To address these challenges, we present an easily productionized solution that enables a "citation experience" utilizing In-context Learning (ICL) and Multi-UX-Inference (MUI) to generate responses with citations to attribute its original sources without interfering other existing UX features. With proper UX design, these citation marks can be linked to the related product information and display the source to our customers. In this work, we also build auto-metrics and scalable benchmarks to holistically evaluate LLM's grounding and attribution capabilities. Our experiments demonstrate that incorporating this citation generation paradigm can substantially enhance the grounding of LLM responses by 13.83% on the real-world data. As such, our solution not only addresses the immediate challenges of LLM grounding issues but also adds transparency to conversational AI.
CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features
Li, Po-han, Chinchali, Sandeep P., Topcu, Ufuk
Multimodal encoders like CLIP excel in tasks such as zero-shot image classification and cross-modal retrieval. However, they require excessive training data. We propose canonical similarity analysis (CSA), which uses two unimodal encoders to replicate multimodal encoders using limited data. CSA maps unimodal features into a multimodal space, using a new similarity score to retain only the multimodal information. CSA only involves the inference of unimodal encoders and a cubic-complexity matrix decomposition, eliminating the need for extensive GPU-based model training. Experiments show that CSA outperforms CLIP while requiring $300,000\times$ fewer multimodal data pairs and $6\times$ fewer unimodal data for ImageNet classification and misinformative news captions detection. CSA surpasses the state-of-the-art method to map unimodal features to multimodal features. We also demonstrate the ability of CSA with modalities beyond image and text, paving the way for future modality pairs with limited paired multimodal data but abundant unpaired unimodal data, such as lidar and text.
- North America > United States > Texas > Bexar County > San Antonio (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- North America > United States > Texas > Travis County > Austin (0.04)
- North America > Canada > Ontario > Toronto (0.04)
CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks
Nikzad, Nick, Gao, Yongsheng, Zhou, Jun
In recent years, convolutional neural networks (CNNs) with channel-wise feature refining mechanisms have brought noticeable benefits to modelling channel dependencies. However, current attention paradigms fail to infer an optimal channel descriptor capable of simultaneously exploiting statistical and spatial relationships among feature maps. In this paper, to overcome this shortcoming, we present a novel channel-wise spatially autocorrelated (CSA) attention mechanism. Inspired by geographical analysis, the proposed CSA exploits the spatial relationships between channels of feature maps to produce an effective channel descriptor. To the best of our knowledge, this is the f irst time that the concept of geographical spatial analysis is utilized in deep CNNs. The proposed CSA imposes negligible learning parameters and light computational overhead to the deep model, making it a powerful yet efficient attention module of choice. We validate the effectiveness of the proposed CSA networks (CSA-Nets) through extensive experiments and analysis on ImageNet, and MS COCO benchmark datasets for image classification, object detection, and instance segmentation. The experimental results demonstrate that CSA-Nets are able to consistently achieve competitive performance and superior generalization than several state-of-the-art attention-based CNNs over different benchmark tasks and datasets.
- Information Technology > Sensing and Signal Processing > Image Processing (1.00)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
A Details of the Shapley Value Sampling
Then we give more experimental results on CIFAR-100 and stability analysis of Shapley value (Appendix B). Finally, we add properties of the Shapley value and proof of decomposition of CNNs in frequency domain (Appendix D). In this section, we introduce the details of the Shapley value sampling. A.1 Details of the Model for the Shapley Value Sampling We sample the Shapley value for models trained on CIFAR10, CIFAR100 and ImageNet. For CIFAR10 and CIFAR100, we employ ResNet-18 and train them ourselves.
Explainable Benchmarking for Iterative Optimization Heuristics
van Stein, Niki, Vermetten, Diederick, Kononova, Anna V., Bäck, Thomas
Traditional benchmarking methods are often used to evaluate algorithms in isolation, with a single algorithm configuration (hyper-parameter setting) or with a limited set of a few variations against a limited set of state-of-the-art algorithms, leading to limited insights into their comparative performance and practical applicability. This study addresses these challenges by employing modular optimization approaches and explainable AI techniques in order to derive insights into the algorithmic behaviour of a large set of algorithm components (modules) and their hyper-parameters. Modular optimization frameworks allow for the comparison of various modifications on a core algorithm, facilitating a deeper understanding of each component's influence on the algorithm's performance in different scenarios. There is already a wide variety of modular algorithm frameworks available, but their application for explicit explainability of the various algorithmic components and settings has been relatively unexplored. This paper aims to bridge this gap by providing a comprehensive framework for explainable benchmarking in iterative optimization heuristics and by providing a software library (IOH-Xplainer) to facilitate researchers to use the proposed framework.
- Europe > Netherlands > South Holland > Leiden (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Virginia > Fairfax County > Fairfax (0.04)
- (6 more...)
- Research Report (1.00)
- Overview (1.00)
Sluggish and Chemically-Biased Interstitial Diffusion in Concentrated Solid Solution Alloys: Mechanisms and Methods
Xu, Biao, Fu, Haijun, Huang, Shasha, Ma, Shihua, Xiong, Yaoxu, Zhang, Jun, Xiang, Xuepeng, Lu, Wenyu, Kai, Ji-Jung, Zhao, Shijun
Interstitial diffusion is a pivotal process that governs the phase stability and irradiation response of materials in non-equilibrium conditions. In this work, we study sluggish and chemically-biased interstitial diffusion in Fe-Ni concentrated solid solution alloys (CSAs) by combining machine learning (ML) and kinetic Monte Carlo (kMC), where ML is used to accurately and efficiently predict the migration energy barriers on-the-fly. The ML-kMC reproduces the diffusivity that was reported by molecular dynamics results at high temperatures. With this powerful tool, we find that the observed sluggish diffusion and the "Ni-Ni-Ni"-biased diffusion in Fe-Ni alloys are ascribed to a unique "Barrier Lock" mechanism, whereas the "Fe-Fe-Fe"-biased diffusion is influenced by a "Component Dominance" mechanism. Inspired by the mentioned mechanisms, a practical AvgS-kMC method is proposed for conveniently and swiftly determining interstitial-mediated diffusivity by only relying on the mean energy barriers of migration patterns. Combining the AvgS-kMC with the differential evolutionary algorithm, an inverse design strategy for optimizing sluggish diffusion properties is applied to emphasize the crucial role of favorable migration patterns.
Causal-Based Supervision of Attention in Graph Neural Network: A Better and Simpler Choice towards Powerful Attention
Wang, Hongjun, Chen, Jiyuan, Du, Lun, Fu, Qiang, Han, Shi, Song, Xuan
Recent years have witnessed the great potential of attention mechanism in graph representation learning. However, while variants of attention-based GNNs are setting new benchmarks for numerous real-world datasets, recent works have pointed out that their induced attentions are less robust and generalizable against noisy graphs due to lack of direct supervision. In this paper, we present a new framework which utilizes the tool of causality to provide a powerful supervision signal for the learning process of attention functions. Specifically, we estimate the direct causal effect of attention to the final prediction, and then maximize such effect to guide attention attending to more meaningful neighbors. Our method can serve as a plug-and-play module for any canonical attention-based GNNs in an end-to-end fashion. Extensive experiments on a wide range of benchmark datasets illustrated that, by directly supervising attention functions, the model is able to converge faster with a clearer decision boundary, and thus yields better performances.
- North America > United States > Texas (0.05)
- North America > United States > Wisconsin (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- (4 more...)
Classifying Mental-Disorders through Clinicians Subjective Approach based on Three-way Decision
Wang, Huidong, Sourav, Md Sakib Ullah, Yang, Mengdi, Zhang, Jiaping
Prevalence can be seen as having a lack of motivation to live, losing interest in everything among common people. Hence, they are frequently thriving towards psychiatric diagnosis than in the past days. Therefore, improper diagnosis of mental health disorders may lead to even more vulnerable consequences in a greater sense from an individual to a social perspective [38]. The traditional form of psychiatric diagnosis is much pretentious nowadays as few recent studies demonstrate several shortcomings within the widely established systems used for classifying mental disorders, namely, bipolar disorder, anxiety disorders, phobias, substance use disorder, mood disorders, and many others [2,3]. More often these recognized tools, such as DSM-5 [7] and ICD-11 [8], fails to distinguish between the proper and correct disorder diagnosis of a complex phenomenon in individual cases.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Maryland > Montgomery County > Rockville (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
- (3 more...)