AITopics | Pan, Deng

Collaborating Authors

Pan, Deng

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Fast Explainability via Feasible Concept Sets Generator

Pan, Deng, Moniz, Nuno, Chawla, Nitesh

arXiv.org Artificial IntelligenceMay-28-2024

A long-standing dilemma prevents the broader application of explanation methods: general applicability and inference speed. On the one hand, existing model-agnostic explanation methods usually make minimal pre-assumptions about the prediction models to be explained. Still, they require additional queries to the model through propagation or back-propagation to approximate the models' behaviors, resulting in slow inference and hindering their use in time-sensitive tasks. On the other hand, various model-dependent explanations have been proposed that achieve low-cost, fast inference but at the expense of limiting their applicability to specific model structures. In this study, we bridge the gap between the universality of model-agnostic approaches and the efficiency of model-specific approaches by proposing a novel framework without assumptions on the prediction model's structures, achieving high efficiency during inference and allowing for real-time explanations. To achieve this, we first define explanations through a set of human-comprehensible concepts and propose a framework to elucidate model predictions via minimal feasible concept sets. Second, we show that a minimal feasible set generator can be learned as a companion explainer to the prediction model, generating explanations for predictions. Finally, we validate this framework by implementing a novel model-agnostic method that provides robust explanations while facilitating real-time inference. Our claims are substantiated by comprehensive experiments, highlighting the effectiveness and efficiency of our approach.

explanation, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2405.18664

Country: North America > United States (0.14)

Genre: Research Report > New Finding (0.35)

Industry: Information Technology (0.93)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Negative Flux Aggregation to Estimate Feature Attributions

Li, Xin, Pan, Deng, Li, Chengyin, Qiang, Yao, Zhu, Dongxiao

arXiv.org Artificial IntelligenceMay-13-2023

Gradient based methods such as Saliency Map [Simonyan Due to multi-layer nonlinearity of the deep neural et al., 2013], SmoothGrad [Smilkov et al., 2017], Full-network architectures, explaining DNN predictions Grad [Srinivas and Fleuret, 2019], Integrated Gradient (IG) still remains as an open problem, preventing and its variants [Sundararajan et al., 2017; Hesse et al., 2021; us from gaining a deeper understanding of the mechanisms. Erion et al., 2021; Pan et al., 2021; Kapishnikov et al., 2019; To enhance the explainability of DNNs, Kapishnikov et al., 2021] require neither surrogates nor customized we estimate the input feature's attributions to the rules but must tackle unstable estimates of gradients prediction task using divergence and flux. Inspired w.r.t. the given inputs. IG type of path integration based by the divergence theorem in vector analysis, we methods mitigate this issue via a path integration for gradient develop a novel Negative Flux Aggregation (Ne-smoothing, however, this also introduces another degree FLAG) formulation and an efficient approximation of instability and noise sourced from arbitrary selections of algorithm to estimate attribution map.

artificial intelligence, gradient, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2301.06989

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.66)

Add feedback

Decentralized federated learning methods for reducing communication cost and energy consumption in UAV networks

Pan, Deng, Khoshkholghi, Mohammad Ali, Mahmoodi, Toktam

arXiv.org Artificial IntelligenceApr-13-2023

Unmanned aerial vehicles (UAV) or drones play many roles in a modern smart city such as the delivery of goods, mapping real-time road traffic and monitoring pollution. The ability of drones to perform these functions often requires the support of machine learning technology. However, traditional machine learning models for drones encounter data privacy problems, communication costs and energy limitations. Federated Learning, an emerging distributed machine learning approach, is an excellent solution to address these issues. Federated learning (FL) allows drones to train local models without transmitting raw data. However, existing FL requires a central server to aggregate the trained model parameters of the UAV. A failure of the central server can significantly impact the overall training. In this paper, we propose two aggregation methods: Commutative FL and Alternate FL, based on the existing architecture of decentralised Federated Learning for UAV Networks (DFL-UN) by adding a unique aggregation method of decentralised FL. Those two methods can effectively control energy consumption and communication cost by controlling the number of local training epochs, local communication, and global communication. The simulation results of the proposed training methods are also presented to verify the feasibility and efficiency of the architecture compared with two benchmark methods (e.g. standard machine learning training and standard single aggregation server training). The simulation results show that the proposed methods outperform the benchmark methods in terms of operational stability, energy consumption and communication cost.

artificial intelligence, drone, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2304.06551

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.67)

Add feedback

Learning Compact Features via In-Training Representation Alignment

Li, Xin, Li, Xiangrui, Pan, Deng, Qiang, Yao, Zhu, Dongxiao

arXiv.org Artificial IntelligenceNov-23-2022

Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of the feature extractor (i.e., last hidden layer) and a linear classifier (i.e., output layer) that are trained jointly with stochastic gradient descent (SGD) on the loss function (e.g., cross-entropy). In each epoch, the true gradient of the loss function is estimated using a mini-batch sampled from the training set and model parameters are then updated with the mini-batch gradients. Although the latter provides an unbiased estimation of the former, they are subject to substantial variances derived from the size and number of sampled mini-batches, leading to noisy and jumpy updates. To stabilize such undesirable variance in estimating the true gradients, we propose In-Training Representation Alignment (ITRA) that explicitly aligns feature distributions of two different mini-batches with a matching loss in the SGD training process. We also provide a rigorous analysis of the desirable effects of the matching loss on feature representation learning: (1) extracting compact feature representation; (2) reducing over-adaption on mini-batches via an adaptive weighting mechanism; and (3) accommodating to multi-modalities. Finally, we conduct large-scale experiments on both image and text classifications to demonstrate its superior performance to the strong baselines.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2211.13332

Country: North America > United States (0.46)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.69)

Add feedback

Explainable Recommendation via Interpretable Feature Mapping and Evaluation of Explainability

Pan, Deng, Li, Xiangrui, Li, Xin, Zhu, Dongxiao

arXiv.org Machine LearningJul-12-2020

Latent factor collaborative filtering (CF) has been a widely used technique for recommender system by learning the semantic representations of users and items. Recently, explainable recommendation has attracted much attention from research community. However, trade-off exists between explainability and performance of the recommendation where metadata is often needed to alleviate the dilemma. We present a novel feature mapping approach that maps the uninterpretable general features onto the interpretable aspect features, achieving both satisfactory accuracy and explainability in the recommendations by simultaneous minimization of rating prediction loss and interpretation loss. To evaluate the explainability, we propose two new evaluation metrics specifically designed for aspect-level explanation using surrogate ground truth. Experimental results demonstrate a strong performance in both recommendation and explaining explanation, eliminating the need for metadata. Code is available from https://github.com/pd90506/AMCF.

artificial intelligence, natural language, recommendation, (19 more...)

arXiv.org Machine Learning

doi: 10.24963/ijcai.2020/373

2007.06133

Country: North America > United States (0.14)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback