AITopics | Nguyen, An

Collaborating Authors

Nguyen, An

How Intermodal Interaction Affects the Performance of Deep Multimodal Fusion for Mixed-Type Time Series

Dietz, Simon, Altstidl, Thomas, Zanca, Dario, Eskofier, Björn, Nguyen, An

arXiv.org Artificial Intelligence, Jun-21-2024

Mixed-type time series (MTTS) is a bimodal data type that is common in many domains, such as healthcare, finance, environmental monitoring, and social media. It consists of regularly sampled continuous time series and irregularly sampled categorical event sequences. The integration of both modalities through multimodal fusion is a promising approach for processing MTTS. However, the question of how to effectively fuse both modalities remains open. In this paper, we present a comprehensive evaluation of several deep multimodal fusion approaches for MTTS forecasting. Our comparison includes three fusion types (early, intermediate, and late) and five fusion methods (concatenation, weighted mean, weighted mean with correlation, gating, and feature sharing). We evaluate these fusion approaches on three distinct datasets, one of which was generated using a novel framework. This framework allows for the control of key data properties, such as the strength and direction of intermodal interactions, modality imbalance, and the degree of randomness in each modality, providing a more controlled environment for testing fusion approaches. Our findings show that the performance of different fusion approaches can be substantially influenced by the direction and strength of intermodal interactions. The study reveals that early and intermediate fusion approaches excel at capturing fine-grained and coarse-grained cross-modal features, respectively. These findings underscore the crucial role of intermodal interactions in determining the most effective fusion strategy for MTTS forecasting.
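
Three of the fusion methods named in the abstract (concatenation, weighted mean, gating) can be illustrated on two already-encoded feature vectors. This is only a minimal sketch, not the paper's implementation: the function names are hypothetical, the mixing weight `w` and the gate logits are passed in by hand here, whereas in a trained model they would be learned, and the paper's exact definitions may differ.

```python
import math

def concat_fusion(ts_feat, ev_feat):
    """Concatenation: stack both modality features into one longer vector."""
    return list(ts_feat) + list(ev_feat)

def weighted_mean_fusion(ts_feat, ev_feat, w=0.5):
    """Weighted mean: one scalar weight w (assumed learned elsewhere) mixes the modalities."""
    if len(ts_feat) != len(ev_feat):
        raise ValueError("both feature vectors must have the same length")
    return [w * a + (1.0 - w) * b for a, b in zip(ts_feat, ev_feat)]

def gating_fusion(ts_feat, ev_feat, gate_logits):
    """Gating: a per-dimension sigmoid gate decides how much of each modality passes.

    gate_logits is a hypothetical stand-in for the output of a learned gating layer.
    """
    gates = [1.0 / (1.0 + math.exp(-g)) for g in gate_logits]
    return [g * a + (1.0 - g) * b for g, a, b in zip(gates, ts_feat, ev_feat)]

ts = [1.0, 3.0]   # features from the continuous time series (made-up values)
ev = [3.0, 1.0]   # features from the categorical event sequence (made-up values)
fused_concat = concat_fusion(ts, ev)
fused_mean = weighted_mean_fusion(ts, ev, w=0.5)
fused_gate = gating_fusion(ts, ev, gate_logits=[0.0, 0.0])
```

Whether such a function is applied to raw inputs, hidden representations, or model outputs is what distinguishes early, intermediate, and late fusion.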

artificial intelligence, data mining, machine learning, (19 more...)

2406.15098

Country: Europe > Germany (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)

Mixture of Experts Meets Prompt-Based Continual Learning

Le, Minh, Nguyen, An, Nguyen, Huy, Nguyen, Trang, Pham, Trang, Van Ngo, Linh, Ho, Nhat

arXiv.org Artificial Intelligence, May-22-2024

Exploiting the power of pre-trained models, prompt-based approaches stand out compared to other continual learning solutions in effectively preventing catastrophic forgetting, even with very few learnable parameters and without the need for a memory buffer. While existing prompt-based continual learning methods excel in leveraging prompts for state-of-the-art performance, they often lack a theoretical explanation for the effectiveness of prompting. This paper conducts a theoretical analysis to unravel how prompts bestow such advantages in continual learning, thus offering a new perspective on prompt design. We first show that the attention block of pre-trained models like Vision Transformers inherently encodes a special mixture of experts architecture, characterized by linear experts and quadratic gating score functions. This realization drives us to provide a novel view on prefix tuning, reframing it as the addition of new task-specific experts, thereby inspiring the design of a novel gating mechanism termed Non-linear Residual Gates (NoRGa). Through the incorporation of non-linear activation and residual connection, NoRGa enhances continual learning performance while preserving parameter efficiency. The effectiveness of NoRGa is substantiated both theoretically and empirically across diverse benchmarks and pretraining paradigms.

artificial intelligence, exp, machine learning, (15 more...)

2405.14124

Country: North America > United States > Texas (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Identifying Untrustworthy Predictions in Neural Networks by Geometric Gradient Analysis

Schwinn, Leo, Nguyen, An, Raab, René, Bungert, Leon, Tenbrinck, Daniel, Zanca, Dario, Burger, Martin, Eskofier, Bjoern

arXiv.org Machine Learning, Feb-24-2021

The susceptibility of deep neural networks to untrustworthy predictions, including out-of-distribution (OOD) data and adversarial examples, still prevents their widespread use in safety-critical applications. Most existing methods either require retraining a given model to achieve robust identification of adversarial attacks or are limited to out-of-distribution sample detection. In this work, we propose geometric gradient analysis (GGA) to improve the identification of untrustworthy predictions without retraining a given model. GGA analyzes the geometry of the loss landscape of neural networks based on the saliency maps of their respective inputs. To motivate the proposed approach, we provide theoretical connections between the geometrical properties of gradients and local minima of the loss function. Furthermore, we demonstrate that the proposed method outperforms prior approaches in detecting OOD data and adversarial attacks, including state-of-the-art and adaptive attacks.
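
One way to picture a gradient-geometry analysis of this kind is to compute, for a single input, the loss gradient with respect to the input for every candidate class and then compare those gradients pairwise by cosine similarity. The sketch below assumes that reading of the abstract; the toy linear-softmax model, the weight matrix `W`, and all function names are hypothetical and are not taken from the paper.

```python
import math

def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]

def input_gradients(W, x):
    """Saliency vectors: gradient of the cross-entropy loss w.r.t. x, one per target class.

    Assumes a toy model logits = W x, for which the gradient is W^T (p - onehot(k)).
    """
    logits = [sum(w * v for w, v in zip(row, x)) for row in W]
    p = softmax(logits)
    grads = []
    for k in range(len(W)):
        delta = [p_c - (1.0 if c == k else 0.0) for c, p_c in enumerate(p)]
        grads.append([sum(delta[c] * W[c][j] for c in range(len(W)))
                      for j in range(len(x))])
    return grads

def cosine_similarity_matrix(vectors):
    """Pairwise cosine similarity between all saliency vectors."""
    norms = [math.sqrt(sum(v * v for v in vec)) for vec in vectors]
    n = len(vectors)
    return [[sum(a * b for a, b in zip(vectors[i], vectors[j]))
             / (norms[i] * norms[j] + 1e-12)
             for j in range(n)] for i in range(n)]

W = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]  # made-up 3-class, 2-feature model
x = [0.5, -0.2]                             # made-up input
similarity = cosine_similarity_matrix(input_gradients(W, x))
```

A detector could then be built on summary statistics of `similarity`; how the paper actually turns the geometry into a decision is not stated in the abstract and is not reproduced here.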

adversarial attack, deep learning, neural network, (19 more...)

2102.12196

Country: Europe > Germany (0.14)

Genre: Research Report (0.64)

Industry: Information Technology (0.58)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)

System Design for a Data-driven and Explainable Customer Sentiment Monitor

Nguyen, An, Foerstel, Stefan, Kittler, Thomas, Kurzyukov, Andrey, Schwinn, Leo, Zanca, Dario, Hipp, Tobias, Sun, Da Jun, Schrapp, Michael, Rothgang, Eva, Eskofier, Bjoern

arXiv.org Artificial Intelligence, Jan-11-2021

The most important goal of customer service is to keep the customer satisfied. However, service resources are always limited and must be prioritized. Therefore, it is important to identify customers who are likely to become dissatisfied and whose issues might lead to escalations. Today, this prioritization of customers is often done manually. Data science on IoT data (especially log data) for machine health monitoring and analytics on enterprise data for customer relationship management (CRM) have mainly been researched and applied independently. In this paper, we present a framework for a data-driven decision support system that combines IoT and enterprise data to model customer sentiment. Such decision support systems can help to prioritize customers and service resources to effectively troubleshoot problems or even avoid them. The framework is applied in a real-world case study with a major medical device manufacturer. This includes a fully automated and interpretable machine learning pipeline designed to meet the requirements defined with domain experts and end users. The overall framework is currently deployed; it learns and evaluates predictive models from terabytes of IoT and enterprise data to actively monitor the customer sentiment for a fleet of thousands of high-end medical devices. Furthermore, we provide an anonymized industrial benchmark dataset for the research community.

customer, deep learning, neural network, (23 more...)

2101.04086

Country: North America > United States > New York (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Health Care Technology (0.69)
Health & Medicine > Health Care Equipment & Supplies (0.55)
Energy > Oil & Gas (0.49)
(2 more...)

Technology:

Information Technology > Enterprise Applications (1.00)
Information Technology > Decision Support Systems (1.00)
Information Technology > Data Science > Data Mining (1.00)
(4 more...)

Time Matters: Time-Aware LSTMs for Predictive Business Process Monitoring

Nguyen, An, Chatterjee, Srijeet, Weinzierl, Sven, Schwinn, Leo, Matzner, Martin, Eskofier, Bjoern

arXiv.org Artificial Intelligence, Nov-5-2020

Predictive business process monitoring (PBPM) aims to predict future process behavior during ongoing process executions based on event log data. In particular, techniques for next-activity and timestamp prediction can help to improve the performance of operational business processes. Recently, researchers have proposed many PBPM solutions based on deep learning. Due to the sequential nature of event log data, a common choice is to apply recurrent neural networks with long short-term memory (LSTM) cells. We argue that the elapsed time between events is informative. However, current PBPM techniques mainly use 'vanilla' LSTM cells and hand-crafted time-related control flow features. To better model the time dependencies between events, we propose a new PBPM technique based on time-aware LSTM (T-LSTM) cells. T-LSTM cells inherently incorporate the elapsed time between consecutive events to adjust the cell memory. Furthermore, we introduce cost-sensitive learning to account for the common class imbalance in event logs. Our experiments on publicly available benchmark event logs indicate the effectiveness of the introduced techniques.
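
The idea of adjusting the cell memory by elapsed time can be sketched as follows: the previous cell state is split into a short-term and a long-term part, and only the short-term part is discounted by a factor that shrinks as the gap between events grows. This is a simplified illustration under assumptions, not the paper's code: the decay function `1 / log(e + Δt)` is one commonly used choice for time-aware LSTMs, and the scalar parameters `w_d` and `b_d` are hypothetical stand-ins for learned weights.

```python
import math

def time_decay(delta_t):
    """Monotonically decreasing discount for an elapsed time delta_t >= 0 (assumed form)."""
    return 1.0 / math.log(math.e + delta_t)

def adjust_cell_memory(c_prev, delta_t, w_d=1.0, b_d=0.0):
    """Discount the short-term part of the previous cell state by elapsed time.

    c_prev  : previous cell state as a list of floats
    delta_t : time elapsed since the previous event
    w_d, b_d: hypothetical scalar stand-ins for a learned decomposition layer
    """
    short_term = [math.tanh(w_d * c + b_d) for c in c_prev]
    long_term = [c - s for c, s in zip(c_prev, short_term)]
    g = time_decay(delta_t)
    return [l + g * s for l, s in zip(long_term, short_term)]

c_prev = [1.0, -0.5]                                       # made-up cell state
after_short_gap = adjust_cell_memory(c_prev, delta_t=1.0)
after_long_gap = adjust_cell_memory(c_prev, delta_t=100.0)
```

The adjusted state then replaces the previous cell state in the usual LSTM gate equations, so a long pause between two events weakens the influence of recent memory on the next prediction.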

deep learning, neural network, prediction, (20 more...)

2010.00889

Country: Europe > Germany (0.14)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Conformance Checking for a Medical Training Process Using Petri net Simulation and Sequence Alignment

Nguyen, An, Zhang, Wenyu, Schwinn, Leo, Eskofier, Bjoern

arXiv.org Artificial Intelligence, Oct-21-2020

Process mining has recently gained popularity in healthcare due to its potential to provide a transparent, objective, and data-based view of processes. Conformance checking is a sub-discipline of process mining that can answer how actual process executions deviate from existing guidelines. In this work, we analyze a medical training process for a surgical procedure. Ten students were trained to install a central venous catheter (CVC) with ultrasound. Event log data was collected directly after instruction by the supervisors during a first test run and again after a subsequent individual training phase. To provide objective performance measures, we formulate an optimal, global sequence alignment problem inspired by approaches in bioinformatics. To this end, we use the Petri net model representation of the medical process guideline to simulate a representative set of guideline-conforming sequences. Next, we calculate the optimal, global sequence alignment of the recorded and simulated event logs. Finally, the output measures and a visualization of the aligned sequences are provided for objective feedback.
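
Optimal global sequence alignment is classically solved with Needleman-Wunsch dynamic programming, so the alignment step can be sketched with that algorithm. The abstract does not say which scoring scheme was used; the match, mismatch, and gap scores below, the activity names, and the function names are all assumptions made for illustration.

```python
def global_alignment_score(recorded, reference, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch score for aligning two activity sequences end to end."""
    n, m = len(recorded), len(reference)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap          # recorded prefix aligned only to gaps
    for j in range(1, m + 1):
        score[0][j] = j * gap          # reference prefix aligned only to gaps
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            same = recorded[i - 1] == reference[j - 1]
            diagonal = score[i - 1][j - 1] + (match if same else mismatch)
            skip_recorded = score[i - 1][j] + gap   # extra step in the recording
            skip_reference = score[i][j - 1] + gap  # guideline step was omitted
            score[i][j] = max(diagonal, skip_recorded, skip_reference)
    return score[n][m]

def best_matching_trace(recorded, simulated_traces):
    """Pick the simulated guideline-conforming trace that aligns best with the recording."""
    return max(simulated_traces, key=lambda t: global_alignment_score(recorded, t))

# Hypothetical activity names; the real event log vocabulary is not given in the abstract.
guideline_traces = [
    ["prepare", "ultrasound", "puncture", "insert"],
    ["prepare", "puncture", "ultrasound", "insert"],
]
student_trace = ["prepare", "ultrasound", "insert"]
closest = best_matching_trace(student_trace, guideline_traces)
```

The score against the closest simulated trace gives a single conformance measure per run, and a traceback through the same table would yield the aligned sequences for visualization.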

educational setting, professional school, sequence, (18 more...)

2010.11719

Country: Europe > Germany (0.14)

Genre:

Research Report (0.64)
Workflow (0.46)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Education > Educational Setting > Higher Education (0.61)
Education > Curriculum > Subject-Specific Education (0.61)

Technology:

Information Technology > Biomedical Informatics > Translational Bioinformatics (0.83)
Information Technology > Artificial Intelligence > Machine Learning (0.71)
