AITopics | Nguyen, Phuc

Collaborating Authors

Nguyen, Phuc

Grade Inflation in Generative Models

Nguyen, Phuc, Li, Miao, Morgan, Alexandra, Arnaout, Rima, Arnaout, Ramy

arXiv.org Machine Learning, Jan-4-2025

Generative models hold great potential, but only if one can trust the evaluation of the data they generate. We show that many commonly used quality scores for comparing two-dimensional distributions of synthetic vs. ground-truth data give better results than they should, a phenomenon we call the "grade inflation problem." We show that the correlation score, Jaccard score, earth-mover's score, and Kullback-Leibler (relative-entropy) score all suffer grade inflation. We propose that any score that values all datapoints equally, as these do, will also exhibit grade inflation; we refer to such scores as "equipoint" scores. We introduce the concept of "equidensity" scores, and present the Eden score, to our knowledge the first example of such a score. We found that Eden avoids grade inflation and agrees better with human perception of goodness-of-fit than the equipoint scores above. We propose that any reasonable equidensity score will avoid grade inflation. We identify a connection between equidensity scores and Rényi entropy of negative order. We conclude that equidensity scores are likely to outperform equipoint scores for generative models, and for comparing low-dimensional distributions more generally.
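For readers unfamiliar with the Rényi entropy mentioned in the abstract, the following minimal sketch computes the standard definition for a discrete distribution. It is not the Eden score, which the abstract does not specify; the function name, the example distribution, and the convention of summing only over non-zero probabilities are assumptions made for illustration.

```python
import math

def renyi_entropy(probs, alpha):
    """Rényi entropy in nats: log(sum(p_i ** alpha)) / (1 - alpha), for alpha != 1."""
    if alpha == 1:
        raise ValueError("alpha = 1 is the Shannon limit; not handled in this sketch")
    # Assumption: sum only over outcomes with non-zero probability, since
    # p ** alpha is undefined at p = 0 when alpha is negative.
    support = [p for p in probs if p > 0]
    return math.log(sum(p ** alpha for p in support)) / (1 - alpha)

# Hypothetical two-outcome distribution with one rare outcome.
skewed = [0.9, 0.1]
h_negative = renyi_entropy(skewed, -1)  # negative order: the rare outcome dominates the sum
h_positive = renyi_entropy(skewed, 2)   # positive order: the common outcome dominates the sum
```

At negative order the sum is dominated by low-probability outcomes, so sparse regions of a distribution carry more weight than they do at positive order.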

data mining, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

2501.00664

Country: North America > United States > California > San Francisco County > San Francisco (0.28)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Therapeutic Area (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.82)
Information Technology > Data Science > Data Mining (0.68)

QCResUNet: Joint Subject-level and Voxel-level Segmentation Quality Prediction

Qiu, Peijie, Chakrabarty, Satrajit, Nguyen, Phuc, Ghosh, Soumyendu Sekhar, Sotiras, Aristeidis

arXiv.org Artificial Intelligence, Dec-9-2024

Deep learning has made significant strides in automated brain tumor segmentation from magnetic resonance imaging (MRI) scans in recent years. However, the reliability of these tools is hampered by the presence of poor-quality segmentation outliers, particularly in out-of-distribution samples, making their implementation in clinical practice difficult. Therefore, there is a need for quality control (QC) to screen the quality of the segmentation results. Although numerous automatic QC methods have been developed for segmentation quality screening, most were designed for cardiac MRI segmentation, which involves a single modality and a single tissue type. Furthermore, most prior works only provided subject-level predictions of segmentation quality and did not identify erroneous parts of the segmentation that may require refinement. To address these limitations, we proposed a novel multi-task deep learning architecture, termed QCResUNet, which produces subject-level segmentation-quality measures as well as voxel-level segmentation error maps for each available tissue class. To validate the effectiveness of the proposed method, we conducted experiments assessing its performance in evaluating the quality of two distinct segmentation tasks. First, we aimed to assess the quality of brain tumor segmentation results. For this task, we performed experiments on one internal and two external datasets. Second, we aimed to evaluate the segmentation quality of cardiac MRI data from the Automated Cardiac Diagnosis Challenge. The proposed method achieved high performance in predicting subject-level segmentation-quality metrics and accurately identifying segmentation errors on a voxel basis. This has the potential to be used to guide human-in-the-loop feedback to improve segmentations in clinical settings.
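To make the two kinds of output concrete, the sketch below computes a subject-level quality measure and a voxel-level error map from a predicted mask and a reference mask. It assumes the Dice coefficient as the quality measure, which the abstract does not name, and it uses a reference mask that a QC model would not have at inference time; it illustrates the quantities being predicted, not the QCResUNet architecture. All names and the toy masks are hypothetical.

```python
def dice_score(pred, truth):
    """Dice overlap between two binary masks given as flat lists of 0/1 values."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of voxels")
    intersection = sum(p * t for p, t in zip(pred, truth))
    total = sum(pred) + sum(truth)
    # Convention assumed here: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total

def error_map(pred, truth):
    """Voxel-level error map: 1 where the prediction disagrees with the reference."""
    return [int(p != t) for p, t in zip(pred, truth)]

# Toy six-voxel masks (hypothetical data).
predicted = [1, 1, 1, 0, 0, 0]
reference = [1, 1, 0, 0, 0, 1]

subject_level = dice_score(predicted, reference)  # one number per subject
voxel_level = error_map(predicted, reference)     # one flag per voxel
```

The single score supports screening whole cases, while the per-voxel flags point to the regions that may need refinement.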

artificial intelligence, machine learning, segmentation, (17 more...)

arXiv.org Artificial Intelligence

2412.07156

Country:

North America > United States (0.46)
Asia > China (0.28)

Genre: Research Report > New Finding (0.93)

Industry:

Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

lucie: An Improved Python Package for Loading Datasets from the UCI Machine Learning Repository

Ge, Kenneth, Nguyen, Phuc, Arnaout, Ramy

arXiv.org Artificial Intelligence, Oct-15-2024

The University of California–Irvine (UCI) Machine Learning (ML) Repository (UCIMLR) is consistently cited as one of the most popular dataset repositories, hosting hundreds of high-impact datasets. However, a significant portion, including 28.4% of the top 250, cannot be imported via the ucimlrepo package that is provided and recommended by the UCIMLR website. Instead, they are hosted as .zip files, containing nonstandard formats that are difficult to import without additional ad hoc processing. To address this issue, here we present lucie (load University California Irvine examples), a utility that automatically determines the data format and imports many of these previously non-importable datasets, while preserving as much of a tabular data structure as possible. lucie was designed using the top 100 most popular datasets and benchmarked on the next 130, where it resulted in a success rate of 95.4% vs. 73.1% for ucimlrepo. lucie is available as a Python package on PyPI with 98% code coverage.

artificial intelligence, dataset, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2410.09119

Country:

North America > United States > California > Orange County > Irvine (0.24)
Europe > United Kingdom > England > Oxfordshire (0.14)

Genre: Research Report (0.40)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding

Engelmann, Francis, Takmaz, Ayca, Schult, Jonas, Fedele, Elisabetta, Wald, Johanna, Peng, Songyou, Wang, Xi, Litany, Or, Tang, Siyu, Tombari, Federico, Pollefeys, Marc, Guibas, Leonidas, Tian, Hongbo, Wang, Chunjie, Yan, Xiaosheng, Wang, Bingwen, Zhang, Xuanyang, Liu, Xiao, Nguyen, Phuc, Nguyen, Khoi, Tran, Anh, Pham, Cuong, Huang, Zhening, Wu, Xiaoyang, Chen, Xi, Zhao, Hengshuang, Zhu, Lei, Lasenby, Joan

arXiv.org Artificial Intelligence, Mar-17-2024

This report provides an overview of the challenge hosted at the OpenSUN3D Workshop on Open-Vocabulary 3D Scene Understanding held in conjunction with ICCV 2023. The goal of this workshop series is to provide a platform for exploration and discussion of open-vocabulary 3D scene understanding tasks, including but not limited to segmentation, detection and mapping. We provide an overview of the challenge hosted at the workshop, present the challenge dataset, the evaluation methodology, and brief descriptions of the winning methods. Additional details are available on the OpenSUN3D workshop website.

artificial intelligence, dataset, query, (15 more...)

arXiv.org Artificial Intelligence

2402.15321

Country: Asia > China (0.14)

Genre: Instructional Material > Course Syllabus & Notes (0.88)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.95)

An Unobtrusive and Lightweight Ear-worn System for Continuous Epileptic Seizure Detection

Aziz, Abdul, Pham, Nhat, Vora, Neel, Reynolds, Cody, Lehnen, Jaime, Venkatesh, Pooja, Yao, Zhuoran, Harvey, Jay, Vu, Tam, Ding, Kan, Nguyen, Phuc

arXiv.org Artificial Intelligence, Jan-1-2024

Epilepsy is one of the most common neurological diseases globally, affecting around 50 million people worldwide. Fortunately, up to 70 percent of people with epilepsy could live seizure-free if properly diagnosed and treated, and a reliable technique to monitor the onset of seizures could improve the quality of life of patients who are constantly facing the fear of random seizure attacks. The scalp-based EEG test, despite being the gold standard for diagnosing epilepsy, is costly, necessitates hospitalization, demands skilled professionals for operation, and is discomforting for users. In this paper, we propose EarSD, a novel lightweight, unobtrusive, and socially acceptable ear-worn system to detect epileptic seizure onsets by measuring the physiological signals from behind the user's ears. EarSD includes an integrated custom-built sensing, computing, and communication PCB to collect and amplify the signals of interest, remove the noises caused by motion artifacts and environmental impacts, and stream the data wirelessly to the computer or mobile phone nearby, where data are uploaded to the host computer for further processing. We conducted both in-lab and in-hospital experiments with epileptic seizure patients who were hospitalized for seizure studies. The preliminary results confirm that EarSD can detect seizures with up to 95.3 percent accuracy by just using classical machine learning algorithms.

artificial intelligence, data quality, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2401.05425

Country:

North America > United States > Texas (0.29)
North America > United States > Massachusetts > Hampshire County > Amherst (0.14)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology > Epilepsy (1.00)
Health & Medicine > Therapeutic Area > Genetic Disease (1.00)

Technology:

Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

TabIQA: Table Questions Answering on Business Document Images

Nguyen, Phuc, Ly, Nam Tuan, Takeda, Hideaki, Takasu, Atsuhiro

arXiv.org Artificial Intelligence, Mar-27-2023

Answering table questions from business documents poses many challenges, requiring an understanding of tabular structures, cross-document referencing, and additional numeric computations beyond simple search queries. This paper introduces a novel pipeline, named TabIQA, to answer questions about business document images. TabIQA combines state-of-the-art deep learning techniques 1) to extract table content and structural information from images and 2) to answer various questions related to numerical data, text-based information, and complex queries from structured tables. The evaluation results on the VQAonBD 2023 dataset demonstrate the effectiveness of TabIQA in achieving promising performance in answering table-related questions. The TabIQA repository is available at https://github.com/phucty/itabqa.

machine learning, question answering, recognition, (20 more...)

arXiv.org Artificial Intelligence

2303.14935

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.56)

TabEAno: Table to Knowledge Graph Entity Annotation

Nguyen, Phuc, Kertkeidkachorn, Natthawut, Ichise, Ryutaro, Takeda, Hideaki

arXiv.org Artificial Intelligence, Oct-5-2020

In the Open Data era, a large number of table resources have been made available on the Web and data portals. However, it is difficult to directly utilize such data due to the ambiguity of entities, name variations, heterogeneous schema, and missing or incomplete metadata. To address these issues, we propose a novel approach, namely TabEAno, to semantically annotate table rows toward knowledge graph entities. Specifically, we introduce a "two-cells" lookup strategy based on the assumption that there is an existing logical relation occurring in the knowledge graph between two close cells in the same row of the table. Despite the simplicity of the approach, TabEAno outperforms the state-of-the-art approaches on the two standard datasets, T2D and Limaye, and on the large-scale Wikipedia tables dataset.

annotation, deep learning, neural network, (21 more...)

arXiv.org Artificial Intelligence

2010.01829

Country: Oceania > New Zealand (0.14)

Genre:

Research Report > Promising Solution (0.54)
Overview > Innovation (0.54)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

MTab: Matching Tabular Data to Knowledge Graph using Probability Models

Nguyen, Phuc, Kertkeidkachorn, Natthawut, Ichise, Ryutaro, Takeda, Hideaki

arXiv.org Artificial Intelligence, Oct-1-2019

This paper presents the design of our system, namely MTab, for the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (SemTab 2019). MTab combines a voting algorithm and probability models to solve critical problems of the matching tasks.

neural network, probability, text processing, (22 more...)

arXiv.org Artificial Intelligence

1910.00246

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.71)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

EmbNum: Semantic labeling for numerical values with deep metric learning

Nguyen, Phuc, Nguyen, Khai, Ichise, Ryutaro, Takeda, Hideaki

arXiv.org Machine Learning, Jun-25-2018

Semantic labeling is the task of matching an unknown data source to labeled data sources. The semantic labels could be properties or classes in knowledge bases, or labels manually annotated by domain experts. In this paper, we present EmbNum, a novel approach to match numerical columns from different table data sources. We use a representation network architecture consisting of a triplet network and a convolutional neural network to learn a mapping function from numerical columns to a transformed space. In this space, the Euclidean distance can be used to measure the "semantic similarity" of two columns. Our experiments on City-Data and Open-Data demonstrate that EmbNum achieves considerable improvements in comparison with the state-of-the-art methods in effectiveness and efficiency.
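The idea of training a triplet network so that Euclidean distance reflects similarity can be sketched with the common margin-based triplet loss. This is a generic formulation, not EmbNum's actual network or training code; the embedding vectors, the margin value, and the use of unsquared distances are assumptions chosen for illustration.

```python
import math

def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Margin-based triplet loss: zero once the negative is at least
    `margin` farther from the anchor than the positive is."""
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)

# Hypothetical 2-D embeddings of numerical columns.
anchor = [0.0, 0.0]
same_label = [0.0, 1.0]     # column with the same semantic label
far_negative = [3.0, 4.0]   # differently labeled column, already far away
near_negative = [1.0, 0.0]  # differently labeled column, as close as the positive

loss_far = triplet_loss(anchor, same_label, far_negative)
loss_near = triplet_loss(anchor, same_label, near_negative)
```

A zero loss means the triplet is already well separated; a positive loss would push the network to move the differently labeled column farther from the anchor.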

deep learning, neural network, numerical column, (22 more...)

arXiv.org Machine Learning

1807.01367

Country:

North America > United States (0.28)
Asia > Japan > Honshū > Kantō (0.14)

Genre:

Research Report > Promising Solution (0.54)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)
