AITopics

2408.01662

Country:

North America > United States (0.14)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.94)
Health & Medicine > Epidemiology (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.41)

Wang, Zeheng, van der Laan, Timothy, Usman, Muhammad

Quantum Kernel Principal Components Analysis for Compact Readout of Chemiresistive Sensor Arrays

arXiv.org Artificial IntelligenceAug-28-2024

The rapid growth of Internet of Things (IoT) devices necessitates efficient data compression techniques to handle the vast amounts of data generated by these devices. In this context, chemiresistive sensor arrays (CSAs), a simple-to-fabricate but crucial component in IoT systems, generate large volumes of data due to their simultaneous multi-sensor operations. Classical principal component analysis (cPCA) methods, a common solution to the data compression challenge, face limitations in preserving critical information during dimensionality reduction. In this study, we present quantum principal component analysis (qPCA) as a superior alternative to enhance information retention. Our findings demonstrate that qPCA outperforms cPCA in various back-end machine-learning modeling tasks, particularly in low-dimensional scenarios when limited Quantum bits (qubits) can be accessed. These results underscore the potential of noisy intermediate-scale quantum (NISQ) computers, despite current qubit limitations, to revolutionize data processing in real-world IoT applications, particularly in enhancing the efficiency and reliability of CSA data compression and readout.

dimension, kernel, reduction, (15 more...)

2409.00115

Country:

Oceania > Australia (0.05)
North America > United States (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Smart Houses & Appliances (0.55)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.81)

arXiv.org Artificial IntelligenceAug-22-2024

Robust Principal Component Analysis via Discriminant Sample Weight Learning

Deng, Yingzhuo, Hu, Ke, Li, Bo, Zhang, Yao

Principal component analysis (PCA) is a classical feature extraction method, but it may be adversely affected by outliers, resulting in inaccurate learning of the projection matrix. This paper proposes a robust method to estimate both the data mean and the PCA projection matrix by learning discriminant sample weights from data containing outliers. Each sample in the dataset is assigned a weight, and the proposed algorithm iteratively learns the weights, the mean, and the projection matrix, respectively. Specifically, when the mean and the projection matrix are available, via fine-grained analysis of outliers, a weight for each sample is learned hierarchically so that outliers have small weights while normal samples have large weights. With the learned weights available, a weighted optimization problem is solved to estimate both the data mean and the projection matrix. Because the learned weights discriminate outliers from normal samples, the adverse influence of outliers is mitigated due to the corresponding small weights. Experiments on toy data, UCI dataset, and face dataset demonstrate the effectiveness of the proposed method in estimating the mean and the projection matrix from the data containing outliers.

algorithm, outlier, projection matrix, (15 more...)

2408.12366

Country:

Asia > China > Heilongjiang Province > Harbin (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Hampshire > Southampton (0.04)

Genre: Research Report (0.40)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

Namdari, Jamshid, Manatunga, Amita, Ferrarelli, Fabio, Krafty, Robert

Localized Sparse Principal Component Analysis of Multivariate Time Series in Frequency Domain

arXiv.org Machine LearningAug-15-2024

Since its first descriptions by Pearson (1901) and by Hotelling (1933), principal component analysis (PCA) has been one of the main multivariate analysis techniques for dimension reduction and feature extraction. PCA has become an essential tool for not just independent and identically distributed (iid) multivariate data, but also for serially correlated multivariate time series data in both the time and frequency domains. In the frequency domain, PCA as a sequential method for finding directions of maximum variability appeared in the work of Brillinger (1964) and Goodman (1967). Brillinger (1969) formulated the principal component series through an optimal linear filtering that transmit a p-dimensional signal through a d-dimensional channel and recovers it with minimum loss of information. A foundational discussion of theory and applications of PCA in frequency domain can be found in Brillinger (2001); recent applications of this framework include uncovering non-coherent block structures (Sundararajan, 2021), time-frequency analysis (Ombao et al., 2005) and change point detection (Jiao et al., 2021). PCA for the frequency domain analysis of high-dimensional multivariate time series faces several challenges. The first challenge, which is not unique to frequency domain PCA and is a challenge for PCA in general, is high-dimensionality. When the dimension is fixed, sample eigenvectors, and consequently sample estimates of the principal components, are consistent and asymptotically normally distributed (Anderson, 1958). However, in highdimensional regimes, where the dimension of the random variable grows, sample PCs fail to be consistent.

frequency, principal subspace, subspace, (13 more...)

2408.08177

Country:

North America > United States > New York (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > United Kingdom > Wales > Cardiff (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Health Care Providers & Services (0.67)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

Teo, Rachel S. Y., Nguyen, Tan M.

Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis

arXiv.org Machine LearningJun-19-2024

The remarkable success of transformers in sequence modeling tasks, spanning various applications in natural language processing and computer vision, is attributed to the critical role of self-attention. Similar to the development of most deep learning models, the construction of these attention mechanisms rely on heuristics and experience. In our work, we derive self-attention from kernel principal component analysis (kernel PCA) and show that self-attention projects its query vectors onto the principal component axes of its key matrix in a feature space. We then formulate the exact formula for the value matrix in self-attention, theoretically and empirically demonstrating that this value matrix captures the eigenvectors of the Gram matrix of the key vectors in self-attention. Leveraging our kernel PCA framework, we propose Attention with Robust Principal Components (RPC-Attention), a novel class of robust attention that is resilient to data contamination. We empirically demonstrate the advantages of RPC-Attention over softmax attention on the ImageNet-1K object classification, WikiText-103 language modeling, and ADE20K image segmentation task.

rpc-attention, rpc-symvit, transformer, (16 more...)

2406.13762

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Tuscany > Florence (0.04)
Asia > China > Hong Kong (0.04)
(3 more...)

Genre: Research Report (0.82)

Industry:

Information Technology (0.68)
Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

arXiv.org Artificial IntelligenceJun-5-2024

Randomized Principal Component Analysis for Hyperspectral Image Classification

Ustuner, Mustafa

The high-dimensional feature space of the hyperspectral imagery poses major challenges to the processing and analysis of the hyperspectral data sets. In such a case, dimensionality reduction is necessary to decrease the computational complexity. The random projections open up new ways of dimensionality reduction, especially for large data sets. In this paper, the principal component analysis (PCA) and randomized principal component analysis (R-PCA) for the classification of hyperspectral images using support vector machines (SVM) and light gradient boosting machines (LightGBM) have been investigated. In this experimental research, the number of features was reduced to 20 and 30 for classification of two hyperspectral datasets (Indian Pines and Pavia University). The experimental results demonstrated that PCA outperformed R-PCA for SVM for both datasets, but received close accuracy values for LightGBM. The highest classification accuracies were obtained as 0.9925 and 0.9639 by LightGBM with original features for the Pavia University and Indian Pines, respectively.

classification, hyperspectral image classification, reduction, (13 more...)

doi: 10.1109/M2GARSS57310.2024.10537329

2403.09117

Country: Asia > Middle East > Republic of Türkiye > Artvin Province > Artvin (0.05)

Genre: Research Report > Experimental Study (0.68)

Industry: Energy (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.70)

Giuliani, Pablo, Godbey, Kyle, Kejzlar, Vojtech, Nazarewicz, Witold

Model orthogonalization and Bayesian forecast mixing via Principal Component Analysis

arXiv.org Machine LearningMay-17-2024

One can improve predictability in the unknown domain by combining forecasts of imperfect complex computational models using a Bayesian statistical machine learning framework. In many cases, however, the models used in the mixing process are similar. In addition to contaminating the model space, the existence of such similar, or even redundant, models during the multimodeling process can result in misinterpretation of results and deterioration of predictive performance. In this work we describe a method based on the Principal Component Analysis that eliminates model redundancy. We show that by adding model orthogonalization to the proposed Bayesian Model Combination framework, one can arrive at better prediction accuracy and reach excellent uncertainty quantification performance.

doi, forecast, principal component, (15 more...)

2405.10839

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > Michigan > Ingham County > Lansing (0.04)
North America > United States > Michigan > Ingham County > East Lansing (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry:

Energy (0.46)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.50)

arXiv.org Artificial IntelligenceApr-15-2024

Computer aided diagnosis system for Alzheimers disease using principal component analysis and machine learning based approaches

Lazli, Lilia

Alzheimer's disease is a severe neurological brain disorder. It is not curable, but earlier detection can help improve symptoms in a great deal. The machine learning-based approaches are popular and well-motivated models for many medical image processing tasks such as computer-aided diagnosis. These techniques can vastly improve the process for accurate diagnosis of Alzheimer's disease. In this paper, we investigate the performance of these techniques for Alzheimer's disease detection and classification using brain MRI and PET images from the OASIS database. The proposed system takes advantage of the powerful artificial neural network and support vector machines as classifiers, as well as principal component analysis as a feature extraction technique. The results indicate that the combined scheme achieves good accuracy and offers a significant advantage over the other approaches.

cad system, component analysis, validation, (12 more...)

2405.09553

Country: North America > Canada > Quebec > Montreal (0.05)

Genre: Research Report (0.40)

Industry: Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.62)

arXiv.org Artificial IntelligenceMar-27-2024

Implementation of the Principal Component Analysis onto High-Performance Computer Facilities for Hyperspectral Dimensionality Reduction: Results and Comparisons

Martel, E., Lazcano, R., Lopez, J., Madroñal, D., Salvador, R., Lopez, S., Juarez, E., Guerra, R., Sanz, C., Sarmiento, R.

Dimensionality reduction represents a critical preprocessing step in order to increase the efficiency and the performance of many hyperspectral imaging algorithms. However, dimensionality reduction algorithms, such as the Principal Component Analysis (PCA), suffer from their computationally demanding nature, becoming advisable for their implementation onto high-performance computer architectures for applications under strict latency constraints. This work presents the implementation of the PCA algorithm onto two different high-performance devices, namely, an NVIDIA Graphics Processing Unit (GPU) and a Kalray manycore, uncovering a highly valuable set of tips and tricks in order to take full advantage of the inherent parallelism of these high-performance computing platforms, and hence, reducing the time that is required to process a given hyperspectral image. Moreover, the achieved results obtained with different hyperspectral images have been compared with the ones that were obtained with a field programmable gate array (FPGA)-based implementation of the PCA algorithm that has been recently published, providing, for the first time in the literature, a comprehensive analysis in order to highlight the pros and cons of each option.

algorithm, implementation, matrix, (15 more...)

doi: 10.3390/rs10060864

2403.18321

Country:

Europe > Spain > Canary Islands > Gran Canaria > Las Palmas de Gran Canaria (0.14)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
Europe > Spain > Galicia > Madrid (0.04)
(13 more...)

Genre:

Workflow (0.87)
Research Report (0.63)

Industry:

Information Technology (0.89)
Government > Regional Government (0.67)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Dimensionality Reduction (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

Neural Information Processing SystemsMar-15-2024, 14:06:50 GMT

Demixed Principal Component Analysis

In many experiments, the data points collected live in high-dimensional observation spaces, yet can be assigned a set of labels or parameters. In electrophysiological recordings, for instance, the responses of populations of neurons generally depend on mixtures of experimentally controlled parameters. The heterogeneity and diversity of these parameter dependencies can make visualization and interpretation of such data extremely difficult. Standard dimensionality reduction techniques such as principal component analysis (PCA) can provide a succinct and complete description of the data, but the description is constructed independent of the relevant task variables and is often hard to interpret. Here, we start with the assumption that a particularly informative description is one that reveals the dependency of the high-dimensional data on the individual parameters. We show how to modify the loss function of PCA so that the principal components seek to capture both the maximum amount of variance about the data, while also depending on a minimum number of parameters. We call this method demixed principal component analysis (dPCA) as the principal components here segregate the parameter dependencies. We phrase the problem as a probabilistic graphical model, and present a fast Expectation-Maximization (EM) algorithm. We demonstrate the use of this algorithm for electrophysiological data and show that it serves to demix the parameter-dependence of a neural population response.

matrix, principal component analysis, variance, (13 more...)

Neural Information Processing Systems

Country:

Europe > Portugal > Lisbon > Lisbon (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(2 more...)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.82)