AITopics | Principal Component Analysis

Soheil Feizi and David Tse Stanford University Abstract In the era of big data, reducing data dimensionality is critical in many areas of science. Widely used Principal Component Analysis (PCA) addresses this problem by computing a low dimensional data embedding that maximally explain variance of the data. However, PCA has two major weaknesses. Firstly, it only considers linear correlations among variables (features), and secondly it is not suitable for categorical data. We resolve these issues by proposing Maximally Correlated Principal Component Analysis (MCPCA). MCPCA computes transformations of variables whose covariance matrix has the largest Ky Fan norm. Variable transformations are unknown, can be nonlinear and are computed in an optimization. MCPCA can also be viewed as a multivariate extension of Maximal Correlation. For jointly Gaussian variables we show that the covariance matrix corresponding to the identity (or the negative of the identity) transformations majorizes covariance matrices of non-identity functions. Using this result we characterize global MCPCA optimizers for nonlinear functions of jointly Gaussian variables for every rank constraint. For categorical variables we characterize global MCPCA optimizers for the rank one constraint based on the leading eigenvector of a matrix computed using pairwise joint distributions. For a general rank constraint we propose a block coordinate descend algorithm and show its convergence to stationary points of the MCPCA optimization. We compare MCPCA with PCA and other state-of-the-art dimensionality reduction methods including Isomap, LLE, multilayer autoencoders (neural networks), kernel PCA, probabilistic PCA and diffusion maps on several synthetic and real datasets. We show that MCPCA consistently provides improved performance compared to other methods. 1 Introduction Let X 1 and X 2 be two mean zero and unit variance random variables. Pearson's correlation [1] defined as ρ Pearson(X 1,X 2) E [X 1X 2 ] (1.1) is a basic statistical parameter and plays a central role in many statistical and machine learning methods such as linear regression [2], principal component analysis [3], and support vector machines [4], partially owing to its simplicity and computational efficiency. Pearson's correlation however has two main weaknesses: firstly it only captures linear dependency between variables, and secondly for discrete (categorical) variables the value of Pearson's correlation depends somewhat arbitrarily on the labels. To overcome these weaknesses, Maximal Correlation (MC) has been proposed and 1 arXiv:1702.05471v2 MC tackles the two main drawbacks of the Pearson's correlation: it models a family of nonlinear relationships between the two variables.

artificial intelligence, machine learning, optimization, (13 more...)

arXiv.org Machine Learning

1702.05471

Country: North America (0.28)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (1.00)

Add feedback

Policy Search with High-Dimensional Context Variables

Tangkaratt, Voot (The University of Tokyo) | Hoof, Herke van (McGill University) | Parisi, Simone (Technical University of Darmstadt) | Neumann, Gerhard (University of Lincoln) | Peters, Jan (Max Planck Institute for Intelligent Systems) | Sugiyama, Masashi (The University of Tokyo)

AAAI ConferencesFeb-14-2017

Direct contextual policy search methods learn to improve policy parameters and simultaneously generalize these parameters to different context or task variables. However, learning from high-dimensional context variables, such as camera images, is still a prominent problem in many real-world tasks. A naive application of unsupervised dimensionality reduction methods to the context variables, such as principal component analysis, is insufficient as task-relevant input may be ignored. In this paper, we propose a contextual policy search method in the model-based relative entropy stochastic search framework with integrated dimensionality reduction. We learn a model of the reward that is locally quadratic in both the policy parameters and the context variables. Furthermore, we perform supervised linear dimensionality reduction on the context variables by nuclear norm regularization. The experimental results show that the proposed method outperforms naive dimensionality reduction via principal component analysis and a state-of-the-art contextual policy search method.

artificial intelligence, machine learning, principal component analysis, (16 more...)

AAAI Conferences

Thirty-First AAAI Conference on Artificial Intelligence

Country:

Europe (0.95)
North America (0.93)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.45)

Add feedback

Can You Use Principal Component Analysis with a Training Set Test Set Model?

#artificialintelligenceJan-21-2017, 01:20:18 GMT

I recently gave a free webinar on Principal Component Analysis. We had almost 300 researchers attend and didn't get through all the questions. This is part of a series of answers to those questions. If you missed it, you can get the webinar recording here. Principal Component Analysis specifically could be used with a training and test data set, but it doesn't make as much sense as doing so for Factor Analysis.

artificial intelligence, factor analysis, machine learning, (6 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.86)

Add feedback

Applications of electronic noses and tongues in food analysis

AITopics Original LinksJan-19-2017, 11:00:32 GMT

This review examines the applications of electronic noses and tongues in food analysis. A brief history of the development of sensors is included and this is illustrated by descriptions of the different types of sensors utilized in these devices. As pattern recognition techniques are widely used to analyse the data obtained from these multisensor arrays, a discussion of principal components analysis and artificial neural networks is essential. An introduction to the integration of electronic tongues and noses is also incorporated and the strengths and weaknesses of both are described. Applications described include identification and classification of flavour and aroma and other measurements of quality using the electronic nose.

application, artificial intelligence, principal component analysis, (3 more...)

AITopics Original Links

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.32)

Add feedback

Towards multiple kernel principal component analysis for integrative analysis of tumor samples

Speicher, Nora K., Pfeifer, Nico

arXiv.org Machine LearningJan-3-2017

Personalized treatment of patients based on tissue-specific cancer subtypes has strongly increased the efficacy of the chosen therapies. Even though the amount of data measured for cancer patients has increased over the last years, most cancer subtypes are still diagnosed based on individual data sources (e.g. gene expression data). We propose an unsupervised data integration method based on kernel principal component analysis. Principal component analysis is one of the most widely used techniques in data analysis. Unfortunately, the straight-forward multiple-kernel extension of this method leads to the use of only one of the input matrices, which does not fit the goal of gaining information from all data sources. Therefore, we present a scoring function to determine the impact of each input matrix. The approach enables visualizing the integrated data and subsequent clustering for cancer subtype identification. Due to the nature of the method, no free parameters have to be set. We apply the methodology to five different cancer data sets and demonstrate its advantages in terms of results and usability.

artificial intelligence, machine learning, variance, (17 more...)

arXiv.org Machine Learning

doi: 10.1515/jib-2017-0019

1701.00422

Country: Europe > Germany > Saarland (0.15)

Genre: Research Report (0.83)

Industry: Health & Medicine > Therapeutic Area > Oncology > Carcinoma (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.83)

Add feedback

Correlated-PCA: Principal Components' Analysis when Data and Noise are Correlated

Vaswani, Namrata, Guo, Han

Neural Information Processing SystemsDec-31-2016

Given a matrix of observed data, Principal Components Analysis (PCA) computes a small number of orthogonal directions that contain most of its variability. Provably accurate solutions for PCA have been in use for a long time. However, to the best of our knowledge, all existing theoretical guarantees for it assume that the data and the corrupting noise are mutually independent, or at least uncorrelated. This is valid in practice often, but not always. In this paper, we study the PCA problem in the setting where the data and noise can be correlated. Such noise is often also referred to as ``data-dependent noise". We obtain a correctness result for the standard eigenvalue decomposition (EVD) based solution to PCA under simple assumptions on the data-noise correlation. We also develop and analyze a generalization of EVD, cluster-EVD, that improves upon EVD in certain regimes.

artificial intelligence, assumption, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

Add feedback

Policy Search with High-Dimensional Context Variables

Tangkaratt, Voot, van Hoof, Herke, Parisi, Simone, Neumann, Gerhard, Peters, Jan, Sugiyama, Masashi

arXiv.org Machine LearningNov-10-2016

Direct contextual policy search methods learn to improve policy parameters and simultaneously generalize these parameters to different context or task variables. However, learning from high-dimensional context variables, such as camera images, is still a prominent problem in many real-world tasks. A naive application of unsupervised dimensionality reduction methods to the context variables, such as principal component analysis, is insufficient as task-relevant input may be ignored. In this paper, we propose a contextual policy search method in the model-based relative entropy stochastic search framework with integrated dimensionality reduction. We learn a model of the reward that is locally quadratic in both the policy parameters and the context variables. Furthermore, we perform supervised linear dimensionality reduction on the context variables by nuclear norm regularization. The experimental results show that the proposed method outperforms naive dimensionality reduction via principal component analysis and a state-of-the-art contextual policy search method.

artificial intelligence, deep learning, machine learning, (20 more...)

arXiv.org Machine Learning

1611.03231

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.45)

Add feedback

Correlated-PCA: Principal Components' Analysis when Data and Noise are Correlated

Vaswani, Namrata, Guo, Han

arXiv.org Machine LearningOct-31-2016

Given a matrix of observed data, Principal Components Analysis (PCA) computes a small number of orthogonal directions that contain most of its variability. Provably accurate solutions for PCA have been in use for a long time. However, to the best of our knowledge, all existing theoretical guarantees for it assume that the data and the corrupting noise are mutually independent, or at least uncorrelated. This is valid in practice often, but not always. In this paper, we study the PCA problem in the setting where the data and noise can be correlated. Such noise is often also referred to as "data-dependent noise". We obtain a correctness result for the standard eigenvalue decomposition (EVD) based solution to PCA under simple assumptions on the data-noise correlation. We also develop and analyze a generalization of EVD, cluster-EVD, that improves upon EVD in certain regimes.

artificial intelligence, machine learning, principal component analysis, (2 more...)

arXiv.org Machine Learning

1610.09307

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.60)

Add feedback

Efficient L1-Norm Principal-Component Analysis via Bit Flipping

Markopoulos, Panos P., Kundu, Sandipan, Chamadia, Shubham, Pados, Dimitris A.

arXiv.org Machine LearningOct-6-2016

It was shown recently that the $K$ L1-norm principal components (L1-PCs) of a real-valued data matrix $\mathbf X \in \mathbb R^{D \times N}$ ($N$ data samples of $D$ dimensions) can be exactly calculated with cost $\mathcal{O}(2^{NK})$ or, when advantageous, $\mathcal{O}(N^{dK - K + 1})$ where $d=\mathrm{rank}(\mathbf X)$, $K

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

doi: 10.1109/TSP.2017.2708023

1610.01959

Country:

Europe (1.00)
North America > United States > New York (0.46)
North America > United States > California (0.46)

Genre: Research Report > New Finding (0.34)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.93)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.41)

Add feedback

Iteratively Reweighted Least Squares Algorithms for L1-Norm Principal Component Analysis

Park, Young Woong, Klabjan, Diego

arXiv.org Machine LearningSep-19-2016

Principal component analysis (PCA) is a technique to find orthonormal vectors, which are a linear combination of the attributes of the data, that explain the variance structure of the data [12]. Since a few orthonormal vectors usually explain most of the variance, PCA is often used to reduce dimension of the data by keeping only a few of the orthonormal vectors. These orthonormal vectors are called principal components (PCs). For dimensionality reduction, we are given target dimension p, the number of PCs. To measure accuracy, given p principal components, first, the original data is projected into the lower dimension using the PCs. Next, the projected data in the lower dimension is lifted to the original dimension using the PCs. Observe that this procedure causes loss of some information if p is smaller than the dimension of the original attribute space. The reconstruction error is defined by the difference between the projected-and-lifted data and the original data. To select the best p PCs, the following two objective functions are usually used: [P1] minimization of the reconstruction error, [P2] maximization of the variance of the projected data.

algorithm, artificial intelligence, machine learning, (14 more...)

arXiv.org Machine Learning

doi: 10.1109/ICDM.2016.0054

1609.02997

Country:

North America > United States (0.67)
Europe (0.46)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

Add feedback

Filters

Collaborating Authors

Principal Component Analysis

Maximally Correlated Principal Component Analysis

Policy Search with High-Dimensional Context Variables

Can You Use Principal Component Analysis with a Training Set Test Set Model?

Applications of electronic noses and tongues in food analysis

Towards multiple kernel principal component analysis for integrative analysis of tumor samples

Correlated-PCA: Principal Components' Analysis when Data and Noise are Correlated

Policy Search with High-Dimensional Context Variables

Correlated-PCA: Principal Components' Analysis when Data and Noise are Correlated

Efficient L1-Norm Principal-Component Analysis via Bit Flipping

Iteratively Reweighted Least Squares Algorithms for L1-Norm Principal Component Analysis