Principal Component Analysis
Objective-Sensitive Principal Component Analysis for High-Dimensional Inverse Problems
Elizarev, Maksim, Mukhin, Andrei, Khlyupin, Aleksey
We present a novel approach for the adaptive, differentiable parameterization of large-scale random fields. Coupled with any gradient-based optimization algorithm, the approach can be applied to a variety of optimization problems, including history matching. The developed technique is based on principal component analysis (PCA) but modifies the purely data-driven basis of principal components by taking objective function behavior into account. To define an efficient encoding, Gradient-Sensitive PCA uses the gradient of the objective function with respect to model parameters. We propose computationally efficient implementations of the technique, two of which are based on stationary perturbation theory (SPT). The optimality, correctness, and low computational cost of the new encoding approach are tested, verified, and discussed. Three algorithms for optimal parameter decomposition are presented and applied to a 2D synthetic history-matching objective. The results demonstrate improvements in encoding quality with respect to objective function minimization and distributional patterns of the desired field. Possible applications and extensions are proposed.
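To make the general framework concrete, the sketch below shows a plain PCA parameterization of a random field optimized with gradient descent in the reduced space. It illustrates only the generic PCA-plus-gradient setup the abstract builds on, not the objective-sensitive basis modification itself; the ensemble, the linear forward operator G, and the observations d_obs are hypothetical placeholders.

```python
# Minimal sketch: optimizing a PCA-parameterized random field with gradient descent.
# Only the generic PCA + gradient framework is shown; the Gradient-Sensitive
# modification of the basis is NOT reproduced here.
import numpy as np

rng = np.random.default_rng(0)

# Assumed: an ensemble of prior field realizations (n_realizations x n_cells).
ensemble = rng.normal(size=(200, 1000))
mean = ensemble.mean(axis=0)

# Principal components of the ensemble; columns of Phi span the reduced space.
U, s, Vt = np.linalg.svd(ensemble - mean, full_matrices=False)
k = 20                                                   # retained components (illustrative)
Phi = Vt[:k].T * (s[:k] / np.sqrt(len(ensemble) - 1))    # n_cells x k

# Hypothetical linear forward operator and observations standing in for history matching.
G = rng.normal(size=(50, 1000)) / np.sqrt(1000)
d_obs = rng.normal(size=50)

def objective(m):
    """Toy data-mismatch objective ||G m - d_obs||^2."""
    r = G @ m - d_obs
    return r @ r

z = np.zeros(k)          # reduced parameters
lr = 0.1
for _ in range(100):
    m = mean + Phi @ z                        # decode field from reduced parameters
    grad_m = 2.0 * G.T @ (G @ m - d_obs)      # objective gradient w.r.t. the field
    grad_z = Phi.T @ grad_m                   # chain rule through the linear decoder
    z -= lr * grad_z

print("final objective:", objective(mean + Phi @ z))
```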
Principal Component Analysis Based on T$\ell_1$-norm Maximization
Yang, Xiang-Fei, Shao, Yuan-Hai, Li, Chun-Na, Liu, Li-Ming, Deng, Nai-Yang
Classical principal component analysis (PCA) may suffer from sensitivity to outliers and noise. Therefore, PCA based on the $\ell_1$-norm and the $\ell_p$-norm ($0 < p < 1$) has been studied. Among these variants, the ones based on the $\ell_p$-norm seem to be the most interesting from the robustness point of view; however, their numerical performance is not satisfactory. Note that, although the T$\ell_1$-norm is similar to the $\ell_p$-norm ($0 < p < 1$) in some sense, it has a stronger suppression effect on outliers and better continuity. We therefore propose PCA based on the T$\ell_1$-norm in this paper. Our numerical experiments show that its performance is clearly superior to that of PCA-$\ell_p$, $\ell_p$SPCA, PCA, and PCA-$\ell_1$.
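A small numerical comparison can illustrate the suppression effect mentioned above. The sketch assumes the transformed-$\ell_1$ form $T_a(x) = \sum_i (a+1)|x_i| / (a + |x_i|)$, which is the common definition of the transformed $\ell_1$ penalty; the paper's exact parameterization may differ, and the vectors below are purely illustrative.

```python
# Compare how l1, lp (0<p<1), and the transformed l1 (Tl1) react to a single large outlier.
import numpy as np

def l1(x):
    return np.abs(x).sum()

def lp(x, p=0.5):
    return (np.abs(x) ** p).sum()

def tl1(x, a=1.0):
    ax = np.abs(x)
    return ((a + 1.0) * ax / (a + ax)).sum()   # each entry's contribution is bounded by a + 1

clean   = np.array([1.0, -0.5, 0.8, -1.2])
outlier = np.array([1.0, -0.5, 0.8, -1.2, 50.0])   # same vector with one large outlier appended

for name, f in [("l1", l1), ("lp (p=0.5)", lp), ("Tl1 (a=1)", tl1)]:
    print(f"{name:12s} clean = {f(clean):7.3f}   with outlier = {f(outlier):7.3f}")
# The l1 value grows by 50 with the outlier, the lp value by about 7,
# while the Tl1 value grows by less than 2, since each term saturates at a + 1.
```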
Your Ultimate Data Mining & Machine Learning Cheat Sheet
Dimensionality reduction is the process of expressing high-dimensional data in a reduced number of dimensions such that each dimension retains as much information as possible. Dimensionality reduction may be used to visualize high-dimensional data or to speed up machine learning models by removing low-information or correlated features. Principal Component Analysis, or PCA, is a popular method of reducing the dimensionality of data by drawing several orthogonal (perpendicular) vectors in the feature space to represent the reduced number of dimensions. The number of components chosen determines how many dimensions the reduced data will have; in the case of visualization, for example, it would be two.
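A minimal example of this idea using scikit-learn's PCA, reducing data to two components for visualization; the dataset and the induced correlation are illustrative only.

```python
# Reduce a 10-dimensional dataset to 2 dimensions with PCA for plotting.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(42)
X = rng.normal(size=(500, 10))                     # 500 samples, 10 features
X[:, 1] = X[:, 0] + 0.1 * rng.normal(size=500)     # make two features strongly correlated

pca = PCA(n_components=2)                          # number of dimensions to keep
X_2d = pca.fit_transform(X)

print(X_2d.shape)                                  # (500, 2) -> ready to scatter-plot
print(pca.explained_variance_ratio_)               # share of variance kept by each component
```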
Understanding Principal Component Analysis - GreatLearning
When working with different machine learning techniques for data analysis, we deal with hundreds or thousands of variables, most of which are correlated with each other. Principal Component Analysis and Factor Analysis are techniques used to deal with such scenarios. Principal Component Analysis (PCA) is an unsupervised statistical technique; it is a "dimensionality reduction" method.
What is Principal Component Analysis in Machine Learning? Super Easy!
Do you want to know what Principal Component Analysis is? If yes, then this blog is just for you. Here I will discuss what Principal Component Analysis is, its purpose, and how PCA works. So give this article a few minutes of your time to get all the details regarding Principal Component Analysis. Principal Component Analysis (PCA) is one of the best unsupervised algorithms.
A Communication-Efficient Distributed Algorithm for Kernel Principal Component Analysis
He, Fan, Huang, Xiaolin, Lv, Kexin, Yang, Jie
Principal Component Analysis (PCA) is a fundamental technology in machine learning. Nowadays, many large high-dimensional datasets are acquired in a distributed manner, which precludes the use of centralized PCA due to the high communication cost and privacy risk. Thus, many distributed PCA algorithms have been proposed, most of which, however, focus on linear cases. To efficiently extract non-linear features, this brief proposes a communication-efficient distributed kernel PCA algorithm, in which linear and RBF kernels are applied. The key is to estimate the global empirical kernel matrix from the eigenvectors of local kernel matrices. The approximation error of the estimators is theoretically analyzed for both linear and RBF kernels. The result suggests that when eigenvalues decay fast, which is common for RBF kernels, the proposed algorithm gives high-quality results with low communication cost. Results of simulation experiments verify our theoretical analysis, and experiments on the GSE2187 dataset show the effectiveness of the proposed algorithm.
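For reference, the sketch below shows a minimal centralized RBF kernel PCA, i.e., the computation whose result the distributed algorithm above aims to approximate from local eigenvectors; the distributed estimation step itself is not reproduced, and the kernel width and data are illustrative assumptions.

```python
# Centralized RBF kernel PCA: build the kernel matrix, center it, and project
# the training points onto the leading kernel principal components.
import numpy as np

def rbf_kernel(X, gamma=0.5):
    sq = np.sum(X ** 2, axis=1)
    return np.exp(-gamma * (sq[:, None] + sq[None, :] - 2.0 * X @ X.T))

def kernel_pca(X, n_components=2, gamma=0.5):
    n = X.shape[0]
    K = rbf_kernel(X, gamma)
    one = np.full((n, n), 1.0 / n)
    Kc = K - one @ K - K @ one + one @ K @ one       # center the kernel matrix in feature space
    vals, vecs = np.linalg.eigh(Kc)
    idx = np.argsort(vals)[::-1][:n_components]      # keep the largest eigenvalues
    vals, vecs = vals[idx], vecs[:, idx]
    return vecs * np.sqrt(np.maximum(vals, 0.0))     # projections of the training points

X = np.random.default_rng(1).normal(size=(100, 5))
print(kernel_pca(X).shape)                            # (100, 2)
```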
Sparse probabilistic projections
Archambeau, Cédric, Bach, Francis R.
We present a generative model for performing sparse probabilistic projections, which includes sparse principal component analysis and sparse canonical correlation analysis as special cases. Sparsity is enforced by means of automatic relevance determination or by imposing appropriate prior distributions, such as generalised hyperbolic distributions. We derive a variational Expectation-Maximisation algorithm for the estimation of the hyperparameters and show that our novel probabilistic approach compares favourably to existing techniques. We illustrate how the proposed method can be applied in the context of cryptanalysis as a pre-processing tool for the construction of template attacks.
Demixed Principal Component Analysis
Brendel, Wieland, Romo, Ranulfo, Machens, Christian K.
In many experiments, the data points collected live in high-dimensional observation spaces, yet can be assigned a set of labels or parameters. In electrophysiological recordings, for instance, the responses of populations of neurons generally depend on mixtures of experimentally controlled parameters. The heterogeneity and diversity of these parameter dependencies can make visualization and interpretation of such data extremely difficult. Standard dimensionality reduction techniques such as principal component analysis (PCA) can provide a succinct and complete description of the data, but the description is constructed independently of the relevant task variables and is often hard to interpret. Here, we start with the assumption that a particularly informative description is one that reveals the dependency of the high-dimensional data on the individual parameters.
Streaming Kernel PCA with $\tilde{O}(\sqrt{n})$ Random Features
Ullah, Enayat, Mianjy, Poorya, Marinov, Teodor Vanislavov, Arora, Raman
We study the statistical and computational aspects of kernel principal component analysis using random Fourier features and show that, under mild assumptions, $O(\sqrt{n} \log n)$ features suffice to achieve $O(1/\epsilon^2)$ sample complexity. Furthermore, we give a memory-efficient streaming algorithm based on classical Oja's algorithm that achieves this rate.
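The sketch below combines the two ingredients named in the abstract: random Fourier features approximating an RBF kernel, and an Oja-style streaming update in the feature space. The feature dimension, step size, and single-component update are illustrative choices, not the paper's exact algorithm.

```python
# Streaming estimate of the top kernel principal direction via random Fourier features + Oja's rule.
import numpy as np

rng = np.random.default_rng(0)
d, D = 10, 200                                   # input dimension, number of random features

# Random Fourier features for the RBF kernel k(x, y) = exp(-||x - y||^2 / 2).
Omega = rng.normal(size=(d, D))                  # random frequencies ~ N(0, I)
b = rng.uniform(0.0, 2.0 * np.pi, size=D)        # random phases

def phi(x):
    return np.sqrt(2.0 / D) * np.cos(x @ Omega + b)

# Oja's rule for the leading principal direction of the streamed feature vectors.
w = rng.normal(size=D)
w /= np.linalg.norm(w)
eta = 0.05
for t in range(5000):
    x = rng.normal(size=d)                       # one streaming sample
    z = phi(x)
    w += eta * z * (z @ w)                       # Oja update: w <- w + eta * z z^T w
    w /= np.linalg.norm(w)                       # keep w on the unit sphere

print("leading direction in feature space (first 5 coords):", w[:5])
```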
On the Sample Complexity of Subspace Learning
Rudi, Alessandro, Canas, Guillermo D., Rosasco, Lorenzo
A large number of algorithms in machine learning, from principal component analysis (PCA) and its non-linear (kernel) extensions, to more recent spectral embedding and support estimation methods, rely on estimating a linear subspace from samples. In this paper we introduce a general formulation of this problem and derive novel learning error estimates. Our results rely on natural assumptions on the spectral properties of the covariance operator associated with the data distribution, and hold for a wide class of metrics between subspaces. As special cases, we discuss sharp error estimates for the reconstruction properties of PCA and spectral support estimation. Key to our analysis is an operator-theoretic approach that has broad applicability to spectral learning methods.