AITopics

2006.04527

Country:

Europe (0.28)
Africa > Cameroon > Gulf of Guinea (0.24)

Genre: Research Report (1.00)

Industry: Energy > Oil & Gas > Upstream (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

arXiv.org Machine LearningMay-23-2020

Principal Component Analysis Based on T$\ell_1$-norm Maximization

Yang, Xiang-Fei, Shao, Yuan-Hai, Li, Chun-Na, Liu, Li-Ming, Deng, Nai-Yang

Classical principal component analysis (PCA) may suffer from the sensitivity to outliers and noise. Therefore PCA based on $\ell_1$-norm and $\ell_p$-norm ($0 < p < 1$) have been studied. Among them, the ones based on $\ell_p$-norm seem to be most interesting from the robustness point of view. However, their numerical performance is not satisfactory. Note that, although T$\ell_1$-norm is similar to $\ell_p$-norm ($0 < p < 1$) in some sense, it has the stronger suppression effect to outliers and better continuity. So PCA based on T$\ell_1$-norm is proposed in this paper. Our numerical experiments have shown that its performance is superior than PCA-$\ell_p$ and $\ell_p$SPCA as well as PCA, PCA-$\ell_1$ obviously.

artificial intelligence, machine learning, tl 1, (14 more...)

2005.12263

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.62)

#artificialintelligenceMay-18-2020, 21:48:54 GMT

Your Ultimate Data Mining & Machine Learning Cheat Sheet

Dimensionality reduction is the process of expressing high-dimensional data in a reduced number of dimensions such that each one contains the most amount of information. Dimensionality reduction may be used for visualization of high-dimensional data or to speed up machine learning models by removing low-information or correlated features. Principal Component Analysis, or PCA, is a popular method of reducing the dimensionality of data by drawing several orthogonal (perpendicular) vectors in the feature space to represent the reduced number of dimensions. The variable number represents the number of dimensions the reduced data will have. In the case of visualization, for example, it would be two dimensions.

artificial intelligence, dimension, machine learning, (10 more...)

Industry:

Materials > Metals & Mining (0.40)
Machinery > Industrial Machinery (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.30)

#artificialintelligenceMay-9-2020, 17:14:17 GMT

Understanding Principal Component Analysis - GreatLearning

While working on different Machine Learning techniques for Data Analysis, we deal with hundreds or thousands of variables. Most of the variables are correlated with each other. Principal Component Analysis and Factor Analysis techniques are used to deal with such scenarios. Principal Component Analysis (PCA) is an unsupervised statistical technique algorithm. PCA is a "dimensionality reduction" method.

artificial intelligence, correlation, machine learning, (14 more...)

Country: Asia > India (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.84)

#artificialintelligenceMay-6-2020, 20:26:50 GMT

What is Principal Component Analysis in Machine Learning? Super Easy!

Do you wanna know What is Principal Component Analysis?. If yes, then this blog is just for you. Here I will discuss What is Principal Component Analysis, its purpose, and How PCA works?. So, give your few minutes to this article in order to get all the details regarding Principal Component Analysis. Principal Component Analysis(PCA) is one of the best-unsupervised algorithms.

artificial intelligence, machine learning, principal component analysis, (12 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (1.00)

arXiv.org Machine LearningMay-6-2020

A Communication-Efficient Distributed Algorithm for Kernel Principal Component Analysis

He, Fan, Huang, Xiaolin, Lv, Kexin, Yang, Jie

Principal Component Analysis (PCA) is a fundamental technology in machine learning. Nowadays many high-dimension large datasets are acquired in a distributed manner, which precludes the use of centralized PCA due to the high communication cost and privacy risk. Thus, many distributed PCA algorithms are proposed, most of which, however, focus on linear cases. To efficiently extract non-linear features, this brief proposes a communication-efficient distributed kernel PCA algorithm, where linear and RBF kernels are applied. The key is to estimate the global empirical kernel matrix from the eigenvectors of local kernel matrices. The approximate error of the estimators is theoretically analyzed for both linear and RBF kernels. The result suggests that when eigenvalues decay fast, which is common for RBF kernels, the proposed algorithm gives high quality results with low communication cost. Results of simulation experiments verify our theory analysis and experiments on GSE2187 dataset show the effectiveness of the proposed algorithm.

algorithm, artificial intelligence, machine learning, (17 more...)

2005.02664

Country:

Asia > China > Shanghai > Shanghai (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

#artificialintelligenceMay-2-2020, 06:36:50 GMT

An overview of Principal Component Analysis

This article will explain you what Principal Component Analysis (PCA) is, why we need it and how we use it. I will try to make it as simple as possible while avoiding hard examples or words which can cause a headache. A moment of honesty: to fully understand this article, a basic understanding of some linear algebra and statistics is essential. Let's say we have 10 variables in our dataset and let's assume that 3 variables capture 90% of the dataset, and 7 variables capture 10% of the dataset. Let's say we want to visualize 10 variables.

dataset, eigenvector, matrix, (14 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

arXiv.org Machine LearningApr-5-2020

Distributed Estimation for Principal Component Analysis: a Gap-free Approach

Chen, Xi, Lee, Jason D., Li, He, Yang, Yun

The growing size of modern data sets brings many challenges to the existing statistical estimation approaches, which calls for new distributed methodologies. This paper studies distributed estimation for a fundamental statistical machine learning problem, principal component analysis (PCA). Despite the massive literature on top eigenvector estimation, much less is presented for the top-$L$-dim ($L > 1$) eigenspace estimation, especially in a distributed manner. We propose a novel multi-round algorithm for constructing top-$L$-dim eigenspace for distributed data. Our algorithm takes advantage of shift-and-invert preconditioning and convex optimization. Our estimator is communication-efficient and achieves a fast convergence rate. In contrast to the existing divide-and-conquer algorithm, our approach has no restriction on the number of machines. Theoretically, we establish a gap-free error bound and abandon the assumption on the sharp eigengap between the $L$-th and the ($L+1$)-th eigenvalues. Our distributed algorithm can be applied to a wide range of statistical problems based on PCA. In particular, this paper illustrates two important applications, principal component regression and single index model, where our distributed algorithm can be extended. Finally, We provide simulation studies to demonstrate the performance of the proposed distributed estimator.

eigenvector, estimator, matrix, (13 more...)

2004.02336

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Illinois (0.04)
North America > United States > New York (0.04)

Genre: Research Report (1.00)

Industry: Education (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.61)

#artificialintelligenceApr-4-2020, 15:59:31 GMT

Machine Learning in Python: Principal Component Analysis (PCA) for Handling High-Dimensional Data

Machine Learning in Python: Principal Component Analysis (PCA) for Handling High-Dimensional Data In this video, I will be showing you how to perform principal component analysis (PCA) in Python using the scikit-learn package. PCA represents a powerful learning approach that enables the analysis of high-dimensional data as well as reveal the contribution of descriptors in governing the distribution of data clusters. Particularly, we will be creating PCA scree plot, scores plot and loadings plot. This video is part of the [Python Data Science Project] series. If you're new here, it would mean the world to me if you would consider subscribing to this channel.

amzn, handling high-dimensional data, principal component analysis, (9 more...)

Industry:

Retail > Online (0.41)
Information Technology > Services (0.41)
Health & Medicine > Consumer Health (0.34)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.85)

Archambeau, Cédric, Bach, Francis R.

Sparse probabilistic projections

Neural Information Processing SystemsMar-19-2020, 18:32:51 GMT

We present a generative model for performing sparse probabilistic projections, which includes sparse principal component analysis and sparse canonical correlation analysis as special cases. Sparsity is enforced by means of automatic relevance determination or by imposing appropriate prior distributions, such as generalised hyperbolic distributions. We derive a variational Expectation-Maximisation algorithm for the estimation of the hyperparameters and show that our novel probabilistic approach compares favourably to existing techniques. We illustrate how the proposed method can be applied in the context of cryptoanalysis as a pre-processing tool for the construction of template attacks. Papers published at the Neural Information Processing Systems Conference.

probabilistic projection, sparse probabilistic projection

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.33)