Query: main stage
Easy-to-Hard Learning for Information Extraction
Gao, Chang, Zhang, Wenxuan, Lam, Wai, Bing, Lidong
Information extraction (IE) systems aim to automatically extract structured information, such as named entities, relations between entities, and events, from unstructured texts. While most existing work addresses a particular IE task, universal modeling of various IE tasks with one model has recently achieved great success. Despite this success, such universal models employ a one-stage learning strategy, i.e., they directly learn to extract the target structure given the input text, which contradicts the human learning process. In this paper, we propose a unified easy-to-hard learning framework for IE that mimics the human learning process and consists of three stages: the easy stage, the hard stage, and the main stage. By breaking the learning process into multiple stages, our framework helps the model acquire general IE task knowledge and improves its generalization ability. Extensive experiments across four IE tasks demonstrate the effectiveness of our framework: we achieve new state-of-the-art results on 13 of 17 datasets. Our code is available at \url{https://github.com/DAMO-NLP-SG/IE-E2H}.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Middle East > Iraq (0.05)
- Europe > United Kingdom (0.04)
- (15 more...)
- Overview (0.68)
- Research Report (0.64)
- Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.94)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.88)
- Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.71)
- Information Technology > Data Science > Data Mining > Text Mining (0.61)
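The three-stage recipe described in the abstract above can be sketched as a simple curriculum loop. The stage-specific data construction below is purely illustrative (the paper's actual skill decomposition and stage definitions differ); `build_stage_data`, `easy_to_hard_train`, and `train_fn` are hypothetical names introduced here:

```python
# Hypothetical sketch of an easy-to-hard curriculum for a seq2seq IE model.
# Stage names follow the abstract; the per-stage task construction is illustrative only.

def build_stage_data(examples, stage):
    """Construct (input, target) training pairs for each stage."""
    data = []
    for text, structure in examples:
        if stage == "easy":
            # Easy stage (illustrative): learn sub-skills, e.g. predicting
            # one piece of the target structure at a time.
            for piece in structure.split(";"):
                data.append((text, piece.strip()))
        elif stage == "hard":
            # Hard stage (illustrative): harder combined tasks on the same text.
            data.append((text, structure))
        else:
            # Main stage: the full target extraction task.
            data.append((text, structure))
    return data

def easy_to_hard_train(model, examples, train_fn):
    """Train sequentially on the easy, hard, and main stages."""
    for stage in ["easy", "hard", "main"]:
        train_fn(model, build_stage_data(examples, stage))
    return model
```

The point of the sketch is the ordering: the same model is trained on progressively harder views of the same supervision before the final extraction task, rather than on the target structure from the start.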
Don't miss the Q&A Sessions at Disrupt Berlin 2019 – TechCrunch
The early-stage startup community knows that the Disrupt Main Stage is the place to hear and learn from iconic founders, technologists and investment leaders. And the speakers you'll hear at Disrupt Berlin 2019 on 11-12 December will follow that grand tradition. Don't have a ticket yet? Buy your early bird pass today and save up to €500. "Always leave them wanting more" -- a phrase often attributed to P.T. Barnum -- also applies to the Disrupt Main Stage.
- Information Technology > Artificial Intelligence (0.54)
- Information Technology > e-Commerce (0.34)
Tighten after Relax: Minimax-Optimal Sparse PCA in Polynomial Time
Wang, Zhaoran, Lu, Huanran, Liu, Han
We provide a statistical and computational analysis of sparse Principal Component Analysis (PCA) in high dimensions. The sparse PCA problem is highly nonconvex in nature. Consequently, though its global solution attains the optimal statistical rate of convergence, such a solution is computationally intractable to obtain. Meanwhile, although its convex relaxations are tractable to compute, they yield estimators with suboptimal statistical rates of convergence. On the other hand, existing nonconvex optimization procedures, such as greedy methods, lack statistical guarantees. In this paper, we propose a two-stage sparse PCA procedure that attains the optimal principal subspace estimator in polynomial time. The main stage employs a novel algorithm named sparse orthogonal iteration pursuit, which iteratively solves the underlying nonconvex problem. However, our analysis shows that this algorithm has the desired computational and statistical guarantees only within a restricted region, namely the basin of attraction. To obtain an initial estimator that falls into this region, we solve a convex relaxation of sparse PCA with early stopping. Under an integrated analytic framework, we simultaneously characterize the computational and statistical performance of this two-stage procedure. Computationally, our procedure converges at the rate of $1/\sqrt{t}$ within the initialization stage, where $t$ is the number of iterations, and at a geometric rate within the main stage. Statistically, the final principal subspace estimator achieves the minimax-optimal statistical rate of convergence with respect to the sparsity level $s^*$, dimension $d$, and sample size $n$. Our procedure motivates a general paradigm for tackling nonconvex statistical learning problems with provable statistical guarantees.
- North America > United States > New Jersey > Mercer County > Princeton (0.04)
- Asia > Middle East > Jordan (0.04)
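As a rough illustration of the two-stage idea in the abstract above, here is a minimal Python sketch that pairs a crude diagonal-based initialization with a truncated power iteration as the main stage. Both stages are simplifications introduced here for illustration; they are not the paper's actual sparse orthogonal iteration pursuit or its convex-relaxation initialization:

```python
import numpy as np

def truncate(v, s):
    """Keep the s largest-magnitude entries of v, zero out the rest."""
    w = np.zeros_like(v)
    idx = np.argsort(np.abs(v))[-s:]
    w[idx] = v[idx]
    return w

def sparse_pca_two_stage(Sigma, s, iters=100):
    """Two-stage sketch: crude initialization, then sparse power iteration.

    Sigma : (d, d) sample covariance matrix
    s     : target sparsity level
    """
    d = Sigma.shape[0]
    # Initialization stage (simplified): start supported on the s
    # coordinates with the largest diagonal entries of Sigma.
    v = np.zeros(d)
    top = np.argsort(np.diag(Sigma))[-s:]
    v[top] = 1.0 / np.sqrt(s)
    # Main stage (simplified): power iteration with hard truncation
    # to sparsity s, iteratively attacking the nonconvex problem.
    for _ in range(iters):
        v = Sigma @ v
        v = truncate(v, s)
        v = v / np.linalg.norm(v)
    return v
```

The structure mirrors the abstract: a cheap, provably "good enough" starting point lands the iterate near the leading sparse eigenvector, after which the nonconvex main-stage iteration refines it rapidly.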