AITopics | Peter Richtarik

Collaborating Authors

Peter Richtarik

SEGA: Variance Reduction via Gradient Sketching

Filip Hanzely, Konstantin Mishchenko, Peter Richtarik

Neural Information Processing Systems, May-26-2025, 11:12:55 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, optimization problem, (15 more...)

Neural Information Processing Systems

Country: North America > Canada (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Stochastic Spectral and Conjugate Descent Methods

Dmitry Kovalev, Peter Richtarik, Eduard Gorbunov, Elnur Gasanov

Neural Information Processing Systems, May-26-2025, 10:43:38 GMT

The state-of-the-art methods for solving optimization problems in high dimensions are variants of randomized coordinate descent (RCD). In this paper we introduce a fundamentally new type of acceleration strategy for RCD based on augmenting the set of coordinate directions with a few spectral or conjugate directions. As the number of extra directions to sample from increases, the rate of the method improves, interpolating between the linear rate of RCD and a linear rate independent of the condition number. We also develop and analyze inexact variants of these methods, in which the spectral and conjugate directions need only be approximate. We motivate this development by proving several negative results that highlight the limitations of RCD with importance sampling.
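
The idea of augmenting coordinate directions with a spectral direction can be illustrated on a small quadratic. The following is a minimal sketch, not the paper's implementation: it assumes the objective f(x) = 0.5 x^T A x - b^T x, uniform sampling of directions, and an exact line search along the sampled direction. The matrix A = I + (c/n) 11^T is a hypothetical test case chosen because its top eigenvector is known in closed form; all function names are illustrative.

```python
import random

def matvec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def objective(A, b, x):
    # f(x) = 0.5 * x^T A x - b^T x
    return 0.5 * dot(x, matvec(A, x)) - dot(b, x)

def step_along(A, b, x, s):
    # Exact line search along direction s for the quadratic f.
    g = [ri - bi for ri, bi in zip(matvec(A, x), b)]  # gradient A x - b
    t = dot(s, g) / dot(s, matvec(A, s))
    return [xi - t * si for xi, si in zip(x, s)]

def descend(A, b, directions, iters, seed=0):
    # Sample one direction uniformly at random per iteration.
    rng = random.Random(seed)
    x = [0.0] * len(b)
    for _ in range(iters):
        x = step_along(A, b, x, rng.choice(directions))
    return x

n, c = 5, 100.0
# Assumed test matrix: A = I + (c/n) * ones * ones^T, top eigenvalue 1 + c.
A = [[(1.0 if i == j else 0.0) + c / n for j in range(n)] for i in range(n)]
b = [float(i + 1) for i in range(n)]
coords = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
top = [n ** -0.5] * n  # unit eigenvector of A for the eigenvalue 1 + c

x_rcd = descend(A, b, coords, 200)          # coordinate directions only
x_aug = descend(A, b, coords + [top], 200)  # plus one spectral direction
```

Comparing `objective(A, b, x_rcd)` with `objective(A, b, x_aug)` for several seeds gives a rough feel for the effect of the extra direction on this ill-conditioned example; the sketch makes no claim about the rates proved in the paper.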

artificial intelligence, optimization problem, probability, (18 more...)

Neural Information Processing Systems

Country:

Europe > Russia (0.28)
Asia > Middle East > Saudi Arabia (0.14)

Genre: Research Report (0.48)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.49)

Accelerated Stochastic Matrix Inversion: General Theory and Speeding up BFGS Rules for Faster Second-Order Optimization

Robert Gower, Filip Hanzely, Peter Richtarik, Sebastian U. Stich

Neural Information Processing Systems, May-26-2025, 10:13:13 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland (0.28)
Asia > Middle East > Saudi Arabia (0.14)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

SEGA: Variance Reduction via Gradient Sketching

Filip Hanzely, Konstantin Mishchenko, Peter Richtarik

Neural Information Processing Systems, Mar-27-2025, 05:32:47 GMT

We propose a randomized first-order optimization method, SEGA (SkEtched GrAdient), which progressively builds, throughout its iterations, a variance-reduced estimate of the gradient from random linear measurements (sketches) of the gradient. In each iteration, SEGA updates the current estimate of the gradient through a sketch-and-project operation using the information provided by the latest sketch, and this is subsequently used to compute an unbiased estimate of the true gradient through a random relaxation procedure. This unbiased estimate is then used to perform a gradient step. Unlike standard subspace descent methods, such as coordinate descent, SEGA can be used for optimization problems with a non-separable proximal term. We provide a general convergence analysis and prove linear convergence for strongly convex objectives. In the special case of coordinate sketches, SEGA can be enhanced with various techniques such as importance sampling, minibatching and acceleration, and its rate is, up to a small constant factor, identical to the best-known rate of coordinate descent.
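
The iteration described in the abstract can be sketched for the special case of coordinate sketches with uniform sampling. This is an illustrative sketch under stated assumptions, not the authors' code: the smooth part is assumed to be f(x) = 0.5 * ||x - a||^2, the proximal term is assumed to be the indicator of a Euclidean ball (a non-separable choice whose prox is a projection), and the step size and all function names are hypothetical.

```python
import math
import random

def sega_estimate(h, grad_i, i):
    """Return (g, h_new) using only the i-th partial derivative."""
    n = len(h)
    diff = grad_i - h[i]
    h_new = list(h)
    h_new[i] = grad_i   # sketch-and-project update for a coordinate sketch
    g = list(h)
    g[i] += n * diff    # unbiased estimate under uniform sampling
    return g, h_new

def project_ball(x, radius):
    # Prox of the indicator of a Euclidean ball (non-separable).
    norm = math.sqrt(sum(v * v for v in x))
    if norm <= radius:
        return list(x)
    return [v * radius / norm for v in x]

def sega(a, radius, step, iters, seed=0):
    rng = random.Random(seed)
    n = len(a)
    x = [0.0] * n
    h = [0.0] * n  # running estimate of the gradient
    for _ in range(iters):
        i = rng.randrange(n)
        grad_i = x[i] - a[i]  # i-th partial of 0.5 * ||x - a||^2
        g, h = sega_estimate(h, grad_i, i)
        x = project_ball([xj - step * gj for xj, gj in zip(x, g)], radius)
    return x

a = [3.0, -1.0, 2.0, 0.5]
x_hat = sega(a, radius=1.0, step=0.05, iters=2000)
```

Averaging the estimate `g` over all n coordinate choices recovers the full gradient exactly, which is the unbiasedness property the abstract refers to. Because the last operation in each iteration is a projection, every iterate stays inside the ball.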

artificial intelligence, machine learning, optimization problem, (15 more...)

Neural Information Processing Systems

Country: North America (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Stochastic Spectral and Conjugate Descent Methods

Dmitry Kovalev, Peter Richtarik, Eduard Gorbunov, Elnur Gasanov

Neural Information Processing Systems, Mar-27-2025, 03:31:27 GMT

artificial intelligence, optimization problem, probability, (17 more...)

Neural Information Processing Systems

Country:

Europe > Russia (0.28)
Asia > Middle East (0.28)

Genre: Research Report (0.48)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.49)

Accelerated Stochastic Matrix Inversion: General Theory and Speeding up BFGS Rules for Faster Second-Order Optimization

Robert Gower, Filip Hanzely, Peter Richtarik, Sebastian U. Stich

Neural Information Processing Systems, Mar-27-2025, 01:18:13 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > Middle East (0.28)
Europe > Switzerland (0.28)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

Stochastic Proximal Langevin Algorithm: Potential Splitting and Nonasymptotic Rates

Adil Salim, Dmitry Kovalev, Peter Richtarik

Neural Information Processing Systems, Mar-23-2025, 23:53:49 GMT

We propose a new algorithm, the Stochastic Proximal Langevin Algorithm (SPLA), for sampling from a log-concave distribution. Our method is a generalization of the Langevin algorithm to potentials expressed as the sum of one stochastic smooth term and multiple stochastic nonsmooth terms. In each iteration, our splitting technique only requires access to a stochastic gradient of the smooth term and a stochastic proximal operator for each of the nonsmooth terms. We establish nonasymptotic sublinear and linear convergence rates under convexity and strong convexity of the smooth term, respectively, expressed in terms of the KL divergence and Wasserstein distance. We illustrate the efficiency of our sampling technique through numerical simulations on a Bayesian learning task.
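
The splitting described in the abstract (a gradient step on the smooth term, Gaussian noise, then a proximal step for each nonsmooth term) can be sketched in one dimension. This is a simplified illustration under stated assumptions, not the paper's experiment: the potential is assumed to be U(x) = x^2/2 + lam * |x|, with a single nonsmooth term whose prox is soft-thresholding, and the gradient and prox are computed exactly rather than stochastically. The names `spla_chain`, `soft_threshold`, `step`, and `lam` are hypothetical.

```python
import math
import random

def soft_threshold(v, t):
    # Proximal operator of t * |.| evaluated at v.
    return math.copysign(max(abs(v) - t, 0.0), v)

def spla_chain(step, lam, iters, seed=0):
    rng = random.Random(seed)
    x = 0.0
    samples = []
    for _ in range(iters):
        z = x - step * x  # gradient step on the smooth term x^2 / 2
        y = z + math.sqrt(2.0 * step) * rng.gauss(0.0, 1.0)  # Langevin noise
        x = soft_threshold(y, step * lam)  # prox step on lam * |x|
        samples.append(x)
    return samples

samples = spla_chain(step=0.05, lam=1.0, iters=5000)
```

The returned list approximates draws from a density proportional to exp(-U(x)); a smaller `step` reduces discretization bias at the cost of slower mixing. With several nonsmooth terms, the prox steps would be applied one after another within each iteration.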

artificial intelligence, bayesian inference, machine learning, (14 more...)

Neural Information Processing Systems

Country: