AITopics | feldman

First, we show the negativeresult that no strongly sublinear sized coresets existforlogisticregression.

artificial intelligence, machine learning, regression, (16 more...)

Neural Information Processing Systems

Country:

Europe > Germany (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Italy (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

CoresetforLine-SetsClustering

Neural Information Processing SystemsFeb-12-2026, 20:13:09 GMT

A natural generalization is to replace this input setP of n points by a setP of n sets inX. The distance from such an input setP P to a setC of centers can then be defined as the distance between the closest point-center pair. This problem is calledk-mean for sets; see e.g.

artificial intelligence, li 1, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Add feedback

CoresetforLine-SetsClustering

Neural Information Processing SystemsFeb-12-2026, 20:13:05 GMT

A natural generalization is to replace this input setP of n points by a setP of n sets inX. The distance from such an input setP P to a setC of centers can then be defined as the distance between the closest point-center pair. This problem is calledk-mean for sets; see e.g.

artificial intelligence, feldman, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Add feedback

Black-box Coreset Variational Inference

Neural Information Processing SystemsFeb-12-2026, 08:03:20 GMT

artificial intelligence, machine learning, neurips, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Orange County > Irvine (0.04)
North America > Canada > Ontario > Toronto (0.04)

Industry: Transportation > Air (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)

Add feedback

Deep

Neural Information Processing SystemsFeb-11-2026, 15:11:29 GMT

Asaresult, sinceeach 6 Table 1: results (", )-DPwith = 10 5. The? indicates representations.

artificial intelligence, machine learning, talwar, (19 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Ontario > Toronto (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Ontheuniversalityofdeeplearning

Neural Information Processing SystemsFeb-10-2026, 22:07:23 GMT

artificial intelligence, latexit sha1, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

All ERMs Can Fail in Stochastic Convex Optimization Lower Bounds in Linear Dimension

Burla, Tal, Livni, Roi

arXiv.org Machine LearningFeb-10-2026

We study the sample complexity of the best-case Empirical Risk Minimizer in the setting of stochastic convex optimization. We show that there exists an instance in which the sample size is linear in the dimension, learning is possible, but the Empirical Risk Minimizer is likely to be unique and to overfit. This resolves an open question by Feldman. We also extend this to approximate ERMs. Building on our construction we also show that (constrained) Gradient Descent potentially overfits when horizon and learning rate grow w.r.t sample size. Specifically we provide a novel generalization lower bound of $Ω\left(\sqrt{ηT/m^{1.5}}\right)$ for Gradient Descent, where $η$ is the learning rate, $T$ is the horizon and $m$ is the sample size. This narrows down, exponentially, the gap between the best known upper bound of $O(ηT/m)$ and existing lower bounds from previous constructions.

artificial intelligence, erm, machine learning, (14 more...)

arXiv.org Machine Learning

2602.0835

Country:

Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.56)

Add feedback

2 BayesianCoresets Inthiswork,thegoalistoapproximate expectations under adensityπ(θ), θ Θexpressed asthe productofN potentials(f(xn,θ))Nn=1 andabasedensityπ0(θ): π(θ): = 1 Z exp

Neural Information Processing SystemsFeb-9-2026, 18:41:12 GMT

Large-scale data--which has become the norm in many scientific and commercial applications of statistical machine learning--creates an inherently difficult setting for the modern data analyst.

artificial intelligence, inference, machine learning, (15 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Information Technology (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

LearningwithUser-LevelPrivacy

Neural Information Processing SystemsFeb-9-2026, 03:32:07 GMT

Releasing seemingly innocuous functions of a data set can easily compromise the privacy of individuals, whether the functions are simple counts [35]orcomplexmachine learning models like deep neural networks [52,30].

artificial intelligence, deep learning, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Can Implicit Bias Explain Generalization Stochastic Convex Optimization Case Study

Neural Information Processing SystemsFeb-8-2026, 12:17:29 GMT

One of the great mysteries of contemporary machine learning is the impressive success ofunregularized and overparameterized learningalgorithms. In detail,current machinelearningpracticeis to trainmodels with far more parameters than samples and let the algorithmfit the data, oftentimes without any type of regularization. In fact, these algorithms are so overcapacitated that they can even memorize and fit random data (Neyshabur et al., 2015; Zhang et al., 2017). Yet, when trained on real-life data, these algorithms show remarkable performance in generalizing to unseen samples. This phenomenon is often attributed to what is described as theimplicit-regularization of an algorithm (Neyshabur et al., 2015). Implicit regularization roughly refers to the learner's preference to implicitly choosing certain structured solutionsas if some explicit regularization term appeared in its objective.

artificial intelligence, machine learning, regularizer, (18 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > France (0.04)
Asia > Japan > Kyūshū & Okinawa > Okinawa (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.64)

Add feedback

Filters

Collaborating Authors

feldman

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

On Coresets for Logistic Regression

CoresetforLine-SetsClustering

CoresetforLine-SetsClustering

Black-box Coreset Variational Inference

Deep

Ontheuniversalityofdeeplearning

All ERMs Can Fail in Stochastic Convex Optimization Lower Bounds in Linear Dimension

2 BayesianCoresets Inthiswork,thegoalistoapproximate expectations under adensityπ(θ), θ Θexpressed asthe productofN potentials(f(xn,θ))Nn=1 andabasedensityπ0(θ): π(θ): = 1 Z exp

LearningwithUser-LevelPrivacy

Can Implicit Bias Explain Generalization Stochastic Convex Optimization Case Study