AITopics | Optimization

Near-optimal delta-convex estimation of Lipschitz functions

arXiv.org Machine Learning | Nov-20-2025

This paper presents a tractable algorithm for estimating an unknown Lipschitz function from noisy observations and establishes an upper bound on its convergence rate. The approach extends max-affine methods from convex shape-restricted regression to the more general Lipschitz setting. A key component is a nonlinear feature expansion that maps max-affine functions into a subclass of delta-convex functions, which act as universal approximators of Lipschitz functions while preserving their Lipschitz constants. Leveraging this property, the estimator attains the minimax convergence rate (up to logarithmic factors) with respect to the intrinsic dimension of the data under squared loss and subgaussian distributions in the random design setting. The algorithm integrates adaptive partitioning to capture intrinsic dimension, a penalty-based regularization mechanism that removes the need to know the true Lipschitz constant, and a two-stage optimization procedure combining a convex initialization with local refinement. The framework is also straightforward to adapt to convex shape-restricted regression. Experiments demonstrate competitive performance relative to other theoretically justified methods, including nearest-neighbor and kernel-based regressors.
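To make the terms in the abstract concrete, the following is a minimal sketch of a max-affine function and of a delta-convex function built as the difference of two max-affine functions. This is the generic textbook construction, not the paper's nonlinear feature expansion, which the abstract does not specify. The function names (`max_affine`, `delta_convex`, `lipschitz_bound`) and the parameter layout are assumptions made for illustration.

```python
import numpy as np


def max_affine(X, A, b):
    """Evaluate f(x) = max_k (a_k . x + b_k) for every row x of X.

    X: (n, d) inputs, A: (K, d) slopes, b: (K,) intercepts.
    """
    return np.max(X @ A.T + b, axis=1)


def delta_convex(X, A1, b1, A2, b2):
    """Difference of two max-affine (hence convex) functions.

    Illustrative stand-in only: the paper's own subclass of
    delta-convex functions is not described in the abstract.
    """
    return max_affine(X, A1, b1) - max_affine(X, A2, b2)


def lipschitz_bound(A1, A2):
    """Upper bound on the Euclidean Lipschitz constant of delta_convex.

    Each max-affine part is Lipschitz with constant at most its largest
    slope norm, and the constants add under subtraction.
    """
    return np.linalg.norm(A1, axis=1).max() + np.linalg.norm(A2, axis=1).max()
```

For example, with slopes `[[1, 0], [-1, 0]]` and `[[0, 1], [0, -1]]` and zero intercepts, `delta_convex` computes |x1| - |x2|, a non-convex function whose Lipschitz bound from `lipschitz_bound` is 2.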

artificial intelligence, estimator, machine learning, (17 more...)

arXiv.org Machine Learning

2511.15615

Country:

North America > United States (0.67)
North America > Canada (0.45)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

d482f1362bd6a8448d7c35e717c7063a-Paper-Conference.pdf

Neural Information Processing Systems | Nov-19-2025, 21:09:02 GMT

artificial intelligence, evolutionary algorithm, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
(20 more...)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (0.70)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(2 more...)

Tighter Convergence Bounds for Shuffled SGD via Primal-Dual Perspective

Neural Information Processing Systems | Nov-19-2025, 19:33:43 GMT

Appendix E), the variance across multiple runs is negligible, hence the ribbons are not observable.

artificial intelligence, convergence, machine learning, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Mathematics of Computing (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.46)

7e3491e922bfd199ea34ecafeb7380f0-Paper-Conference.pdf

Neural Information Processing Systems | Nov-19-2025, 18:26:04 GMT

artificial intelligence, machine learning, optimization, (18 more...)

Neural Information Processing Systems

Country: Asia > South Korea (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Vision (0.67)

Abductive Reasoning in Logical Credal Networks

Neural Information Processing Systems | Nov-19-2025, 18:21:27 GMT

Logical Credal Networks (LCNs) were recently introduced as a powerful probabilistic logic framework for representing and reasoning with imprecise knowledge.

logic & formal reasoning, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
South America > Brazil > São Paulo (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry: Health & Medicine > Therapeutic Area (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.68)
(4 more...)

Computational Separations between Sampling and Optimization

Kunal Talwar

Neural Information Processing Systems | Nov-19-2025, 16:03:38 GMT

Recent work [Ma et al., 2019] shows that in the non-convex case, sampling

artificial intelligence, exp, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > District of Columbia > Washington (0.04)
North America > United States > California > Santa Clara County > Mountain View (0.04)
North America > Canada (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

T2T: From Distribution Learning in Training to Gradient Search in Testing for Combinatorial Optimization

Neural Information Processing Systems | Nov-19-2025, 10:56:11 GMT

Figure 2: Diffusion modeling for CO solving where the model learns how to gradually denoise from the random noise to predict each step's

artificial intelligence, machine learning, solver, (13 more...)

Neural Information Processing Systems

Country: Asia > China > Shanghai > Shanghai (0.04)

Genre: Instructional Material > Course Syllabus & Notes (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Differentiable Random Partition Models

Neural Information Processing Systems | Nov-19-2025, 06:45:10 GMT

Partitioning a set of elements into an unknown number of mutually exclusive subsets is essential in many machine learning problems.

artificial intelligence, experiment, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Florida > Palm Beach County > Boca Raton (0.04)
North America > Canada > British Columbia > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(4 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

e046ede63264b10130007afca077877f-AuthorFeedback.pdf

Neural Information Processing Systems | Nov-19-2025, 06:42:43 GMT

We answer major comments from each reviewer below; we'll fix the minor ones. REVIEWER 1: "This paper ranks high in novelty... The experimental results are strong, especially on Text Some important details are unclear. E.g. what is the base distribution for sampling? REVIEWER 2: "Originality: This paper is the first demonstration of flow-based models to discrete data. As such, the work is fairly novel.... That being said, the main technical contribution amounts to... on top of the We agree about simplicity being a benefit.

base distribution, discrete flow, gradient, (11 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.31)
Information Technology > Artificial Intelligence > Machine Learning (0.31)

SparseST: Exploiting Data Sparsity in Spatiotemporal Modeling and Prediction

Junfeng Wu, Hadjer Benmeziane, Kaoutar El Maghraoui, Liu Liu, Yinan Wang

arXiv.org Artificial Intelligence | Nov-19-2025

Spatiotemporal data mining (STDM) has a wide range of applications in complex physical systems (CPS), e.g., transportation, manufacturing, and healthcare. Among the proposed methods, the Convolutional Long Short-Term Memory (ConvLSTM) has proved generalizable and extensible across applications, with multiple variants achieving state-of-the-art performance in various STDM tasks. However, ConvLSTM and its variants are computationally expensive, which makes them impractical on edge devices with limited computational resources. With the emerging need for edge computing in CPS, efficient AI is essential to reduce computational cost while preserving model performance. Common efficient-AI methods reduce redundancy in model capacity (e.g., model pruning and compression). However, spatiotemporal data mining naturally requires extensive model capacity, because the dependencies embedded in spatiotemporal data are complex and hard to capture, which limits model redundancy. Instead, there is a fairly high level of data and feature redundancy that introduces an unnecessary computational burden and has been largely overlooked in existing research. Therefore, we developed a novel framework, SparseST, that pioneers the exploitation of data sparsity to build an efficient spatiotemporal model. In addition, we explore and approximate the Pareto front between model performance and computational efficiency by designing a multi-objective composite loss function, which gives practitioners a practical guide for adjusting the model to computational resource constraints and the performance requirements of downstream tasks.

data mining, machine learning, sparse convolution, (21 more...)

arXiv.org Artificial Intelligence

2511.14753

Genre: