AITopics | Optimization

Optimization

Data Pruning by Information Maximization

Tan, Haoru, Wu, Sitong, Huang, Wei, Zhao, Shizhen, Qi, Xiaojuan

arXiv.org Artificial Intelligence, Aug-15-2025

In this paper, we present InfoMax, a novel data pruning method, also known as coreset selection, designed to maximize the information content of selected samples while minimizing redundancy. By doing so, InfoMax enhances the overall informativeness of the coreset. The information of individual samples is measured by importance scores, which capture their influence or difficulty in model learning. To quantify redundancy, we use pairwise sample similarities, based on the premise that similar samples contribute similarly to the learning process. We formalize the coreset selection problem as a discrete quadratic programming (DQP) task, with the objective of maximizing the total information content, represented as the sum of individual sample contributions minus the redundancies introduced by similar samples within the coreset. To ensure practical scalability, we introduce an efficient gradient-based solver, complemented by sparsification techniques applied to the similarity matrix and dataset partitioning strategies. This enables InfoMax to seamlessly scale to datasets with millions of samples. Extensive experiments demonstrate the superior performance of InfoMax in various data pruning tasks, including image classification, vision-language pre-training, and instruction tuning for large language models. Code is available at https://github.com/hrtan/InfoMax.
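The objective described in the abstract (total sample information minus pairwise redundancy within the coreset) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract describes a gradient-based solver with sparsification and dataset partitioning, whereas the sketch below swaps in a simple greedy heuristic for readability. The function names, the trade-off weight `alpha`, and the toy data are assumptions made for illustration only.

```python
import numpy as np


def coreset_objective(selected, scores, sim, alpha=1.0):
    """Sum of importance scores minus alpha times the pairwise similarity
    among the selected samples (each unordered pair counted once)."""
    idx = np.asarray(selected, dtype=int)
    info = scores[idx].sum()
    sub = sim[np.ix_(idx, idx)]
    # Drop the diagonal (self-similarity) and halve to count each pair once.
    redundancy = (sub.sum() - np.trace(sub)) / 2.0
    return float(info - alpha * redundancy)


def greedy_select(scores, sim, budget, alpha=1.0):
    """Greedy stand-in for the solver (an assumption, not the paper's method):
    repeatedly add the sample with the largest marginal gain."""
    n = len(scores)
    gain = np.asarray(scores, dtype=float).copy()
    available = np.ones(n, dtype=bool)
    chosen = []
    for _ in range(min(budget, n)):
        masked = np.where(available, gain, -np.inf)
        i = int(np.argmax(masked))
        chosen.append(i)
        available[i] = False
        # Every remaining sample loses value in proportion to its
        # similarity to the sample that was just selected.
        gain -= alpha * sim[:, i]
    return chosen


# Toy data (hypothetical): samples 0 and 1 are near-duplicates.
scores = np.array([1.0, 0.9, 0.5])
sim = np.array([[1.0, 0.95, 0.1],
                [0.95, 1.0, 0.1],
                [0.1, 0.1, 1.0]])
picked = greedy_select(scores, sim, budget=2)
print(picked, coreset_objective(picked, scores, sim))
```

In the toy data, the two highest-scoring samples are near-duplicates, so the redundancy term steers the selection toward a lower-scoring but dissimilar sample.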

artificial intelligence, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2506.01701

Country: Asia (0.28)

Genre: Research Report (0.50)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Sequential QCQP for Bilevel Optimization with Line Search

Sharifi, Sina, Hamedani, Erfan Yazdandoost, Fazlyab, Mahyar

arXiv.org Artificial Intelligence, Aug-15-2025

Bilevel optimization involves a hierarchical structure where one problem is nested within another, leading to complex interdependencies between levels. We propose a single-loop, tuning-free algorithm that guarantees anytime feasibility, i.e., approximate satisfaction of the lower-level optimality condition, while ensuring descent of the upper-level objective. At each iteration, a convex quadratically-constrained quadratic program (QCQP) with a closed-form solution yields the search direction, followed by a backtracking line search inspired by control barrier functions to ensure safe, uniformly positive step sizes. The resulting method is scalable, requires no hyperparameter tuning, and converges under mild local regularity assumptions. We establish an O(1/k) ergodic convergence rate in terms of a first-order stationary metric and demonstrate the algorithm's effectiveness on representative bilevel tasks.
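For orientation, the nested structure mentioned in the abstract is commonly written as follows. The symbols $f$, $g$, $x$, $y$, $\epsilon$, $d_k$, and $t_k$ are assumptions introduced here, and the relaxed constraint is only one standard way to express "approximate satisfaction of the lower-level optimality condition"; the paper's exact formulation may differ.

```latex
% Generic bilevel problem: upper-level objective f, lower-level objective g.
\begin{aligned}
\min_{x,\,y}\quad & f(x, y) \\
\text{s.t.}\quad & y \in \arg\min_{y'} \; g(x, y')
\end{aligned}

% One common relaxation (assumed here): bound the lower-level gradient norm.
\|\nabla_y g(x, y)\|^2 \le \epsilon

% Generic line-search update with direction d_k and step size t_k > 0.
(x_{k+1},\, y_{k+1}) = (x_k,\, y_k) + t_k \, d_k
```

Under this reading, "anytime feasibility" would mean that every iterate satisfies the relaxed constraint, while the line search chooses $t_k$ so that the upper-level objective decreases.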

artificial intelligence, machine learning, optimization, (16 more...)

arXiv.org Artificial Intelligence

2505.14647

Country: North America > United States (0.46)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

57fbe68cb318cad62c4ae4c91c83cba3-Paper-Conference.pdf

Neural Information Processing Systems, Aug-14-2025, 23:55:16 GMT

algorithm, constraint violation, convergence, (12 more...)

Neural Information Processing Systems

Country: North America > United States > Texas > Brazos County > College Station (0.05)

Genre: Research Report (0.93)

Industry: Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)

6a26c75d6a576c94654bfc4dda548c72-Paper.pdf

Neural Information Processing Systems, Aug-14-2025, 23:40:18 GMT

algorithm, classification, test example, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Industry:

Health & Medicine (0.68)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

57b694fef23ae7b9308eb4d46342595d-Paper-Conference.pdf

Neural Information Processing Systems, Aug-14-2025, 23:32:40 GMT

budget, configuration, optimization, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Germany > Baden-Württemberg > Freiburg (0.05)
North America > Canada > Quebec > Montreal (0.04)
(21 more...)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Domain Generalization without Excess Empirical Risk

Neural Information Processing Systems, Aug-14-2025, 23:13:35 GMT

We present an approach that eliminates this problem.

domain generalization, empirical risk, generalization, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
Africa (0.04)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Two-step lookahead Bayesian optimization with inequality constraints

Neural Information Processing Systems, Aug-14-2025, 22:56:39 GMT

This greedy behavior may hinder an algorithm's

acquisition function, constraint, optimization, (15 more...)

Neural Information Processing Systems

Country:

Europe > Spain > Andalusia > Cádiz Province > Cadiz (0.04)
Europe > Iceland > Capital Region > Reykjavik (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)
Asia > Russia > Siberian Federal District > Novosibirsk Oblast > Novosibirsk (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.94)

Several recent works have aimed to design algorithms that are robust to adversarial perturbations. In practice, first-order methods are a popular choice for adversarial training. These methods are appealing because they apply to many network architectures and rely only on black-box access to gradient information. Hence, analyzing this algorithm is interesting in its own right. We first discuss the setting of the hyperparameters in our experiments.

algorithm, matrix, robustness, (17 more...)

Neural Information Processing Systems

Technology: