AITopics | lower-level objective

On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization

Neural Information Processing SystemsJun-16-2026, 00:07:29 GMT

In this paper, we study the problem of solving a simple bilevel optimization problem, where the upper-level objective is minimized over the solution set of the lower-level problem. We focus on the general setting in which both the upper-and lower-level objectives are smooth but potentially nonconvex. Due to the absence of additional structural assumptions for the lower-level objective--such as convexity or the Polyak-Łojasiewicz (PL) condition--guaranteeing global optimality is generally intractable. Instead, we introduce a suitable notion of stationarity for this class of problems and aim to design a first-order algorithm that finds such stationary points in polynomial time. Intuitively, stationarity in this setting means the upper-level objective cannot be substantially improved locally without causing a larger deterioration in the lower-level objective. To this end, we show that a simple and implementable variant of the dynamic barrier gradient descent (DBGD) framework can effectively solve the considered nonconvex simple bilevel problems up to stationarity.

artificial intelligence, bilevel problem, machine learning, (18 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.34)

Add feedback

fe61e76998bbe3db53a6a48fa58207e9-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 20:12:49 GMT

artificial intelligence, machine learning, objective, (18 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
Europe > Slovakia > Bratislava > Bratislava (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error Bounds

Neural Information Processing SystemsDec-27-2025, 14:37:48 GMT

This paper investigates simple bilevel optimization problems where we minimize a convex upper-level objective over the optimal solution set of a convex lower-level objective. Existing methods for such problems either only guarantee asymptotic convergence, have slow sublinear rates, or require strong assumptions. To address these challenges, we propose a penalization framework that delineates the relationship between approximate solutions of the original problem and its reformulated counterparts.

artificial intelligence, optimization problem, proceedings, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.63)

Add feedback

An Alternating Optimization Method for Bilevel Problems under the Polyak-Łojasiewicz Condition

Neural Information Processing SystemsDec-26-2025, 18:35:44 GMT

Bilevel optimization has recently regained interest owing to its applications in emerging machine learning fields such as hyperparameter optimization, meta-learning, and reinforcement learning. Recent results have shown that simple alternating (implicit) gradient-based algorithms can match the convergence rate of single-level gradient descent (GD) when addressing bilevel problems with a strongly convex lower-level objective. However, it remains unclear whether this result can be generalized to bilevel problems beyond this basic setting. In this paper, we first introduce a stationary metric for the considered bilevel problems, which generalizes the existing metric, for a nonconvex lower-level objective that satisfies the Polyak-Łojasiewicz (PL) condition. We then propose a Generalized ALternating mEthod for bilevel opTimization (GALET) tailored to BLO with convex PL LL problem and establish that GALET achieves an $\epsilon$-stationary point for the considered problem within $\tilde{\cal O}(\epsilon^{-1})$ iterations, which matches the iteration complexity of GD for single-level smooth nonconvex problems.

alternating optimization method, bilevel problem, name change, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error Bounds Pengyu Chen

Neural Information Processing SystemsOct-10-2025, 22:31:20 GMT

This paper investigates simple bilevel optimization problems where we minimize an upper-level objective over the optimal solution set of a convex lower-level objective. Existing methods for such problems either only guarantee asymptotic convergence, have slow sublinear rates, or require strong assumptions. To address these challenges, we propose a penalization framework that delineates the relationship between approximate solutions of the original problem and its reformulated counterparts.

iteration, objective, optimal solution, (16 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
Europe > Slovakia > Bratislava > Bratislava (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization

Cao, Jincheng, Jiang, Ruichen, Hamedani, Erfan Yazdandoost, Mokhtari, Aryan

arXiv.org Artificial IntelligenceAug-1-2025

In this paper, we study the problem of solving a simple bilevel optimization problem, where the upper-level objective is minimized over the solution set of the lower-level problem. We focus on the general setting in which both the upper- and lower-level objectives are smooth but potentially nonconvex. Due to the absence of additional structural assumptions for the lower-level objective-such as convexity or the Polyak-Łojasiewicz (PL) condition-guaranteeing global optimality is generally intractable. Instead, we introduce a suitable notion of stationarity for this class of problems and aim to design a first-order algorithm that finds such stationary points in polynomial time. Intuitively, stationarity in this setting means the upper-level objective cannot be substantially improved locally without causing a larger deterioration in the lower-level objective. To this end, we show that a simple and implementable variant of the dynamic barrier gradient descent (DBGD) framework can effectively solve the considered nonconvex simple bilevel problems up to stationarity. Specifically, to reach an $(ε_f, ε_g)$-stationary point-where $ε_f$ and $ε_g$ denote the target stationarity accuracies for the upper- and lower-level objectives, respectively-the considered method achieves a complexity of $\mathcal{O}\left(\max\left(ε_f^{-\frac{3+p}{1+p}}, ε_g^{-\frac{3+p}{2}}\right)\right)$, where $p \geq 0$ is an arbitrary constant balancing the terms. To the best of our knowledge, this is the first complexity result for a discrete-time algorithm that guarantees joint stationarity for both levels in general nonconvex simple bilevel problems.

artificial intelligence, bilevel problem, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2507.23155

Country: North America > United States > Arizona (0.28)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.34)

Add feedback

Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error Bounds

Neural Information Processing SystemsMay-27-2025, 22:05:23 GMT

This paper investigates simple bilevel optimization problems where we minimize a convex upper-level objective over the optimal solution set of a convex lower-level objective. Existing methods for such problems either only guarantee asymptotic convergence, have slow sublinear rates, or require strong assumptions. To address these challenges, we propose a penalization framework that delineates the relationship between approximate solutions of the original problem and its reformulated counterparts. Specifically, when both upper- and lower-level objectives are composite convex functions, under an \alpha -Hölderian error bound condition and certain mild assumptions, our algorithm attains an (\epsilon,\epsilon {\beta}) -optimal solution of the original problem for any \beta 0 within \mathcal{O}\left(\sqrt{{1}/{\epsilon {\max\\{\alpha,\beta\\}}}}\right) iterations. The result can be improved further if the smooth part of the upper-level objective is strongly convex.

lower-level objective, objective, simple bilevel optimization, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)

Add feedback

BILBO: BILevel Bayesian Optimization

Chew, Ruth Wan Theng, Nguyen, Quoc Phong, Low, Bryan Kian Hsiang

arXiv.org Machine LearningFeb-4-2025

Bilevel optimization is characterized by a two-level optimization structure, where the upper-level problem is constrained by optimal lower-level solutions, and such structures are prevalent in real-world problems. The constraint by optimal lower-level solutions poses significant challenges, especially in noisy, constrained, and derivative-free settings, as repeating lower-level optimizations is sample inefficient and predicted lower-level solutions may be suboptimal. We present BILevel Bayesian Optimization (BILBO), a novel Bayesian optimization algorithm for general bilevel problems with blackbox functions, which optimizes both upper- and lower-level problems simultaneously, without the repeated lower-level optimization required by existing methods. BILBO samples from confidence-bounds based trusted sets, which bounds the suboptimality on the lower level. Moreover, BILBO selects only one function query per iteration, where the function query selection strategy incorporates the uncertainty of estimated lower-level solutions and includes a conditional reassignment of the query to encourage exploration of the lower-level objective. The performance of BILBO is theoretically guaranteed with a sublinear regret bound for commonly used kernels and is empirically evaluated on several synthetic and real-world problems.

artificial intelligence, machine learning, optimization, (16 more...)

arXiv.org Machine Learning

2502.02121

Country:

Asia > Singapore (0.04)
Oceania > Australia (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.64)

Industry: Energy > Power Industry (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

An Alternating Optimization Method for Bilevel Problems under the Polyak-Łojasiewicz Condition

Neural Information Processing SystemsJan-19-2025, 21:59:56 GMT

Bilevel optimization has recently regained interest owing to its applications in emerging machine learning fields such as hyperparameter optimization, meta-learning, and reinforcement learning. Recent results have shown that simple alternating (implicit) gradient-based algorithms can match the convergence rate of single-level gradient descent (GD) when addressing bilevel problems with a strongly convex lower-level objective. However, it remains unclear whether this result can be generalized to bilevel problems beyond this basic setting. In this paper, we first introduce a stationary metric for the considered bilevel problems, which generalizes the existing metric, for a nonconvex lower-level objective that satisfies the Polyak-Łojasiewicz (PL) condition. We then propose a Generalized ALternating mEthod for bilevel opTimization (GALET) tailored to BLO with convex PL LL problem and establish that GALET achieves an \epsilon -stationary point for the considered problem within \tilde{\cal O}(\epsilon {-1}) iterations, which matches the iteration complexity of GD for single-level smooth nonconvex problems.

alternating optimization method, bilevel problem, ojasiewicz condition, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.40)

Add feedback

Domain Adaptive Unfolded Graph Neural Networks

Zhang, Zepeng, Fink, Olga

arXiv.org Artificial IntelligenceNov-20-2024

Over the last decade, graph neural networks (GNNs) have made significant progress in numerous graph machine learning tasks. In real-world applications, where domain shifts occur and labels are often unavailable for a new target domain, graph domain adaptation (GDA) approaches have been proposed to facilitate knowledge transfer from the source domain to the target domain. Previous efforts in tackling distribution shifts across domains have mainly focused on aligning the node embedding distributions generated by the GNNs in the source and target domains. However, as the core part of GDA approaches, the impact of the underlying GNN architecture has received limited attention. In this work, we explore this orthogonal direction, i.e., how to facilitate GDA with architectural enhancement. In particular, we consider a class of GNNs that are designed explicitly based on optimization problems, namely unfolded GNNs (UGNNs), whose training process can be represented as bi-level optimization. Empirical and theoretical analyses demonstrate that when transferring from the source domain to the target domain, the lower-level objective value generated by the UGNNs significantly increases, resulting in an increase in the upper-level objective as well. Motivated by this observation, we propose a simple yet effective strategy called cascaded propagation (CP), which is guaranteed to decrease the lower-level objective value. The CP strategy is widely applicable to general UGNNs, and we evaluate its efficacy with three representative UGNN architectures. Extensive experiments on five real-world datasets demonstrate that the UGNNs integrated with CP outperform state-of-the-art GDA baselines.

pre, target domain, ugnn, (13 more...)

arXiv.org Artificial Intelligence

2411.13137

Country:

Europe > United Kingdom > England (0.04)
Europe > Germany (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)
Asia > Singapore (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Filters

Collaborating Authors

lower-level objective

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization

fe61e76998bbe3db53a6a48fa58207e9-Paper-Conference.pdf

Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error Bounds

An Alternating Optimization Method for Bilevel Problems under the Polyak-Łojasiewicz Condition

Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error Bounds Pengyu Chen

On the Complexity of Finding Stationary Points in Nonconvex Simple Bilevel Optimization

Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error Bounds

BILBO: BILevel Bayesian Optimization

An Alternating Optimization Method for Bilevel Problems under the Polyak-Łojasiewicz Condition

Domain Adaptive Unfolded Graph Neural Networks