AITopics | Asia

Towards Gradient-based Bilevel Optimization with Non-convex Followers and Beyond Risheng Liu1,2 Yaohua Liu1 Shangzhi Zeng3 Jin Zhang 4,5

Neural Information Processing SystemsApr-25-2026, 17:44:58 GMT

In recent years, Bi-Level Optimization (BLO) techniques have received extensive attentions from both learning and vision communities. A variety of BLO models in complex and practical tasks are of non-convex follower structure in nature (a.k.a., without Lower-Level Convexity, LLC for short). However, this challenging class of BLOs is lack of developments on both efficient solution strategies and solid theoretical guarantees. In this work, we propose a new algorithmic framework, named Initialization Auxiliary and Pessimistic Trajectory Truncated Gradient Method (IAPTT-GM), to partially address the above issues. In particular, by introducing an auxiliary as initialization to guide the optimization dynamics and designing a pessimistic trajectory truncation operation, we construct a reliable approximate version of the original BLO in the absence of LLC hypothesis. Our theoretical investigations establish the convergence of solutions returned by IAPTT-GM towards those of the original BLO without LLC. As an additional bonus, we also theoretically justify the quality of our IAPTT-GM embedded with Nesterov's accelerated dynamics under LLC. The experimental results confirm both the convergence of our algorithm without LLC, and the theoretical findings under LLC.

artificial intelligence, machine learning, optimization problem, (16 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)

Add feedback

simple-saddle-camera-version

张辰逸

Neural Information Processing SystemsApr-25-2026, 17:14:13 GMT

Escaping saddle points is a central research topic in nonconvex optimization. In this paper, we propose a simple gradient-based algorithm such that for a smooth function f: Rn!R, it outputs an -approximate second-order stationary point in O(logn/ 1.75)iterations. Compared to the previous state-of-the-art algorithms by Jin et al. with O(log4 n/ 2) or O(log6 n/ 1.75) iterations, our algorithm is polynomially better in terms of logn and matches their complexities in terms of 1/ .

algorithm, artificial intelligence, machine learning, (13 more...)

Neural Information Processing Systems

Country: Asia (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.57)

Add feedback

47a658229eb2368a99f1d032c8848542-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 17:13:08 GMT

artificial intelligence, arxiv preprint arxiv, machine learning, (13 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Data Science (0.94)

Add feedback

477bdb55b231264bb53a7942fd84254d-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 16:55:31 GMT

artificial intelligence, machine learning, regression, (19 more...)

Neural Information Processing Systems

Country: Asia (0.14)

Genre: Research Report > New Finding (0.47)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
(2 more...)

Add feedback

466473650870501e3600d9a1b4ee5d44-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 16:36:20 GMT

artificial intelligence, machine learning, perturbation, (17 more...)

Neural Information Processing Systems

Country: Asia > South Korea (0.28)

Industry: Education (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Supplementary Material: Memory-Efficient Approximation Algorithms for MAX-K-CUT and Correlation Clustering

Neural Information Processing SystemsApr-25-2026, 16:21:47 GMT

Let ϑ Rd1 and µ Rd2 be the dual variables corresponding to the d1 equality constraints and the d2 inequality constraints respectively. Let X? be an optimal solution to (SDP) and let X?FW be an optimal solution to (SDP-LSE). For ease of notation, let u= A(1)(X) b(1) andv = b(2) A(2)(X), (1) and define (bu,bv), (uFW,vFW) and (u?,v?) by substituting bX, XFW and X? respectively in (1). Upper bound on the objective. Rearranging the terms, using the duality of the `1 and ` norms, and the fact that µ? 0, gives hC, bX i hC,X?i+

artificial intelligence, diag, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > India (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.56)
Information Technology > Artificial Intelligence > Machine Learning (0.46)

Add feedback

45c166d697d65080d54501403b433256-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 16:21:44 GMT

artificial intelligence, machine learning, survey article, (19 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre:

Overview (0.68)
Research Report > New Finding (0.46)

Industry: Telecommunications (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.71)
Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Add feedback

Supplementary Material of Towards Enabling Meta-Learning from Target Models

Neural Information Processing SystemsApr-25-2026, 15:33:01 GMT

This is the supplementary material of paper "Towards Enabling Meta-Learning from Target Models". We give implementation details, more discussions, and more experiment results in this material.

artificial intelligence, machine learning, target model, (17 more...)

Neural Information Processing Systems

Country: Asia > China (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

43baa6762fa81bb43b39c62553b2970d-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 15:32:58 GMT

artificial intelligence, machine learning, target model, (14 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Faster Query Times for Fully Dynamic k-Center Clustering with Outliers

Neural Information Processing SystemsApr-25-2026, 15:32:12 GMT

Given a point set P M from a metric space (M,d)and numbers k,z N, the metric k-center problem with z outliers is to find a set C P of k points such that the maximum distance of all but at most z outlier points of P to their nearest center in C is minimized. We consider this problem in the fully dynamic model, i.e., under insertions and deletions of points, for the case that the metric space has a bounded doubling dimension dim. We utilize a hierarchical data structure to maintain the points and their neighborhoods, which enables us to efficiently find the clusters. In particular, our data structure can be queried at any time to generate a (3 + ε)-approximate solution for input values of k and z in worst-case query time ε O(dim)klognloglog, where is the ratio between the maximum and minimum distance between two points in P. Moreover, it allows insertion/deletion of a point in worst-case update time ε O(dim) lognlog . Our result achieves a significantly faster query time with respect to k and z than the current state-of-theart by Pellizzoni, Pietracaprina, and Pucci [18], which uses ε O(dim)(k+z)2 log query time to obtain a (3+ε)-approximate solution.

artificial intelligence, dim, machine learning, (19 more...)

Neural Information Processing Systems

Country: