AITopics

Variational Annealing on Graphs for Combinatorial Optimization Sebastian Sanokowski 1,2 Wilhelm Berghammer 2 Sebastian Lehner

Neural Information Processing SystemsMay-25-2025, 11:44:03 GMT

Several recent unsupervised learning methods use probabilistic approaches to solve combinatorial optimization (CO) problems based on the assumption of statistically independent solution variables. We demonstrate that this assumption imposes performance limitations in particular on difficult problem instances. Our results corroborate that an autoregressive approach which captures statistical dependencies among solution variables yields superior performance on many popular CO problems. We introduce subgraph tokenization in which the configuration of a set of solution variables is represented by a single token. This tokenization technique alleviates the drawback of the long sequential sampling procedure which is inherent to autoregressive methods without sacrificing expressivity. Importantly, we theoretically motivate an annealed entropy regularization and show empirically that it is essential for efficient and stable learning.

artificial intelligence, machine learning, optimization problem, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
Europe > United Kingdom > Scotland (0.14)
Europe > United Kingdom > England (0.14)
Europe > Austria > Upper Austria (0.14)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.46)

Add feedback

A Missing Proofs from Section 3

Neural Information Processing SystemsMay-25-2025, 11:43:48 GMT

By Lemma 5.2, this is Pr ( x K

artificial intelligence, machine learning, probability, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

The Gain of Ordering in Online Learning

Neural Information Processing SystemsMay-25-2025, 11:43:44 GMT

We study fixed-design online learning where the learner is allowed to choose the order of the datapoints in order to minimize their regret (aka self-directed online learning).

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Europe (0.14)
Oceania > Australia (0.14)
Asia > China (0.14)

Genre: Research Report > New Finding (0.46)

Industry: Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.71)

Add feedback

Sub-optimality of the Naive Mean Field approximation for proportional high-dimensional Linear Regression

Neural Information Processing SystemsMay-25-2025, 11:43:25 GMT

The Naïve Mean Field (NMF) approximation is widely employed in modern Machine Learning due to the huge computational gains it bestows on the statistician. Despite its popularity in practice, theoretical guarantees for high-dimensional problems are only available under strong structural assumptions (e.g., sparsity). Moreover, existing theory often does not explain empirical observations noted in the existing literature. In this paper, we take a step towards addressing these problems by deriving sharp asymptotic characterizations for the NMF approximation in high-dimensional linear regression. Our results apply to a wide class of natural priors and allow for model mismatch (i.e., the underlying statistical model can be different from the fitted model).

approximation, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.70)

Add feedback

A Generalized Alternating Method for Bilevel Optimization under the Polyak-Łojasiewicz Condition

Neural Information Processing SystemsMay-25-2025, 11:43:07 GMT

Bilevel optimization has recently regained interest owing to its applications in emerging machine learning fields such as hyperparameter optimization, metalearning, and reinforcement learning. Recent results have shown that simple alternating (implicit) gradient-based algorithms can match the convergence rate of single-level gradient descent (GD) when addressing bilevel problems with a strongly convex lower-level objective. However, it remains unclear whether this result can be generalized to bilevel problems beyond this basic setting. In this paper, we first introduce a stationary metric for the considered bilevel problems, which generalizes the existing metric, for a nonconvex lower-level objective that satisfies the Polyak-Łojasiewicz (PL) condition.

artificial intelligence, machine learning, optimization, (16 more...)

Neural Information Processing Systems

Country:

Europe (1.00)
North America > United States > New York (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.48)

Add feedback

c981fd12b1d5703f19bd8289da9fc996-Paper-Conference.pdf

Neural Information Processing SystemsMay-25-2025, 11:43:04 GMT

artificial intelligence, machine learning, optimization, (16 more...)

Neural Information Processing Systems

Country:

Europe (1.00)
North America > United States > New York (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.48)

Add feedback

c972859a984a21658432d7320c7df385-Supplemental-Conference.pdf

Neural Information Processing SystemsMay-25-2025, 11:42:46 GMT

artificial intelligence, machine learning, verification, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.61)

Add feedback

Topological RANSAC for instance verification and retrieval without fine-tuning

Neural Information Processing SystemsMay-25-2025, 11:42:42 GMT

This paper presents an innovative visual reasoning approach to enhancing instance verification and retrieval, particularly in situations where a fine-tuning set is unavailable. The widely-used SPatial verification (SP) method, despite its efficacy, relies on a spatial model and the hypothesis-testing strategy for instance recognition, leading to inherent limitations, including the assumption of planar structures and neglect of topological relations among features. To address these shortcomings, we introduce a pioneering technique that replaces the spatial model with a topological one within the RANSAC process. We propose bio-inspired saccade and fovea functions to verify the topological consistency among features, effectively circumventing the issues associated with SP's spatial model. Our experimental results demonstrate that our method significantly outperforms SP, achieving stateof-the-art performance in non-fine-tuning retrieval. Furthermore, our approach can enhance performance when used in conjunction with fine-tuned features. Importantly, our method retains high explainability and is lightweight, offering a practical and adaptable solution for a variety of real-world applications. Our code can be found through this link.

artificial intelligence, machine learning, retrieval, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.14)
Europe > France (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

EDGI: Equivariant Diffusion for Planning with Embodied Agents Supplementary Material

Neural Information Processing SystemsMay-25-2025, 11:42:27 GMT

On a high level, EDGI follows Diffuser [1]. We illustrate the architecture in Figure 1 in the main paper. We use a kernel size of 5. This is essentially an equivariant version of LayerNorm. In the geometric layers, the input state is split into scalar and vector components.

artificial intelligence, machine learning, vector, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.98)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.41)

Add feedback

EDGI: Equivariant Diffusion for Planning with Embodied Agents

Neural Information Processing SystemsMay-25-2025, 11:42:24 GMT

Embodied agents operate in a structured world, often solving tasks with spatial, temporal, and permutation symmetries. Most algorithms for planning and modelbased reinforcement learning (MBRL) do not take this rich geometric structure into account, leading to sample inefficiency and poor generalization.

Add feedback

Filters

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Variational Annealing on Graphs for Combinatorial Optimization Sebastian Sanokowski 1,2 Wilhelm Berghammer 2 Sebastian Lehner

A Missing Proofs from Section 3

The Gain of Ordering in Online Learning

Sub-optimality of the Naive Mean Field approximation for proportional high-dimensional Linear Regression

A Generalized Alternating Method for Bilevel Optimization under the Polyak-Łojasiewicz Condition

c981fd12b1d5703f19bd8289da9fc996-Paper-Conference.pdf

c972859a984a21658432d7320c7df385-Supplemental-Conference.pdf

Topological RANSAC for instance verification and retrieval without fine-tuning

EDGI: Equivariant Diffusion for Planning with Embodied Agents Supplementary Material

EDGI: Equivariant Diffusion for Planning with Embodied Agents