AITopics | Mathematical & Statistical Methods

Mathematical & Statistical Methods

Bayesian Sampling Using Stochastic Gradient Thermostats

Nan Ding, Youhan Fang, Ryan Babbush, Changyou Chen, Robert D. Skeel, Hartmut Neven

Neural Information Processing Systems | Feb-8-2025, 20:38:42 GMT

Dynamics-based sampling methods, such as Hybrid Monte Carlo (HMC) and Langevin dynamics (LD), are commonly used to sample target distributions. Recently, such approaches have been combined with stochastic gradient techniques to increase sampling efficiency when dealing with large datasets. An outstanding problem with this approach is that the stochastic gradient introduces an unknown amount of noise which can prevent proper sampling after discretization. To remedy this problem, we show that one can leverage a small number of additional variables to stabilize momentum fluctuations induced by the unknown noise. Our method is inspired by the idea of a thermostat in statistical physics and is justified by a general theory.
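The thermostat idea in the abstract can be illustrated with a minimal sketch. It assumes the stochastic gradient Nosé-Hoover thermostat (SGNHT) update as it is commonly presented: a single extra scalar `xi` acts as an adaptive friction that grows when the average kinetic energy exceeds its target and shrinks otherwise, so it can absorb gradient noise of unknown size. The toy target (a standard normal), the noise model, and every parameter value (`h`, `A`, `n_steps`) are assumptions chosen for illustration, not settings taken from the paper.

```python
import numpy as np

def sgnht_sample(noisy_grad, theta0, n_steps=5000, h=0.01, A=1.0, seed=0):
    """Sketch of an SGNHT sampler; all parameter values are illustrative."""
    rng = np.random.default_rng(seed)
    theta = np.atleast_1d(np.asarray(theta0, dtype=float)).copy()
    n = theta.size
    p = rng.standard_normal(n)  # initial momentum
    xi = A                      # thermostat variable (adaptive friction)
    samples = np.empty((n_steps, n))
    for t in range(n_steps):
        g = noisy_grad(theta, rng)  # stochastic gradient of the potential U
        # Momentum update: friction, gradient force, and injected noise.
        p = p - xi * p * h - g * h + np.sqrt(2.0 * A * h) * rng.standard_normal(n)
        theta = theta + p * h
        # Thermostat update: drive the mean kinetic energy p.p / n toward 1.
        xi = xi + (p @ p / n - 1.0) * h
        samples[t] = theta
    return samples

def noisy_grad(theta, rng):
    # Hypothetical target: U(theta) = theta^2 / 2, i.e. a standard normal.
    # The added term stands in for minibatch noise of unknown magnitude.
    return theta + rng.standard_normal(theta.shape)

samples = sgnht_sample(noisy_grad, theta0=[0.0])
print(samples.shape, samples.mean(), samples.var())
```

Setting `xi` to a constant instead of updating it gives a fixed-friction sampler, which is a simple way to see what the thermostat variable contributes.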

artificial intelligence, machine learning, sgnht, (15 more...)

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.85)

Review for NeurIPS paper: Convergence and Stability of Graph Convolutional Networks on Large Random Graphs

Neural Information Processing Systems | Feb-8-2025, 07:20:26 GMT

Summary and Contributions: This paper presents a theoretical analysis of the convergence and stability properties of GCNs on large random graphs. It introduces continuous GCNs (c-GCNs) that act on a bounded, piecewise-Lipschitz function of unobserved latent node variables which are linked through a similarity kernel. It has two main contributions. Firstly, it studies notions of invariance and equivariance to isomorphism of random graph models, and gives convergence results of discrete GCNs to c-GCNs for large graphs. Specifically, for the invariant case the authors claim that the outputs of both networks lie in the same output space.

artificial intelligence, convergence and stability, graph convolutional network, (10 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.90)

Review for NeurIPS paper: Convergence and Stability of Graph Convolutional Networks on Large Random Graphs

Neural Information Processing Systems | Feb-8-2025, 07:20:18 GMT

This paper considers a continuous version of the graph convolutional network (GCN) and analyzes the usual discrete GCN as a discrete approximation of the continuous one. Under some random graph generative models, the convergence rate of the discrete GCN to the continuous one is derived. Moreover, some stability results are given to show that the induced GCN is stable against perturbation of the underlying generative model. The analysis is interesting and the exposition is well written. This kind of continuous-to-discrete analysis would facilitate further theoretical work to understand GCNs in general. Therefore, this paper is worth publication in NeurIPS.

convergence and stability, graph convolutional network, neurips paper, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.68)

Review for NeurIPS paper: A General Large Neighborhood Search Framework for Solving Integer Linear Programs

Neural Information Processing Systems | Feb-7-2025, 15:02:06 GMT

Additional Feedback: I wonder whether the used LNS requires a local search algorithm for solving the subproblem (Line 3). The authors argue that they set \gamma to 1 because it is a finite-horizon task. I completely agree that this is a possible choice; however even for finite-horizon tasks, \gamma can be set to values smaller than 1.0. I wonder how sensitive their approach is to such hyperparameters. The authors sampled 5 trajectories for each problem (instance?) to estimate the policy gradient. I'm not sure whether I understood that point fully.
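The reviewer's point about \gamma can be made concrete with a small sketch of the return that enters a policy-gradient estimate. The function name and the reward sequence are hypothetical and are not taken from the paper under review. With \gamma = 1 every remaining reward counts fully, while \gamma < 1 down-weights later rewards even though the horizon is finite.

```python
def discounted_returns(rewards, gamma=1.0):
    """Return G_t = r_t + gamma * r_{t+1} + ... for every step t."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Accumulate from the last step backwards so each G_t reuses G_{t+1}.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns

rewards = [1.0, 1.0, 1.0]  # illustrative finite-horizon episode
print(discounted_returns(rewards, gamma=1.0))  # [3.0, 2.0, 1.0]
print(discounted_returns(rewards, gamma=0.5))  # [1.75, 1.5, 1.0]
```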

integer linear program, neighborhood search framework, neurips paper, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.61)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Review for NeurIPS paper: A General Large Neighborhood Search Framework for Solving Integer Linear Programs

Neural Information Processing Systems | Feb-7-2025, 15:01:59 GMT

This paper received positive reviews from all three reviewers, but during the discussion there was widespread concern about whether the contribution is of sufficient significance for a NeurIPS publication. In particular, the question was raised whether a paper that merely applies ML techniques in a new application domain was of sufficient significance. I also read the paper and the authors' rebuttal and I very much agree with the authors on this point: application papers have always been a part of the major ML conferences and can help drive the field forward. I am therefore happy to recommend acceptance and encourage the authors to spend more text in the final version on motivating the problem to a general audience.

integer linear program, neighborhood search framework, neurips paper, (1 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)
Information Technology > Artificial Intelligence > Machine Learning (0.32)

Review for NeurIPS paper: GCOMB: Learning Budget-constrained Combinatorial Algorithms over Billion-sized Graphs

Neural Information Processing Systems | Feb-7-2025, 14:43:09 GMT

Weaknesses: The main weaknesses of the paper are that the work only uses a naïve version of the greedy algorithm rather than the faster lazy greedy algorithm, and that it seems to claim more than the results suggest without further investigation of the scope of applicability and of the performance improvements over the greedy algorithm. The approach seems to be specialized to selecting a set of elements for coverage-like problems, and specifically submodular maximization problems which admit greedy approximation algorithms, not necessarily general set combinatorial problems as claimed (it is important to clearly and fairly articulate the claimed scope of the proposed algorithm's superior performance). Additionally, the greedy algorithm empirically gives near-optimal performance in the experiments, so it would be useful to know whether this approach performs well for more difficult problems, where greedy is not almost optimal. It would be good to see performance on other more combinatorial problems or nonsubmodular set graph problems, e.g. The score supervision used to train the GCN is highly related to the marginal return that greedy would use to score nodes. In addition, the locality metric seems to directly consider the percentage of neighbors of a node which are not currently covered by a partial solution, which is directly related to the coverage problems considered in this work.
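For readers unfamiliar with the lazy greedy baseline the reviewer mentions, the following is a minimal sketch on a toy maximum-coverage instance. It is not code from the paper; the function name, the set names, and the instance are hypothetical. The idea is that, for a submodular objective, a marginal gain computed in an earlier round is an upper bound on the current gain, so most candidates never need to be re-evaluated.

```python
import heapq

def lazy_greedy_cover(sets, budget):
    """Pick up to `budget` sets maximising coverage, re-evaluating gains lazily."""
    covered = set()
    chosen = []
    # Max-heap via negated gains; the initial gain of a set is its size.
    heap = [(-len(s), name) for name, s in sets.items()]
    heapq.heapify(heap)
    while heap and len(chosen) < budget:
        _, name = heapq.heappop(heap)
        fresh = len(sets[name] - covered)  # refreshed marginal gain
        if fresh == 0:
            continue  # this set adds nothing any more; drop it
        # Stale gains only overestimate, so if the refreshed gain still
        # matches or beats the best stale bound it is the true maximum.
        if not heap or fresh >= -heap[0][0]:
            chosen.append(name)
            covered |= sets[name]
        else:
            heapq.heappush(heap, (-fresh, name))
    return chosen, covered

sets = {"a": {1, 2, 3, 4}, "b": {3, 4, 5}, "c": {5, 6}, "d": {1, 2}}
chosen, covered = lazy_greedy_cover(sets, budget=2)
print(chosen, sorted(covered))  # ['a', 'c'] [1, 2, 3, 4, 5, 6]
```

The selection is the same as plain greedy would make (up to tie-breaking); the saving comes from skipping gain evaluations, which is why the reviewer treats it as the fairer runtime baseline.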

combinatorial problem, learning budget-constrained combinatorial algorithm, performance improvement, (11 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Review for NeurIPS paper: GCOMB: Learning Budget-constrained Combinatorial Algorithms over Billion-sized Graphs

Neural Information Processing Systems | Feb-7-2025, 14:43:02 GMT

Three reviewers rated this paper as weak accept, and one as reject. All reviewers felt the paper combined learning-based techniques effectively to achieve impressive performance on combinatorial optimization problems in massive graphs. Reviewers describe the work as a combination of heuristics and modules consisting of existing techniques, but largely view the overall system as being significant, and comment on its impressive performance and an ablation study to justify individual components. The main criticisms were about missing comparisons to baselines. It was observed that the proposed method essentially does well on submodular coverage style problems where the greedy algorithm is often nearly optimal in practice and its main advantage is being much faster.

baseline, billion-sized graph, learning budget-constrained combinatorial algorithm, (4 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Review for NeurIPS paper: Learning Linear Programs from Optimal Decisions

Neural Information Processing Systems | Feb-7-2025, 11:54:16 GMT

Weaknesses: Unfortunately, the authors stop at the rather obvious suggestion that one can apply SQP to the NLP in question. The implementation, while using PyTorch and many rather complicated tools, does not seem to scale beyond very small instances. Perhaps most importantly, the implementation is compared to a rather basic algorithm for generic NLP (COBYLA of Powell), rather than the state of the art in general-purpose DFO or specialised methods for inverse optimisation.

learning linear program, neurips paper, optimal decision, (2 more...)

Country: North America > United States > California > Santa Clara County > Palo Alto (0.09)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

Review for NeurIPS paper: Learning Linear Programs from Optimal Decisions

Neural Information Processing Systems | Feb-7-2025, 11:54:09 GMT

All the reviewers agreed that this paper studies an interesting problem and presents a novel method. Although some reviewers initially raised a concern regarding the novelty, the authors provided a clear response and the concern was appropriately addressed. All reviewers and I agreed to suggest acceptance of this submission. Note, however, that several reviewers pointed out some important concerns. Please consider revising your paper to address them before submitting the camera-ready version.

learning linear program, neurips paper, optimal decision, (1 more...)

Genre: Research Report (0.81)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.40)

SyMANTIC: An Efficient Symbolic Regression Method for Interpretable and Parsimonious Model Discovery in Science and Beyond

Muthyala, Madhav R., Sorourifar, Farshud, Peng, You, Paulson, Joel A.

arXiv.org Artificial Intelligence | Feb-5-2025

Symbolic regression (SR) is an emerging branch of machine learning focused on discovering simple and interpretable mathematical expressions from data. Although a wide variety of SR methods have been developed, they often face challenges such as high computational cost, poor scalability with respect to the number of input dimensions, fragility to noise, and an inability to balance accuracy and complexity. This work introduces SyMANTIC, a novel SR algorithm that addresses these challenges. SyMANTIC efficiently identifies (potentially several) low-dimensional descriptors from a large set of candidates (from $\sim 10^5$ to $\sim 10^{10}$ or more) through a unique combination of mutual information-based feature selection, adaptive feature expansion, and recursively applied $\ell_0$-based sparse regression. In addition, it employs an information-theoretic measure to produce an approximate set of Pareto-optimal equations, each offering the best-found accuracy for a given complexity. Furthermore, our open-source implementation of SyMANTIC, built on the PyTorch ecosystem, facilitates easy installation and GPU acceleration. We demonstrate the effectiveness of SyMANTIC across a range of problems, including synthetic examples, scientific benchmarks, real-world material property predictions, and chaotic dynamical system identification from small datasets. Extensive comparisons show that SyMANTIC uncovers similar or more accurate models at a fraction of the cost of existing SR methods.
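To make the $\ell_0$-based sparse regression step and the accuracy-versus-complexity trade-off more tangible, here is a rough sketch. It is not SyMANTIC's implementation and does not use its API: it brute-forces every subset of at most `max_terms` candidate features with NumPy least squares, which is only feasible for a handful of candidates, and it uses the number of terms as a stand-in for the paper's complexity measure. The function name and the synthetic data are assumptions for illustration.

```python
import itertools
import numpy as np

def l0_regression(X, y, max_terms):
    """For each term count k, find the feature subset with the lowest RMSE."""
    best = {}
    for k in range(1, max_terms + 1):
        for cols in itertools.combinations(range(X.shape[1]), k):
            A = X[:, list(cols)]
            coef, *_ = np.linalg.lstsq(A, y, rcond=None)
            rmse = float(np.sqrt(np.mean((A @ coef - y) ** 2)))
            if k not in best or rmse < best[k][2]:
                best[k] = (cols, coef, rmse)
    return best  # one best-found model per complexity level

# Hypothetical data: 5 candidate features, only columns 1 and 3 matter.
rng = np.random.default_rng(0)
X = rng.standard_normal((50, 5))
y = 2.0 * X[:, 1] - 3.0 * X[:, 3]

fronts = l0_regression(X, y, max_terms=3)
for k, (cols, coef, rmse) in fronts.items():
    print(k, cols, np.round(coef, 3), rmse)
```

Keeping the best model at each term count gives a crude analogue of the Pareto set described in the abstract; a practical method needs the screening and expansion steps precisely because this exhaustive search scales combinatorially with the number of candidates.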

artificial intelligence, evolutionary algorithm, machine learning, (18 more...)

2502.03367

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.45)

Industry:

Health & Medicine (1.00)
Energy > Energy Storage (1.00)
Materials > Chemicals (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.92)
(2 more...)
