AITopics | training loss

Stable GFlowNets with Probabilistic Guarantees

Lei, Zengxiang, Shreekumar, Ananth, Rosenthal, Jonathan, Song, Ruoyu, Cardenas, Alvaro A., Fremont, Daniel J., Xu, Dongyan, Ukkusuri, Satish, Celik, Z. Berkay

arXiv.org Machine LearningMay-5-2026

Generative Flow Networks (GFlowNets) learn to sample states proportional to an unnormalized reward. Despite their theoretical promise, practical training is often unstable, exhibiting severe loss spikes and mode collapse. To tackle this, we first assess the sensitivity of GFlowNet objectives, demonstrating that a small Total Variation (TV) distance between the learned and target distributions does not preclude unbounded training loss. Motivated by this mismatch, we establish converse guarantees by deriving loss-to-TV bounds that certify global fidelity from bounded trajectory balance losses. Lastly, we propose Stable GFlowNets, an algorithm that leverages our theoretical results to stabilize training, and empirically demonstrate improved training behavior and superior distributional fidelity.

artificial intelligence, gflownet, machine learning, (18 more...)

arXiv.org Machine Learning

2605.01729

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

01d8bae291b1e4724443375634ccfa0e-AuthorFeedback.pdf

Neural Information Processing SystemsApr-30-2026, 19:56:04 GMT

artificial intelligence, local minimum, machine learning, (19 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

Add feedback

4b5deb9a14d66ab0acc3b8a2360cde7c-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 18:53:31 GMT

What can linearized neural networks actually say about generalization? As mentioned in the main text, all our models are trained using the same scheme which was selected without any hyperparameter tuning, besides ensuring a good performance on CIFAR2 for the neural networks. Namely, we train using stochastic gradient descent (SGD) to optimize a binary crossentropy loss, with a decaying learning rate starting at 0.05 and momentum set to 0.9. Furthermore, we use a batch size of 128and train for a 100epochs. This is enough to obtain close-to-zero training losses for the neural networks, and converge to a stable test accuracy in the case of the linearized models1.

artificial intelligence, eigenfunction, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)

Add feedback

Country:

North America > United States (0.28)
Asia > Middle East > Israel (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.98)

Add feedback

2130eb640e0a272898a51da41363542d-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 01:53:57 GMT

architecture, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England (0.28)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Add feedback

Speedy Performance Estimation for Neural Architecture Search

Neural Information Processing SystemsApr-25-2026, 01:53:53 GMT

Reliable yet efficient evaluation of generalisation performance of a proposed architecture is crucial to the success of neural architecture search (NAS). Traditional approaches face a variety of limitations: training each architecture to completion is prohibitively expensive, early stopped validation accuracy may correlate poorly with fully trained performance, and model-based estimators require large training sets. We instead propose to estimate the final test performance based on a simple measure of training speed. Our estimator is theoretically motivated by the connection between generalisation and training speed, and is also inspired by the reformulation of a PAC-Bayes bound under the Bayesian setting. Our modelfree estimator is simple, efficient, and cheap to implement, and does not require hyperparameter-tuning or surrogate training before deployment. We demonstrate on various NAS search spaces that our estimator consistently outperforms other alternatives in achieving better correlation with the true test performance rankings. We further show that our estimator can be easily incorporated into both query-based and one-shot NAS methods to improve the speed or quality of the search.

artificial intelligence, bayesian inference, machine learning, (17 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England (0.28)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback

Stimulative Training of Residual Networks: ASocial Psychology Perspective of Loafing

Neural Information Processing SystemsApr-24-2026, 19:55:32 GMT

We further verify that stimulative training can well handle the loafing problem on different datasets and residual networks. As shown in Fig. r1, we can see that stimulative training can always improve the performance of a given residual network and all of its sub-networks by a larger margin on various residual networks and benchmark datasets. In other words, different residual networks trained on different datasets invariably suffer from the problem of network loafing, which can be well solved by the proposed stimulative training strategy. Figure r1: Stimulative training can improve the performance of a given residual network and all of its sub-networks significantly. We further verify it on various residual networks and benchmark datasets.

artificial intelligence, ccuracy, machine learning, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback