AITopics | energy model

We introduce a new machine learning approach for image segmentation that uses a neural network to model the conditional energy of a segmentation given an image. Our approach, combinatorial energy learning for image segmentation (CELIS) places a particular emphasis on modeling the inherent combinatorial nature of dense image segmentation problems. We propose efficient algorithms for learning deep neural networks to model the energy function, and for local optimization of this energy in the space of supervoxel agglomerations. We extensively evaluate our method on a publicly available 3-D microscopy dataset with 25 billion voxels of ground truth data. On an 11 billion voxel test set, we find that our method improves volumetric reconstruction accuracy by more than 20% as compared to two state-of-the-art baseline methods: graph-based segmentation of the output of a 3-D convolutional neural network trained to predict boundaries, as well as a random forest classifier trained to agglomerate supervoxels that were generated by a 3-D convolutional neural network.

artificial intelligence, machine learning, segmentation, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Genre: Research Report (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

3001ef257407d5a371a96dcd947c7d93-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-11-2026, 19:57:58 GMT

backpropagation, gaussian noise, inception score, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Add feedback

83ae75c127e2a3ea3315379020f8c19f-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 09:14:02 GMT

dataset, experiment, inference time, (15 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Industry: Energy (0.33)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Natural Language (0.69)

Add feedback

ESH_Dynamics-20

Neural Information Processing SystemsFeb-8-2026, 20:52:44 GMT

arxiv preprint arxiv, gradient evaluation, integrator, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.71)
Information Technology > Mathematics of Computing (0.69)
(2 more...)

Add feedback

Hamiltonian Dynamics with Non-Newtonian Momentum for Rapid Sampling

Neural Information Processing SystemsDec-24-2025, 04:18:15 GMT

Sampling from an unnormalized probability distribution is a fundamental problem in machine learning with applications including Bayesian modeling, latent factor inference, and energy-based model training. After decades of research, variations of MCMC remain the default approach to sampling despite slow convergence. Auxiliary neural models can learn to speed up MCMC, but the overhead for training the extra model can be prohibitive. We propose a fundamentally different approach to this problem via a new Hamiltonian dynamics with a non-Newtonian momentum. In contrast to MCMC approaches like Hamiltonian Monte Carlo, no stochastic step is required. Instead, the proposed deterministic dynamics in an extended state space exactly sample the target distribution, specified by an energy function, under an assumption of ergodicity. Alternatively, the dynamics can be interpreted as a normalizing flow that samples a specified energy model without training. The proposed Energy Sampling Hamiltonian (ESH) dynamics have a simple form that can be solved with existing ODE solvers, but we derive a specialized solver that exhibits much better performance.

hamiltonian dynamic, name change, non-newtonian momentum, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

EBGAN-MDN: An Energy-Based Adversarial Framework for Multi-Modal Behavior Cloning

Li, Yixiao, Barth, Julia, Kiefer, Thomas, Fraij, Ahmad

arXiv.org Artificial IntelligenceOct-10-2025

Multi-modal behavior cloning faces significant challenges due to mode averaging and mode collapse, where traditional models fail to capture diverse input-output mappings. This problem is critical in applications like robotics, where modeling multiple valid actions ensures both performance and safety. We propose EBGAN-MDN, a framework that integrates energy-based models, Mixture Density Networks (MDNs), and adversarial training. By leveraging a modified InfoNCE loss and an energy-enforced MDN loss, EBGAN-MDN effectively addresses these challenges. Experiments on synthetic and robotic benchmarks demonstrate superior performance, establishing EBGAN-MDN as a effective and efficient solution for multi-modal learning tasks.

artificial intelligence, generator, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2510.07562

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Q1 (R2): Motivation for parameterizing the score function explicitly, rather than as the gradient of an energy model

Neural Information Processing SystemsOct-2-2025, 11:37:08 GMT

We thank all the reviewers for providing valuable feedback. In what follows, we address specific questions. The main motivation is computational. We will discuss this motivation in Section 2.1. Q2 (R2): Metrics or experiments to assess whether the model is overfitting or memorizing the dataset.

artificial intelligence, deep learning, machine learning, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Add feedback

ETA: Energy-based Test-time Adaptation for Depth Completion

Chung, Younjoon, Park, Hyoungseob, Rim, Patrick, Zhang, Xiaoran, He, Jihe, Zeng, Ziyao, Cicek, Safa, Hong, Byung-Woo, Duncan, James S., Wong, Alex

arXiv.org Artificial IntelligenceAug-21-2025

We propose a method for test-time adaptation of pretrained depth completion models. Depth completion models, trained on some ``source'' data, often predict erroneous outputs when transferred to ``target'' data captured in novel environmental conditions due to a covariate shift. The crux of our method lies in quantifying the likelihood of depth predictions belonging to the source data distribution. The challenge is in the lack of access to out-of-distribution (target) data prior to deployment. Hence, rather than making assumptions regarding the target distribution, we utilize adversarial perturbations as a mechanism to explore the data space. This enables us to train an energy model that scores local regions of depth predictions as in- or out-of-distribution. We update the parameters of pretrained depth completion models at test time to minimize energy, effectively aligning test-time predictions to those of the source distribution. We call our method ``Energy-based Test-time Adaptation'', or ETA for short. We evaluate our method across three indoor and three outdoor datasets, where ETA improve over the previous state-of-the-art method by an average of 6.94% for outdoors and 10.23% for indoors. Project Page: https://fuzzythecat.github.io/eta.

artificial intelligence, machine learning, prediction, (18 more...)

arXiv.org Artificial Intelligence

2508.05989

Country: Asia (0.28)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (0.94)
(2 more...)

Add feedback

Learning Equivariant Energy Based Models with Equivariant Stein Variational Gradient Descent Priyank Jaini Bosch-Delta Lab University of Amsterdam Lars Holdijk

Neural Information Processing SystemsAug-15-2025, 19:17:52 GMT

We first introduce Equivari-ant Stein V ariational Gradient Descent algorithm - an equivariant sampling method based on Stein's identity for sampling from densities with symmetries.

artificial intelligence, machine learning, svgd, (15 more...)

Neural Information Processing Systems

Country: