Training Binary Neural Networks via Gaussian Variational Inference and Low-Rank Semidefinite Programming

May-27-2025, 05:37:00 GMT - Neural Information Processing Systems

Current methods for training Binarized Neural Networks (BNNs) heavily rely on the heuristic straight-through estimator (STE), which crucially enables the application of SGD-based optimizers to the combinatorial training problem. Although the STE heuristics and their variants have led to significant improvements in BNN performance, their theoretical underpinnings remain unclear and relatively understudied. In this paper, we propose a theoretically motivated optimization framework for BNN training based on Gaussian variational inference. In its simplest form, our approach yields a non-convex linear programming formulation whose variables and associated gradients motivate the use of latent weights and STE gradients. More importantly, our framework allows us to formulate semidefinite programming (SDP) relaxations to the BNN training task.
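To make the role of latent weights and the straight-through estimator concrete, the following is a minimal sketch of generic STE training for a single neuron with weights in {-1, +1}. It is not the framework proposed in the paper: the function names (`sign`, `ste_grad`, `train_binary_neuron`), the squared loss, the clipping threshold, and the toy data are all assumptions chosen for illustration. The forward pass uses binarized weights, while the gradient with respect to each binary weight is copied onto its real-valued latent weight, which is the heuristic step the abstract refers to.

```python
def sign(x):
    # Binarize to {-1, +1}; zero maps to +1 by convention in this sketch.
    return 1.0 if x >= 0 else -1.0


def ste_grad(latent, upstream, clip=1.0):
    # Straight-through estimator (assumed clipped variant): treat the
    # derivative of sign(latent) as 1 when |latent| <= clip, and 0 otherwise.
    return upstream if abs(latent) <= clip else 0.0


def train_binary_neuron(data, steps=200, lr=0.1):
    """Fit y ~ sum_i sign(latent_i) * x_i under squared loss with STE gradients.

    `data` is a list of (x, y) pairs, where x is a list of floats.
    Returns the learned binary weights.
    """
    dim = len(data[0][0])
    # Real-valued latent weights; small positive values as an arbitrary start.
    latent = [0.01 * (i + 1) for i in range(dim)]
    for _ in range(steps):
        grads = [0.0] * dim
        for x, y in data:
            w = [sign(v) for v in latent]  # forward pass uses binary weights
            pred = sum(wi * xi for wi, xi in zip(w, x))
            err = pred - y
            for i in range(dim):
                # Gradient w.r.t. the binary weight, passed straight through
                # to the latent weight.
                grads[i] += ste_grad(latent[i], 2.0 * err * x[i])
        # Plain gradient descent step on the latent weights.
        latent = [v - lr * g / len(data) for v, g in zip(latent, grads)]
    return [sign(v) for v in latent]


if __name__ == "__main__":
    # Toy data generated by the binary weights [1, -1, 1].
    toy = [([1.0, 0.0, 0.0], 1.0),
           ([0.0, 1.0, 0.0], -1.0),
           ([0.0, 0.0, 1.0], 1.0)]
    print(train_binary_neuron(toy))
```

On this toy problem the second latent weight starts positive, receives a nonzero pass-through gradient, crosses zero, and then stops moving once the binarized prediction matches the target. The sketch only illustrates why the sign function needs a surrogate gradient for SGD-style updates to work at all; it says nothing about the variational or SDP formulations described in the abstract.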

artificial intelligence, inference and low-rank semidefinite programming, machine learning

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.64)
  - Representation & Reasoning > Optimization (0.62)