Unsupervised learning of distributions on binary vectors using two layer networks

Apr-6-2023, 19:17:50 GMT–Neural Information Processing Systems

We study a particular type of Boltzmann machine with a bipartite graph structure called a harmo(cid:173) nium. Our interest is in using such a machine to model a probability distribution on binary input vectors. We analyze the class of probability distributions that can be modeled by such machines. We then present two learning algorithms for these machines .. The first learning algorithm is the standard gradient ascent heuristic for computing maximum likelihood estimates for the parameters (i.e. The second learning algorithm is a greedy method that creates the hidden units and computes their weights one at a time.

binary vector, layer network, unsupervised learning, (5 more...)

Neural Information Processing Systems

Apr-6-2023, 19:17:50 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Learning Graphical Models (0.68)
  - Representation & Reasoning > Uncertainty (0.43)