Autoencoders, Minimum Description Length and Helmholtz Free Energy

Apr-6-2023, 18:57:07 GMT–Neural Information Processing Systems

An autoencoder network uses a set of recognition weights to convert an input vector into a code vector. It then uses a set of generative weights to convert the code vector into an approximate reconstruction of the input vector. We derive an objective function for training autoencoders based on the Minimum Description Length (MDL) principle. The aim is to minimize the information required to describe both the code vector and the reconstruction error. We show that this information is minimized by choosing code vectors stochastically according to a Boltzmann distribution, where the generative weights define the energy of each possible code vector given the input vector.
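
The claim that stochastic code selection beats always picking the cheapest code can be checked numerically. The sketch below is not the paper's implementation: it assumes that the energy of a code is its total description cost in nats (code cost plus reconstruction-error cost), that the temperature is 1, and that the quantity being minimized is the Helmholtz free energy (expected energy minus entropy). The three energy values and all function names are hypothetical, chosen only for illustration.

```python
import math

def boltzmann(energies, temperature=1.0):
    """Return probabilities p_i proportional to exp(-E_i / T)."""
    # Subtract the minimum energy before exponentiating, for numerical stability.
    e_min = min(energies)
    weights = [math.exp(-(e - e_min) / temperature) for e in energies]
    z = sum(weights)
    return [w / z for w in weights]

def free_energy(probs, energies, temperature=1.0):
    """Helmholtz free energy: expected energy minus T times entropy (nats)."""
    expected_energy = sum(p * e for p, e in zip(probs, energies))
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    return expected_energy - temperature * entropy

# Hypothetical description costs (nats) of three candidate codes for one input.
energies = [2.0, 2.5, 4.0]

p_boltz = boltzmann(energies)        # stochastic choice, Boltzmann distribution
p_greedy = [1.0, 0.0, 0.0]           # always pick the lowest-energy code
p_uniform = [1 / 3, 1 / 3, 1 / 3]    # ignore the energies entirely

f_boltz = free_energy(p_boltz, energies)
f_greedy = free_energy(p_greedy, energies)
f_uniform = free_energy(p_uniform, energies)

# At T = 1 the minimum free energy equals -log of the partition function.
neg_log_z = -math.log(sum(math.exp(-e) for e in energies))

print(f"Boltzmann: {f_boltz:.4f}  greedy: {f_greedy:.4f}  uniform: {f_uniform:.4f}")
```

Under these assumptions the Boltzmann choice gives a lower free energy than either alternative, and its value matches the negative log partition function, which is the standard result for minimizing expected energy minus entropy over a probability distribution.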

artificial intelligence, machine learning, minimum description length

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Computational Learning Theory > Minimum Complexity Machines (0.99)
  - Neural Networks (0.94)