Transforming Neural-Net Output Levels to Probability Distributions

Denker, John S., LeCun, Yann

Neural Information Processing Systems 

John S. Denker and Yann LeCun
AT&T Bell Laboratories, Holmdel, NJ 07733

Abstract

(1) The outputs of a typical multi-output classification network do not satisfy the axioms of probability; probabilities should be positive and sum to one. This problem can be solved by treating the trained network as a preprocessor that produces a feature vector that can be further processed, for instance by classical statistical estimation techniques. (2) We present a method for computing the first two moments of the probability distribution indicating the range of outputs that are consistent with the input and the training data. It is particularly useful to combine these two ideas: we implement the ideas of section 1 using Parzen windows, where the shape and relative size of each window is computed using the ideas of section 2. This allows us to make contact between important theoretical ideas (e.g. the ensemble formalism) and practical techniques (e.g. back-prop). Our results also shed new light on and generalize the well-known "softmax" scheme. In speech recognition, for example, the outputs represent the probability of C different phonemes; the probabilities of successive segments can be combined using a Hidden Markov Model.
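Below is a minimal sketch of the two ingredients named in the abstract, under stated assumptions: softmax is the standard normalization the paper generalizes, and the Parzen-window estimator treats the trained network's output as a feature vector, as section 1 suggests. The names parzen_class_probs and widths are illustrative, not from the paper, and the caller-supplied per-window widths stand in for the window shapes and sizes the paper derives from its section-2 moment analysis. Python with NumPy is assumed.

    import numpy as np

    def softmax(z):
        # Shift by the max for numerical stability; the result is
        # positive and sums to one, satisfying the probability axioms.
        e = np.exp(z - np.max(z))
        return e / e.sum()

    def parzen_class_probs(f, train_feats, train_labels, n_classes, widths):
        # Place a Gaussian Parzen window on each stored training feature
        # vector; each window has its own width w (supplied here by the
        # caller, whereas the paper computes window shape/size from its
        # moment analysis). Accumulate per-class scores, then normalize.
        d = len(f)
        scores = np.zeros(n_classes)
        for x, y, w in zip(train_feats, train_labels, widths):
            dist2 = np.sum((f - x) ** 2)
            scores[y] += np.exp(-dist2 / (2.0 * w * w)) / (w ** d)
        return scores / scores.sum()

    # Toy usage: three stored feature vectors, two classes.
    feats = [np.array([0.0, 0.0]), np.array([1.0, 1.0]), np.array([0.9, 1.1])]
    labels = [0, 1, 1]
    print(parzen_class_probs(np.array([0.8, 0.9]), feats, labels, 2,
                             [0.5, 0.5, 0.5]))
    print(softmax(np.array([2.0, 1.0, -1.0])))

Normalizing the accumulated window scores plays the same role as the softmax denominator, which is the sense in which the abstract's construction can be read as a generalization of softmax.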
