Maximum Likelihood Decoding with RNNs - the good, the bad, and the ugly - The Stanford Natural Language Processing Group

Apr-27-2016, 21:35:24 GMT–@machinelearnbot

Training Tensorflow's large language model on the Penn Tree Bank yields a test perplexity of 82. It depends on your personal taste. The high temperature sample displays greater linguistic variety, but the low temperature sample is more grammatically correct. Such is the world of temperature sampling - lowering the temperature allows you to focus on higher probability output sequences and smooth over deficiencies of the model. Temperature sampling works by increasing the probability of the most likely words before sampling.

artificial intelligence, bayesian inference, machine learning, (9 more...)

@machinelearnbot

Apr-27-2016, 21:35:24 GMT

News Web Page

Add feedback

Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.43)
  - Machine Learning > Learning Graphical Models
    - Directed Networks > Bayesian Learning (0.43)