[R] [1705.10929] Adversarial Generation of Natural Language • r/MachineLearning
I have also tried extensively to use WGANs to generate language sequences. I just don't understand why they don't converge to results as good as maximum likelihood. Even with curriculum learning and peephole LSTMs, you would think training would converge to a good optimum, but the results still show that maximum likelihood is the better approach. I don't think the Cramér GAN will make that big of a difference, but I think it's worth a try to further improve upon this work. Can anyone think of why this doesn't work better than maximum likelihood?
Jun-1-2017, 17:00:13 GMT