Reviews: Professor Forcing: A New Algorithm for Training Recurrent Networks

Jan-20-2025, 07:03:32 GMT–Neural Information Processing Systems

The idea in this paper is interesting and well motivated. When training an RNN for generation, using the likelihood of the observed data is not the proper criterion. Even though the problem and approach are interesting, I find the description of the training objective (Sec. Precise remarks follow:

- The random variable y is present in two expectations in Eq. (1). Given an RNN with a given sequence of inputs, all the y can be computed.

