Reviews: An Online Sequence-to-Sequence Model Using Partial Conditioning
–Neural Information Processing Systems
This is a well-done paper. It attacks a worthwhile problem: how to construct and train a sequence-to-sequence model that can operate online instead of waiting for the entire input to be received. It clearly describes an architecture for solving the problem and walks the reader through the design issues for each component: next-step prediction, the attention mechanism, and modeling the ends of blocks. It clearly explains the challenges that must be overcome to train the model and perform inference with it, and proposes reasonable approximate algorithms for both. The speech recognition experiments, which demonstrate the utility of the transducer model and explore design issues such as maintenance of recurrent state across block boundaries, block size, design of the attention mechanism, and depth of the model, are reasonable.
Jan-20-2025, 08:59:57 GMT