Blockwise Parallel Decoding for Deep Autoregressive Models
Mitchell Stern, Noam Shazeer, Jakob Uszkoreit
–Neural Information Processing Systems
To overcome this limitation, we propose a novel blockwise parallel decoding scheme in which we make predictions for multiple time steps in parallel then back off to the longest prefix validated by a scoring model.
Neural Information Processing Systems
Nov-20-2025, 19:52:51 GMT
- Country:
- North America
- Canada > Quebec
- Montreal (0.04)
- United States > California
- Alameda County > Berkeley (0.04)
- Canada > Quebec
- North America
- Genre:
- Research Report (0.48)
- Technology: