A Frustratingly Simple Decoding Method for Neural Text Generation
Yang, Haoran, Cai, Deng, Li, Huayang, Bi, Wei, Lam, Wai, Shi, Shuming
–arXiv.org Artificial Intelligence
We introduce a frustratingly simple, super efficient and surprisingly effective decoding method, which we call Frustratingly Simple Decoding (FSD), for neural text generation. The idea behind FSD is straightforward: we build an anti-LM based on previously generated text and use this anti-LM to penalize future generation of what has been generated. The anti-LM can be implemented as simple as an n-gram language model or a vectorized variant. In this way, FSD introduces no extra model parameters and negligible computational overhead (FSD can be as fast as greedy search). Despite the simplicity, FSD is surprisingly effective; Experiments show that FSD can outperform the canonical methods to date (i.e., nucleus sampling) as well as several strong baselines that were proposed recently.
arXiv.org Artificial Intelligence
May-21-2023
- Country:
- Africa (0.68)
- Asia
- Malaysia (0.68)
- Middle East (0.93)
- Europe (1.00)
- North America
- Haiti (0.93)
- United States > California (0.67)
- Genre:
- Personal
- Research Report (1.00)
- Industry:
- Banking & Finance (1.00)
- Energy > Oil & Gas
- Midstream (1.00)
- Government
- Foreign Policy (0.67)
- Military (1.00)
- Regional Government
- Asia Government (0.67)
- North America Government > United States Government (1.00)
- Space Agency (0.68)
- Voting & Elections (0.67)
- Leisure & Entertainment
- Games (0.93)
- Zoo & Circus (0.92)
- Materials > Chemicals
- Commodity Chemicals > Petrochemicals
- LNG (1.00)
- Industrial Gases > Liquified Gas (1.00)
- Commodity Chemicals > Petrochemicals
- Media (1.00)
- Transportation (1.00)
- Technology: