Probabilistically-Sound Beam Search with Masked Language Models
Brooks, Creston, Calef, Robert, Cowen-Breen, Charlie, Sappington, Anna
–arXiv.org Artificial Intelligence
Beam search with masked language models (MLMs) is challenging in part because joint probability distributions over sequences are not readily available, unlike for autoregressive models. However, estimating such distributions has important domain-specific applications such as ancient text restoration and protein engineering. Here we present probabilistically-sound methods for beam search with MLMs. First, we clarify the conditions under which it is theoretically sound to perform text infilling with MLMs using standard beam search. When these conditions fail, we provide a probabilistically-sound modification with no additional computational complexity and demonstrate that it is superior to the aforementioned beam search in the expected conditions. We then present empirical results comparing several infilling approaches with MLMs across several domains.
arXiv.org Artificial Intelligence
Jul-9-2024
- Country:
- North America > United States
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Pennsylvania > Philadelphia County
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- United Kingdom > England
- North America > United States
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine (0.93)
- Technology: