ShortListing Model: A Streamlined SimplexDiffusion for Discrete Variable Generation
Song, Yuxuan, Zhang, Zhe, Pei, Yu, Gong, Jingjing, Yu, Qiying, Zhang, Zheng, Wang, Mingxuan, Zhou, Hao, Liu, Jingjing, Ma, Wei-Ying
–arXiv.org Artificial Intelligence
Generative modeling of discrete variables is challenging yet crucial for applications in natural language processing and biological sequence design. We introduce the Shortlisting Model (SLM), a novel simplex-based diffusion model inspired by progressive candidate pruning. SLM operates on simplex centroids, reducing generation complexity and enhancing scalability. Additionally, SLM incorporates a flexible implementation of classifier-free guidance, enhancing unconditional generation performance. Extensive experiments on DNA promoter and enhancer design, protein design, character-level and large-vocabulary language modeling demonstrate the competitive performance and strong potential of SLM. Our code can be found at https://github.com/GenSI-THUAIR/SLM
arXiv.org Artificial Intelligence
Aug-26-2025
- Country:
- Africa > Nigeria (0.04)
- Asia
- Middle East
- Iraq > Baghdad Governorate
- Baghdad (0.04)
- Jordan (0.04)
- Republic of Türkiye (0.04)
- Syria (0.04)
- Iraq > Baghdad Governorate
- Russia (0.04)
- Middle East
- Europe
- Finland (0.04)
- France (0.04)
- Russia (0.04)
- United Kingdom (0.04)
- North America
- Canada > Quebec (0.04)
- Cuba > Guantánamo Province
- Guantánamo (0.04)
- United States
- Arizona (0.04)
- Illinois (0.04)
- Maryland (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Texas (0.04)
- Virginia (0.04)
- Genre:
- Research Report (0.50)
- Workflow (0.46)
- Industry:
- Technology: