CELL-E 2: Translating Proteins to Pictures and Back with a Bidirectional Text-to-Image Transformer
Neural Information Processing Systems
We present CELL-E 2, a novel bidirectional transformer that can generate images depicting protein subcellular localization from amino acid sequences, and vice versa. Predicting protein localization is a challenging problem that requires integrating sequence and image information, something most existing methods do not do. CELL-E 2 extends the work of CELL-E: it not only captures the spatial complexity of protein localization and produces probability estimates of localization atop a nucleus image, but can also generate sequences from images, enabling de novo protein design. We train and fine-tune CELL-E 2 on two large-scale datasets of human proteins. We also demonstrate how to use CELL-E 2 to create hundreds of novel nuclear localization signals (NLS).
Oct-8-2025, 03:20:38 GMT
- Country:
- North America
- Canada > Ontario
- Toronto (0.04)
- United States > California
- Alameda County > Berkeley (0.04)
- San Francisco County > San Francisco (0.14)
- Genre:
- Research Report (0.69)
- Technology: