RITA: a Study on Scaling Up Generative Protein Sequence Models
Hesslow, Daniel, Zanichelli, Niccoló, Notin, Pascal, Poli, Iacopo, Marks, Debora
–arXiv.org Artificial Intelligence
Downstream open-source experimentation is important to discover surprising and unpredictable capabilities In this work we introduce RITA: a suite of autoregressive that are hard to discern without large-scale experimentation generative models for protein sequences, (Ganguli et al., 2022). This was recently exemplified with up to 1.2 billion parameters, trained on over when independent researchers discovered that AlphaFold 2 280 million protein sequences belonging to the (Jumper et al., 2021) could successfully predict multimer UniRef-100 database. Such generative models interactions, even though it had only been trained to predict hold the promise of greatly accelerating protein the structure of single protein chains (Yoshitaka, 2021; Baek, design. We conduct the first systematic study of 2021). In addition, there exists no systematic study about how capabilities evolve with model size for autoregressive the evolution of capabilities with respect to model size in transformers in the protein domain: the protein domain: Rao et al. (2020) and Rives et al. (2021) we evaluate RITA models in next amino acid prediction, provided such a study for bidirectional transformers, and zero-shot fitness, and enzyme function Madani et al. (2020) simply noted that their largest model prediction, showing benefits from increased scale.
arXiv.org Artificial Intelligence
Jul-14-2022
- Country:
- North America > United States
- Massachusetts > Suffolk County > Boston (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- France > Île-de-France
- United Kingdom > England
- Asia > Middle East
- Lebanon > Keserwan-Jbeil Governorate > Blat (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Industry:
- Technology: