RITA: a Study on Scaling Up Generative Protein Sequence Models

Hesslow, Daniel, Zanichelli, Niccoló, Notin, Pascal, Poli, Iacopo, Marks, Debora

Jul-14-2022–arXiv.org Artificial Intelligence

Downstream open-source experimentation is important to discover surprising and unpredictable capabilities In this work we introduce RITA: a suite of autoregressive that are hard to discern without large-scale experimentation generative models for protein sequences, (Ganguli et al., 2022). This was recently exemplified with up to 1.2 billion parameters, trained on over when independent researchers discovered that AlphaFold 2 280 million protein sequences belonging to the (Jumper et al., 2021) could successfully predict multimer UniRef-100 database. Such generative models interactions, even though it had only been trained to predict hold the promise of greatly accelerating protein the structure of single protein chains (Yoshitaka, 2021; Baek, design. We conduct the first systematic study of 2021). In addition, there exists no systematic study about how capabilities evolve with model size for autoregressive the evolution of capabilities with respect to model size in transformers in the protein domain: the protein domain: Rao et al. (2020) and Rives et al. (2021) we evaluate RITA models in next amino acid prediction, provided such a study for bidirectional transformers, and zero-shot fitness, and enzyme function Madani et al. (2020) simply noted that their largest model prediction, showing benefits from increased scale.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Jul-14-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Massachusetts > Suffolk County > Boston (0.04)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.14)
  - France > Île-de-France
    - Paris > Paris (0.04)
- Asia > Middle East
  - Lebanon > Keserwan-Jbeil Governorate > Blat (0.04)

Genre:
- Research Report (0.50)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found