On the Opportunities and Risks of Foundation Models
Bommasani, Rishi, Hudson, Drew A., Adeli, Ehsan, Altman, Russ, Arora, Simran, von Arx, Sydney, Bernstein, Michael S., Bohg, Jeannette, Bosselut, Antoine, Brunskill, Emma, Brynjolfsson, Erik, Buch, Shyamal, Card, Dallas, Castellon, Rodrigo, Chatterji, Niladri, Chen, Annie, Creel, Kathleen, Davis, Jared Quincy, Demszky, Dora, Donahue, Chris, Doumbouya, Moussa, Durmus, Esin, Ermon, Stefano, Etchemendy, John, Ethayarajh, Kawin, Fei-Fei, Li, Finn, Chelsea, Gale, Trevor, Gillespie, Lauren, Goel, Karan, Goodman, Noah, Grossman, Shelby, Guha, Neel, Hashimoto, Tatsunori, Henderson, Peter, Hewitt, John, Ho, Daniel E., Hong, Jenny, Hsu, Kyle, Huang, Jing, Icard, Thomas, Jain, Saahil, Jurafsky, Dan, Kalluri, Pratyusha, Karamcheti, Siddharth, Keeling, Geoff, Khani, Fereshte, Khattab, Omar, Koh, Pang Wei, Krass, Mark, Krishna, Ranjay, Kuditipudi, Rohith, Kumar, Ananya, Ladhak, Faisal, Lee, Mina, Lee, Tony, Leskovec, Jure, Levent, Isabelle, Li, Xiang Lisa, Li, Xuechen, Ma, Tengyu, Malik, Ali, Manning, Christopher D., Mirchandani, Suvir, Mitchell, Eric, Munyikwa, Zanele, Nair, Suraj, Narayan, Avanika, Narayanan, Deepak, Newman, Ben, Nie, Allen, Niebles, Juan Carlos, Nilforoshan, Hamed, Nyarko, Julian, Ogut, Giray, Orr, Laurel, Papadimitriou, Isabel, Park, Joon Sung, Piech, Chris, Portelance, Eva, Potts, Christopher, Raghunathan, Aditi, Reich, Rob, Ren, Hongyu, Rong, Frieda, Roohani, Yusuf, Ruiz, Camilo, Ryan, Jack, Ré, Christopher, Sadigh, Dorsa, Sagawa, Shiori, Santhanam, Keshav, Shih, Andy, Srinivasan, Krishnan, Tamkin, Alex, Taori, Rohan, Thomas, Armin W., Tramèr, Florian, Wang, Rose E., Wang, William, Wu, Bohan, Wu, Jiajun, Wu, Yuhuai, Xie, Sang Michael, Yasunaga, Michihiro, You, Jiaxuan, Zaharia, Matei, Zhang, Michael, Zhang, Tianyi, Zhang, Xikun, Zhang, Yuhui, Zheng, Lucia, Zhou, Kaitlyn, Liang, Percy
–arXiv.org Artificial Intelligence
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles (e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications (e.g., law, healthcare, education) and societal impact (e.g., inequity, misuse, economic and environmental impact, legal and ethical considerations). Though foundation models are based on standard deep learning and transfer learning, their scale results in new emergent capabilities, and their effectiveness across so many tasks incentivizes homogenization. Homogenization provides powerful leverage but demands caution, as the defects of the foundation model are inherited by all the adapted models downstream. Despite the impending widespread deployment of foundation models, we currently lack a clear understanding of how they work, when they fail, and what they are even capable of due to their emergent properties. To tackle these questions, we believe much of the critical research on foundation models will require deep interdisciplinary collaboration commensurate with their fundamentally sociotechnical nature.
Aug-18-2021
- Country:
  - Oceania > Australia (0.13)
  - Africa (0.13)
  - North America
    - United States
      - Illinois (0.14)
      - Texas (0.13)
      - Louisiana (0.13)
      - California > Los Angeles County (0.13)
      - New York > New York County
        - New York City (0.27)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - Massachusetts > Middlesex County
        - Cambridge (0.13)
    - Canada > British Columbia
  - Europe
    - Germany (0.45)
    - Italy (0.27)
    - France (0.27)
    - Belgium (0.14)
    - Netherlands (0.13)
    - Denmark (0.13)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.13)
  - Asia
    - China (0.27)
    - Middle East (0.27)
    - Russia (0.27)
    - Japan (0.13)
- Genre:
  - Overview (1.00)
  - Instructional Material (1.00)
  - Research Report
    - New Finding (1.00)
    - Experimental Study (1.00)
    - Promising Solution (0.92)
- Industry:
  - Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
  - Social Sector (1.00)
  - Leisure & Entertainment > Games (1.00)
  - Media > News (1.00)
  - Banking & Finance > Economy (1.00)
  - Materials > Chemicals (0.92)
  - Automobiles & Trucks > Manufacturer (0.67)
  - Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (0.67)
  - Information Technology
    - Services (1.00)
    - Security & Privacy (1.00)
  - Energy
    - Power Industry (1.00)
    - Oil & Gas (1.00)
    - Renewable (0.67)
  - Law
    - Statutes (1.00)
    - Litigation (1.00)
    - Intellectual Property & Technology Law (1.00)
    - Criminal Law (1.00)
    - Civil Rights & Constitutional Law (1.00)
    - Environmental Law (0.87)
    - Government & the Courts (0.67)
  - Health & Medicine
    - Pharmaceuticals & Biotechnology (1.00)
    - Health Care Providers & Services (1.00)
    - Diagnostic Medicine > Imaging (1.00)
    - Consumer Health (1.00)
    - Government Relations & Public Policy (0.92)
    - Health Care Technology > Medical Record (0.67)
    - Public Health (0.67)
    - Therapeutic Area
      - Oncology (1.00)
      - Neurology (1.00)
      - Infections and Infectious Diseases (1.00)
      - Immunology (1.00)
  - Government
  - Education
  - Transportation
- Technology:
  - Information Technology > Artificial Intelligence
    - Vision > Image Understanding (1.00)
    - Issues > Social & Ethical Issues (1.00)
    - Cognitive Science > Problem Solving (1.00)
    - Representation & Reasoning
      - Search (1.00)
      - Expert Systems (1.00)
      - Agents (1.00)
    - Natural Language
      - Text Processing (1.00)
      - Large Language Model (1.00)
      - Chatbot (1.00)
    - Machine Learning
      - Statistical Learning (1.00)
      - Neural Networks > Deep Learning
        - Generative AI (0.65)