Goto

Collaborating Authors

 foundational


Introducing LLaMA: A foundational, 65-billion-parameter language model

#artificialintelligence

As part of Meta's commitment to open science, today we are publicly releasing LLaMA (Large Language Model Meta AI), a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI. Smaller, more performant models such as LLaMA enable others in the research community who don't have access to large amounts of infrastructure to study these models, further democratizing access in this important, fast-changing field. Training smaller foundation models like LLaMA is desirable in the large language model space because it requires far less computing power and resources to test new approaches, validate others' work, and explore new use cases. Foundation models train on a large set of unlabeled data, which makes them ideal for fine-tuning for a variety of tasks. We are making LLaMA available at several sizes (7B, 13B, 33B, and 65B parameters) and also sharing a LLaMA model card that details how we built the model in keeping with our approach to Responsible AI practices.


Webinar Request: Understanding the Foundational Needs to Support AI/ML

#artificialintelligence

Artificial Intelligence (AI) and Machine Learning (ML) are rapidly entering an organization's day-to-day operations, and data efforts are in full force at nearly all levels of an organization. Knowing and understanding the fundamentals will help ensure these changes and efforts not only have meaning but also allow for value to be gained in laying the foundation for AI/ML opportunities.


Why humanity is needed to propel conversational AI

#artificialintelligence

Were you unable to attend Transform 2022? Check out all of the summit sessions in our on-demand library now! Conversational AI is a subset of artificial intelligence (AI) that allows consumers to interact with computer applications as if they were interacting with another human. According to Deloitte, the global conversational AI market is set to grow by 22% between 2022 and 2025 and is estimated to reach $14 billion by 2025. Providing enhanced language customizations to cater to a highly diverse and vast group of hyper-local audiences, many practical applications of this include financial services, hospital wards and conferences, and can take the form of a translation app or a chatbot.


Reflections on Foundation Models

#artificialintelligence

Our work received an array of responses from a broad range of perspectives; some folks graciously shared their commentaries with us. We see open discourse as necessary for forging the right norms, best practices, and broader ecosystem around foundation models. In this blog post, we talk through why we believe these models are so important and clarify several points in relation to the community response. In addition, we support and encourage further community discussion of these complex issues; feel free to reach out at [email protected]. We define foundation models as models trained on broad data (generally using self-supervision at scale) that can be adapted to a wide range of downstream tasks.


Hitting the Books: How the interplay of science and technology brought about iPhones

Engadget

Scientific research and technological advancement have gone hand-in-hand since the invention of the wheel. Without research, we lack the knowledge base to advance the state of technology and, without technological advancement we lack the functional base for further scientific exploration. Tsao, explore the symbiotic relationship between these two concepts and how their interaction might be modulated to better serve the rapidly accelerating pace of 21st century technoscientific discovery. Excerpted from THE GENESIS OF TECHNOSCIENTIFIC REVOLUTIONS: RETHINKING THE NATURE AND NURTURE OF RESEARCH by VENKATESH NARAYANAMURTI AND JEFFREY Y. TSAO, published by Harvard University Press. The way in which scientific and technological knowledge are hierarchical stems from the nesting discussed in the last chapter, both of scientific facts and explanations and of technological functions and the forms that fulfill them.


Reflections on Foundation Models

#artificialintelligence

Recently, we released our report on foundation models, launched the Stanford Center for Research on Foundation Models (CRFM) as part of the Stanford Institute for Human-Centered AI (HAI), and hosted a workshop to foster community-wide dialogue. Our work received an array of responses from a broad range of perspectives; some folks graciously shared their commentaries with us. We see open discourse as necessary for forging the right norms, best practices, and broader ecosystem around foundation models. In this blog post, we talk through why we believe these models are so important and clarify several points in relation to the community response. In addition, we support and encourage further community discussion of these complex issues; feel free to reach out at contact-crfm@stanford.edu.


How machine learning is changing digital marketing [Q&A]

#artificialintelligence

Increasingly customers expect personalized experiences that are relevant to their unique situations and needs. However, with the increased reliance on technology needed to provide this, the human angle can go by the board. We spoke to Jon Perera, CMO at sales and marketing software specialist Highspot to learn more about how people, processes and technology can be aligned to offer optimum customer experience. BN: Why is the sales process particularly suited to automation? JP: Automation can be useful when applied correctly, but when it comes to the sales process, it's not automation but artificial intelligence that is the key.


Plug and Play Language Models: A Simple Approach to Controlled Text Generation

arXiv.org Artificial Intelligence

Large transformer-based language models (LMs) trained on huge text corpora have shown unparalleled generation capabilities. However, controlling attributes of the generated language (e.g. switching topic or sentiment) is difficult without modifying the model architecture or fine-tuning on attribute-specific data and entailing the significant cost of retraining. We propose a simple alternative: the Plug and Play Language Model (PPLM) for controllable language generation, which combines a pretrained LM with one or more simple attribute classifiers that guide text generation without any further training of the LM. In the canonical scenario we present, the attribute models are simple classifiers consisting of a user-specified bag of words or a single learned layer with 100,000 times fewer parameters than the LM. Sampling entails a forward and backward pass in which gradients from the attribute model push the LM's hidden activations and thus guide the generation. Model samples demonstrate control over a range of topics and sentiment styles, and extensive automated and human annotated evaluations show attribute alignment and fluency. PPLMs are flexible in that any combination of differentiable attribute models may be used to steer text generation, which will allow for diverse and creative applications beyond the examples given in this paper.


A day at the beach: Deep learning for a child

#artificialintelligence

The beach offers a wide open playscape where children are fuelled by curiosity. Whether at the beach or elsewhere outdoors, it helps to take a moment to see the world through the lens of a child who is discovering the world anew, and slow down to be present. Part of what happens through children's play is the exhilaration of making choices. These choices, and their consequences, are part of the child's emerging sense of agency and identity. Children's inquisitive minds crave opportunities that allow them to become designers, builders, mathematicians and innovators of their world.


The Five Weirdest Episodes of the AI Podcast in 2018 The Official NVIDIA Blog

#artificialintelligence

Think of it as a chance to rest your eyes up ahead of the New Year's Day college football frenzy in front of that new big screen TV. Simple story: our podcast does best when, and where, other media don't -- while listeners are trapped in a commute and catching up on the latest episode of Jack Ryan isn't just dangerous, it's illegal. Still, if you're tooling around town picking up supplies for New Year's festivities, or just looking to stick in your earbuds and chill out amidst the bustle of the season, we've had more than a few episodes over the past year that entertain -- as well as enlighten. But sarcasm is no joke. Long before today's sentiment analysis systems struggled to accurately understand human communication, people struggled to understand one another's sarcasm.