AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

Google's AI Lab, DeepMind, Offers 'Gift to Humanity' with Protein Structure Solution

TIME - TechJul-28-2022, 11:01:37 GMT

Matt Higgins and his team of researchers at the University of Oxford had a problem. For years, they had been studying the parasite that spreads malaria, a disease that still kills hundreds of thousands of people every year. They had identified an important protein on the surface of the parasite as a focal point for a potential future vaccine. They knew its underlying chemical code. But the protein's all-important 3D structure was eluding them. That shape was the key to developing the right vaccine to slide in and block the parasite from infecting human cells.

deepmind, hassabis, protein, (14 more...)

TIME - Tech

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.25)

Genre: Research Report (0.35)

Industry:

Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.94)
Health & Medicine > Therapeutic Area > Vaccines (0.73)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.53)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.53)

Add feedback

DeepMind uncovers structure of 200m proteins in scientific leap forward

The GuardianJul-28-2022, 11:00:05 GMT

Artificial intelligence has deciphered the structure of virtually every protein known to science, paving the way for the development of new medicines or technologies to tackle global challenges such as famine or pollution. Proteins are the building blocks of life. Formed of chains of amino acids, folded up into complex shapes, their 3D structure largely determines their function. Once you know how a protein folds up, you can start to understand how it works, and how to change its behaviour. Although DNA provides the instructions for making the chain of amino acids, predicting how they interact to form a 3D shape was more tricky and, until recently, scientists had only deciphered a fraction of the 200m or so proteins known to science.

deepmind uncover structure, protein, scientific leap forward, (10 more...)

The Guardian

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.06)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.45)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)

Add feedback

DeepMind has predicted the structure of almost every protein known to science

MIT Technology ReviewJul-28-2022, 11:00:00 GMT

However, for many proteins "we're interested in understanding how their structure is altered by mutations and natural allelic variation, and that won't be addressed by this database," said AlQuraishi. "But of course the field is developing fast, and so I expect tools to accurately model protein variants will begin to appear soon," he added. The quality of AlphaFold's predictions may also not be as accurate for rarer proteins with less available evolutionary information, says Peng. The move is the latest development in DeepMind's push into "digital biology," where "AI and computational methods can help to understand and model important biological processes," said Hassabis. Hassabis also leads a new venture, also owned by Alphabet, called Isomorphic Labs, which is developing AI for drug discovery. Pushmeet Kohli, head of AI for science at DeepMind, said the company has plenty of challenges in the life sciences it still wants to tackle, such as how proteins behave and interact with other proteins.

deepmind, hassabis, protein

MIT Technology Review

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.83)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback

Best Practices for Deploying Language Models

#artificialintelligenceJul-28-2022, 02:25:13 GMT

Cohere, OpenAI, and AI21 Labs have developed a preliminary set of best practices applicable to any organization developing or deploying large language models. Computers that can read and write are here, and they have the potential to fundamentally impact daily life. The future of human-machine interaction is full of possibility and promise, but any powerful technology needs careful deployment. The joint statement below represents a step towards building a community to address the global challenges presented by AI progress, and we encourage other organizations who would like to participate to get in touch. We're recommending several key principles to help providers of large language models (LLMs) mitigate the risks of this technology in order to achieve its full promise to augment human capabilities.

deploying language model, development and deployment, language model, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Ex-Google engineer Blake Lemoine discusses sentient AI

#artificialintelligenceJul-28-2022, 00:29:16 GMT

Software engineer Blake Lemoine worked with Google's Ethical AI team on Language Model for Dialog Applications (LaMDA), examining the large language model for bias on topics such as sexual orientation, gender, identity, ethnicity, and religion. Over the course of several months, Lemoine, who identifies as a Christian mystic, hypothesized that LaMDA was a living being, based on his spiritual beliefs. Lemoine published transcripts of his conversations with LaMDA and blogs about AI ethics surrounding LaMDA. In June, Google put Lemoine on administrative leave; last week, he was fired. In a statement, Google said Lemoine's claims that LaMDA is sentient are "wholly unfounded." "It's regrettable that despite lengthy engagement on this topic, Blake still chose to persistently violate clear employment and data security policies that include the need to safeguard product information," Google said in a statement.

google, lamda, lemoine, (13 more...)

#artificialintelligence

Genre: Personal > Interview (0.70)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.87)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.53)

Add feedback

Efficient Training of Language Models to Fill in the Middle

Bavarian, Mohammad, Jun, Heewoo, Tezak, Nikolas, Schulman, John, McLeavey, Christine, Tworek, Jerry, Chen, Mark

arXiv.org Artificial IntelligenceJul-28-2022

We show that autoregressive language models can learn to infill text after we apply a straightforward transformation to the dataset, which simply moves a span of text from the middle of a document to its end. While this data augmentation has garnered much interest in recent years, we provide extensive evidence that training models with a large fraction of data transformed in this way does not harm the original left-to-right generative capability, as measured by perplexity and sampling evaluations across a wide range of scales. Given the usefulness, simplicity, and efficiency of training models to fill-in-the-middle (FIM), we suggest that future autoregressive language models be trained with FIM by default. To this end, we run a series of ablations on key hyperparameters, such as the data transformation frequency, the structure of the transformation, and the method of selecting the infill span. We use these ablations to prescribe strong default settings and best practices to train FIM models. We have released our best infilling model trained with best practices in our API, and release our infilling benchmarks to aid future research.

fim rate, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2207.14255

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
Europe > Italy > Tuscany > Florence (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(4 more...)

Genre: Research Report > New Finding (0.92)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

LAD: Language Models as Data for Zero-Shot Dialog

Mehri, Shikib, Altun, Yasemin, Eskenazi, Maxine

arXiv.org Artificial IntelligenceJul-28-2022

However, fine-tuning can be impractical dialog remains elusive. A likely reason for this (e.g., in academic settings) with large LMs (e.g., discrepancy is that dialog models require significant GPT-3) due to the cost, computational power and data because they need to learn task-specific immutable architectures. To this end, this paper structural constraints, such as the domain ontology aims to address the following: 'How can we leverage and the dialog policy. While large language the strong language understanding and generation models (e.g., GPT-3) exhibit strong language understanding abilities of large LMs to facilitate zero-shot and generation abilities (Brown et al., generalization in task-oriented dialog?' 2020), they have no a priori knowledge of the Given the in-context meta-learning abilities of structural constraints implied by a specific (unseen) large LMs (Brown et al., 2020), prior work has problem setting (e.g., relevant intents, dialog policy, explored prompt-engineering or prompt-tuning etc.). As such, in order to adapt a pre-trained (Reynolds and McDonell, 2021; Lester et al., 2021; LM for task-oriented dialog, it is necessary to impose Madotto et al., 2021). Well-designed prompts can structural constraints on the unstructured convey the necessary structural constraints.

dialog, prediction, structural constraint, (14 more...)

arXiv.org Artificial Intelligence

2207.14393

Country:

North America > United States > New York (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > California (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.82)

Industry: Consumer Products & Services (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Instead of AI sentience, focus on the current risks of large language models

#artificialintelligenceJul-27-2022, 19:20:39 GMT

Hear from top leaders discuss topics surrounding AL/ML technology, conversational AI, IVA, NLP, Edge, and more. Recently, a Google engineer made international headlines when he asserted that LaMDA, their system for building chatbots, was sentient. Since his initial post, public debate has raged over whether artificial intelligence (AI) exhibits consciousness and experiences feelings as acutely as humans. While the topic is undoubtedly fascinating, it's also overshadowing other, more pressing risks such as unfairness and privacy loss posed by large-scale language models (LLMs), especially for companies that are racing to integrate these models into their products and services. These risks are further amplified by the fact that the companies deploying these models often lack insight into the specific data and methods used to create them, which can lead to issues of bias, hate speech and stereotyping.

datadecisionmaker, language model, llm, (11 more...)

#artificialintelligence

Industry: Information Technology (0.51)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.91)

Add feedback

The leading job board for Cyber Security Jobs. Start your career in Information Security Industry. Jobs are updated daily.

#artificialintelligenceJul-27-2022, 19:20:35 GMT

Security is at the foundation of OpenAI's mission to ensure that artificial general intelligence benefits all of humanity. The Security team protects OpenAI's technology, people, and products. We are technical in what we build but are operational in how we do our work, and are committed to supporting all products and research at OpenAI. Our Security team tenets include: prioritizing for impact, enabling researchers, preparing for future transformative technologies, and engaging a robust security culture. As a Security Engineering Manager, you will lead OpenAI engineers and researchers in building transformational and secure AI technologies.

cyber security job, information security industry, openai, (3 more...)

#artificialintelligence

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

OpenAI's GPT-3 is a convincing philosopher

#artificialintelligenceJul-27-2022, 19:20:08 GMT

A study has found that OpenAI's GPT-3 is capable of being indistinguishable from a human philosopher. The now infamous GPT-3 is a powerful autoregressive language model that uses deep learning to produce human-like text. Eric Schwitzgebel, Anna Strasser, and Matthew Crosby set out to find out whether GPT-3 can replicate a human philosopher. The team "fine-tuned" GPT-3 based on philosopher Daniel Dennet's corpus. Ten philosophical questions were then posed to both the real Dennet and GPT-3 to see whether the AI could match its renowned human counterpart.

gpt-3, philosopher, schwitzgebel, (9 more...)

#artificialintelligence

Country:

North America > United States > California (0.06)
Europe > Netherlands > North Holland > Amsterdam (0.06)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.64)

Add feedback