AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

The GPT-3 economy

#artificialintelligenceSep-22-2020, 10:55:15 GMT

Since its release, GPT-3, OpenAI's massive language model, has been the topic of much discussion among developers, researchers, entrepreneurs, and journalists. Most of those discussions have been focused on the capabilities of the AI-powered text generator. But much about GPT-3 remains obscure. The company has opted to commercialize the deep learning model instead of making it freely available to the public. And though the AI has shown to be capable of many interesting feats, it's not yet clear if GPT-3 will become a real product or will join the endless array of abandoned projects that never found a viable business model. Earlier this month, as reported by users who have access to the beta version of the language model, OpenAI declared the initial pricing plan of GPT-3.

gpt-3, large language model, machine learning, (20 more...)

#artificialintelligence

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.94)
Health & Medicine > Therapeutic Area > Immunology (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.54)

Add feedback

LI artificial intelligence startup can write your emails

#artificialintelligenceSep-22-2020, 10:55:08 GMT

The time you spend writing emails could be cut drastically. A college student has launched a Long Island artificial intelligence startup that writes emails automatically from a few fragmentary notes. Matt Shumer, 20, who launched Melville-based OthersideAI Inc. in July, is beta-testing the software as a Chrome browser extension for users of the Google Gmail service. Versions for Microsoft Corp.'s Outlook and other email clients are planned, said Shumer, who is the company's chief executive. OthersideAI's Quick Response software can reduce time spent on emails by 75% by learning the way "a user thinks and responds," the company said.

large language model, machine learning, natural language, (14 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.06)
Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.06)

Industry: Education > Educational Setting (0.75)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.42)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.42)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.42)

Add feedback

The Rise of the Transformers

#artificialintelligenceSep-21-2020, 14:20:21 GMT

Rise of the Transformers with Self-Attention Mechanism The intention of this article is to continue in answering the questions that my friends April Rudin, Tripp Braden, Danielle Guzman and Richard Foster-Fletcher asked about the future of AI. Furthermore Irene Iyakovet interview with me about how

large language model, machine learning, natural language, (20 more...)

#artificialintelligence

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Texas (0.04)

Genre: Research Report (0.47)

Industry:

Leisure & Entertainment (0.47)
Information Technology (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.95)

Add feedback

GPT-3: new AI can write like a human but don't mistake that for thinking – neuroscientist

#artificialintelligenceSep-21-2020, 02:20:17 GMT

Since it was unveiled earlier this year, the new AI-based language generating software GPT-3 has attracted much attention for its ability to produce passages of writing that are convincingly human-like. Some have even suggested that the program, created by Elon Musk's OpenAI, may be considered or appears to exhibit, something like artificial general intelligence (AGI), the ability to understand or perform any task a human can. This breathless coverage reveals a natural yet aberrant collusion in people's minds between the appearance of language and the capacity to think. Language and thought, though obviously not the same, are strongly and intimately related. And some people tend to assume that language is the ultimate sign of thought.

gpt-3, large language model, machine learning, (13 more...)

#artificialintelligence

Industry: Health & Medicine > Therapeutic Area > Neurology (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

New DeepMind Approach 'Bootstraps' Self-Supervised Learning of Image Representations

#artificialintelligenceSep-20-2020, 02:10:22 GMT

The Cambridge Dictionary defines "bootstrap" as: "to improve your situation or become more successful, without help from others or without advantages that others have." While a machine learning algorithm's strength depends heavily on the quality of data it is fed, an algorithm that can do the work required to improve itself should become even stronger. A team of researchers from DeepMind and Imperial College recently set out to prove that in the arena of computer vision. In the updated paper Bootstrap Your Own Latent – A New Approach to Self-Supervised Learning, the researchers release the source code and checkpoint for their new "BYOL" approach to self-supervised image representation learning along with new theoretical and experimental insights. In computer vision, learning good image representations is critical as it allows for efficient training on downstream tasks. Image representation learning basically leverages neural networks that have been trained to produce good representations.

large language model, machine learning, natural language, (18 more...)

#artificialintelligence

Country: Asia > China (0.07)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.66)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)

Add feedback

AI Weekly: Cutting-edge language models can produce convincing misinformation if we don't stop them

#artificialintelligenceSep-19-2020, 17:35:14 GMT

It's been three months since OpenAI launched an API underpinned by cutting-edge language model GPT-3, and it continues to be the subject of fascination within the AI community and beyond. Portland State University computer science professor Melanie Mitchell found evidence that GPT-3 can make primitive analogies, and Columbia University's Raphaël Millière asked GPT-3 to compose a response to the philosophical essays written about it. But as the U.S. presidential election nears, there's growing concern among academics that tools like GPT-3 could be co-opted by malicious actors to foment discord by spreading misinformation, disinformation, and outright lies. In a paper published by the Middlebury Institute of International Studies' Center on Terrorism, Extremism, and Counterterrorism (CTEC), the coauthors find that GPT-3's strength in generating "informational," "influential" text could be leveraged to "radicalize individuals into violent far-right extremist ideologies and behaviors." Bots are increasingly being used around the world to sow the seeds of unrest, either through the spread of misinformation or the amplification of controversial points of view.

gpt-3, large language model, machine learning, (16 more...)

#artificialintelligence

Country:

Asia > Middle East > Saudi Arabia (0.15)
North America > United States > Kentucky (0.05)
North America > Cuba (0.05)
(8 more...)

Genre: Research Report > New Finding (0.71)

Industry:

Media > News (1.00)
Government > Voting & Elections (0.91)
Government > Regional Government > North America Government > United States Government (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering

Wang, Peifeng, Peng, Nanyun, Ilievski, Filip, Szekely, Pedro, Ren, Xiang

arXiv.org Artificial IntelligenceSep-19-2020

Commonsense question answering (QA) requires background knowledge which is not explicitly stated in a given context. Prior works use commonsense knowledge graphs (KGs) to obtain this knowledge for reasoning. However, relying entirely on these KGs may not suffice, considering their limited coverage and the contextual dependence of their knowledge. In this paper, we augment a general commonsense QA framework with a knowledgeable path generator. By extrapolating over existing paths in a KG with a state-of-the-art language model, our generator learns to connect a pair of entities in text with a dynamic, and potentially novel, multi-hop relational path. Such paths can provide structured evidence for solving commonsense questions without fine-tuning the path generator. Experiments on two datasets show the superiority of our method over previous works which fully rely on knowledge from KGs (with up to 6% improvement in accuracy), across various amounts of training data. Further evaluation suggests that the generated paths are typically interpretable, novel, and relevant to the task.

large language model, machine learning, question answering, (20 more...)

arXiv.org Artificial Intelligence

2005.00691

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > New Mexico (0.04)
Europe > Germany > Berlin (0.04)

Genre: Research Report (0.64)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.71)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)

Add feedback

Artificial intelligence called GPT-3 can write like a human but don't mistake that for thinking

#artificialintelligenceSep-18-2020, 14:01:00 GMT

… created by Elon Musk’s OpenAI, may be considered or appears to exhibit, something like artificial general intelligence (AGI), the ability to understand …

deep learning, large language model, machine learning, (4 more...)

#artificialintelligence

Industry: Media > News (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.98)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.53)

Add feedback

The Chess Transformer: Mastering Play using Generative Language Models

Noever, David, Ciolino, Matt, Kalin, Josh

arXiv.org Artificial IntelligenceSep-18-2020

This work demonstrates that natural language transformers can support more generic strategic modeling, particularly for text-archived games. In addition to learning natural language skills, the abstract transformer architecture can generate meaningful moves on a chessboard. With further fine-tuning, the transformer learns complex gameplay by training on 2.8 million chess games in Portable Game Notation. After 30,000 training steps, OpenAI's Generative Pre-trained Transformer (GPT-2) optimizes weights for 774 million parameters. This fine-tuned Chess Transformer generates plausible strategies and displays game formations identifiable as classic openings, such as English or the Slav Exchange. Finally, in live play, the novel model demonstrates a human-to-transformer interface that correctly filters illegal moves and provides a novel method to challenge the transformer's chess strategies. We anticipate future work will build on this transformer's promise, particularly in other strategy games where features can capture the underlying complex rule syntax from simple but expressive player annotations.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2008.04057

Genre: Research Report > Promising Solution (0.54)

Industry: Leisure & Entertainment > Games > Chess (1.00)

Technology: