AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

Co-training Improves Prompt-based Learning for Large Language Models

Lang, Hunter, Agrawal, Monica, Kim, Yoon, Sontag, David

arXiv.org Artificial IntelligenceFeb-1-2022

We demonstrate that co-training (Blum & Mitchell, 1998) can improve the performance of prompt-based learning by using unlabeled data. While prompting has emerged as a promising paradigm for few-shot and zero-shot learning, it is often brittle and requires much larger models compared to the standard supervised setup. We find that co-training makes it possible to improve the original prompt model and at the same time learn a smaller, downstream task-specific model. In the case where we only have partial access to a prompt model (e.g., output probabilities from GPT-3 (Brown et al., 2020)) we learn a calibration model over the prompt outputs. When we have full access to the prompt model's gradients but full finetuning remains prohibitively expensive (e.g., T0 (Sanh et al., 2021)), we learn a set of soft prompt continuous vectors to iteratively update the prompt model. We find that models trained in this manner can significantly improve performance on challenging datasets where there is currently a large gap between prompt-based learning and fully-supervised models.

hypothesis, iteration, soft prompt, (15 more...)

arXiv.org Artificial Intelligence

2202.00828

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Dominican Republic (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Causal Inference Principles for Reasoning about Commonsense Causality

Zhang, Jiayao, Zhang, Hongming, Roth, Dan, Su, Weijie J.

arXiv.org Artificial IntelligenceJan-31-2022

Commonsense causality reasoning (CCR) aims at identifying plausible causes and effects in natural language descriptions that are deemed reasonable by an average person. Although being of great academic and practical interest, this problem is still shadowed by the lack of a well-posed theoretical framework; existing work usually relies on deep language models wholeheartedly, and is potentially susceptible to confounding co-occurrences. Motivated by classical causal principles, we articulate the central question of CCR and draw parallels between human subjects in observational studies and natural languages to adopt CCR to the potential-outcomes framework, which is the first such attempt for commonsense tasks. We propose a novel framework, ROCK, to Reason O(A)bout Commonsense K(C)ausality, which utilizes temporal signals as incidental supervision, and balances confounding effects using temporal propensities that are analogous to propensity scores. The ROCK implementation is modular and zero-shot, and demonstrates good CCR capabilities on various datasets.

bathroom, homework, student, (13 more...)

arXiv.org Artificial Intelligence

2202.00436

Country:

North America > United States > Pennsylvania (0.04)
North America > United States > New York (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(5 more...)

Genre: Research Report > Experimental Study (0.45)

Industry:

Health & Medicine (1.00)
Consumer Products & Services (1.00)
Education (0.93)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)

Add feedback

2021: DeepMind's Miracle Year

#artificialintelligenceJan-29-2022, 21:57:50 GMT

This essay is a lightly edited version of the 37th issue of the ML4Sci newsletter. You can find more issues here and if you like what you see, feel free to sign up! Year-in-reviews always seemed like an appropriate mix of vogue and cliche, so I figured I'd take my stab at a slightly late two-part series. Google's DeepMind spent 2021 continuing to push out ground-breaking work applying AI to a variety of important scientific problems. This is perhaps one of the best groups to watch for Manhattan-style AI projects. Unrestrained by the incentives and limits of scientific academia (publish or perish, tenuous and short-term federal funding, high labor turnover i.e. grad students), DeepMind is quickly becoming the Bell Labs for AI.

ai model, deepmind, particle accelerator, (16 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.99)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)

Add feedback

人間と見分けがつかないほど自然な文章を書けるAI「GPT-3」の改良版AI「InstructGPT」一般公開、詩も執筆可能

#artificialintelligenceJan-28-2022, 22:44:10 GMT

文章生成AI「GPT-3」はオンライン掲示板で人間とバレずに1週間会話できるほど違和感の少ない文章を生成できることで知られており、Microsoftのプラットフォームに採用されるなど大きな注目を集めています。一方でGPT-3には反イスラム教的なバイアスが存在することが指摘されるなど、生成される文章に偏りがあることも分かっています。そんなGPT-3の学習モデルを改良して偏りを抑えつつ文章生成精度も向上させた文章生成AI「InstructGPT」の一般提供が2022年1月27日に始まりました。

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.63)

Add feedback

Andrea Rios Escudel on LinkedIn: AI in Metaverse, New GPT3 is less toxic, Fresh Examples of AI

#artificialintelligenceJan-28-2022, 15:43:33 GMT

First, financial regulators need to ensure that regulatory oversight delivers on the inclusion and intermediation-enhancing benefits of digital finance without compromising traditional regulatory goals such as financial stability, adequate competition, consumer protection and market integrity. Second, there is a pressing need for a system of data governance that allows consumers and business to exercise control over their data through the granting and withholding of consent to the use and transfer of their data. Developing a user-friendly granular consent-based data governance system with low transaction costs is a challenge that, when successfully addressed, will promote the development of virtual banking worldwide. Hong Kong SAR offers one example of an integrated regulatory framework for virtual banks. The licensing and regulatory regime – also applicable to incumbent banks – aims to manage the full spectrum of risks arising from any source, including the ownership structure, without compromising development objectives that often rest on technological innovation.

andrea rio escudel, fresh example, metaverse, (3 more...)

#artificialintelligence

Country: Asia > China > Hong Kong (0.27)

Industry:

Law (1.00)
Government (1.00)
Banking & Finance > Trading (0.79)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Communications > Social Media (0.85)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
(2 more...)

Add feedback

Artificial intelligence (AI): The most trending companies on Twitter in Q4 2021

#artificialintelligenceJan-28-2022, 15:43:12 GMT

Verdict has listed five companies that trended the most in Twitter discussions related to AI, using research from GlobalData's Technology Influencer platform. The top companies are the most mentioned companies among Twitter discussions of more than 150 AI experts tracked by GlobalData's Technology Influencer platform during the fourth quarter (Q4) of 2021. Google discovering that AI becomes more aggressive as it becomes more advanced, Google using machine learning to improve chip design, and a new AI developed by Google interpreting and reading sign language aloud were some of the popular discussions on Alphabet Inc in Q4 2021. Mario Pawlowski, CEO of trucking industry news and technology website iTrucker, shared a video on how AI becomes more aggressive as it becomes more advanced. Researchers at technology company Google's AI research company DeepMind conducted a study on AI by developing an AI video game called Gathering.

google, microsoft, popular discussion, (12 more...)

#artificialintelligence

Country:

North America > United States > California > Santa Clara County > Santa Clara (0.06)
North America > United States > Washington > King County > Redmond (0.05)
North America > United States > New York (0.05)
North America > United States > California > Santa Clara County > Mountain View (0.05)

Industry:

Leisure & Entertainment > Games > Computer Games (0.36)
Information Technology > Services (0.35)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Add feedback

Can Wikipedia Help Offline Reinforcement Learning?

Reid, Machel, Yamada, Yutaro, Gu, Shixiang Shane

arXiv.org Artificial IntelligenceJan-28-2022

Fine-tuning reinforcement learning (RL) models has been challenging because of a lack of large scale off-the-shelf datasets as well as high variance in transferability among different environments. Recent work has looked at tackling offline RL from the perspective of sequence modeling with improved results as result of the introduction of the Transformer architecture. However, when the model is trained from scratch, it suffers from slow convergence speeds. In this paper, we look to take advantage of this formulation of reinforcement learning as sequence modeling and investigate the transferability of pre-trained sequence models on other domains (vision, language) when finetuned on offline RL tasks (control, games). To this end, we also propose techniques to improve transfer between these domains. Results show consistent performance gains in terms of both convergence speed and reward on a variety of environments, accelerating training by 3-6x and achieving state-of-the-art performance in a variety of tasks using Wikipedia-pretrained and GPT2 language models. We hope that this work not only brings light to the potentials of leveraging generic sequence modeling techniques and pre-trained models for RL, but also inspires future work on sharing knowledge between generative modeling tasks of completely different domains.

arxiv preprint arxiv, language model, wikipedia help offline reinforcement learning, (8 more...)

arXiv.org Artificial Intelligence

2201.12122

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Europe > Germany > Berlin (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.92)

Add feedback

GitHub - deepmind/xmanager

#artificialintelligenceJan-27-2022, 18:30:40 GMT

XManager is a platform for packaging, running and keeping track of machine learning experiments. It currently enables one to launch experiments locally or on Google Cloud Platform (GCP). Interaction with experiments is done via XManager's APIs through Python launch scripts. To get started, install XManager, its prerequisites if needed and follow the tutorial or codelab.ipynb to create and run a launch script. Or, alternatively, a PyPI project is also available.

experiment, specification, xmanager, (13 more...)

#artificialintelligence

Industry: Information Technology > Services (0.62)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

The new version of GPT-3 is much better behaved (and should be less toxic)

MIT Technology ReviewJan-27-2022, 17:00:00 GMT

Large language models like GPT-3 are trained using vast bodies of text, much it taken from the internet, in which they encounter the best and worst of what people put down in words. That is a problem for today's chatbots and text-generation tools. The models soak up toxic language--from text that is racist and misogynistic or that contains more insidious, baked-in prejudices--as well as falsehoods. OpenAI has made IntructGPT the default model for users of its application programming interface (API)--a service that gives access to the company's language models for a fee. GPT-3 will still be available but OpenAI does not recommend using it.

gpt-3, new version, openai, (1 more...)

MIT Technology Review

Industry: Media > News (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.98)

Add feedback

OpenAI rolls out new text-generating models that it claims are less toxic

#artificialintelligenceJan-27-2022, 16:25:37 GMT

Did you miss a session from the Future of Work Summit? Large language models (LLMs) such as OpenAI's GPT-3, which can "write" sentences that read nearly like they were written by a human, can be prompted to perform a range of writing tasks given only a few examples of the tasks. For example, LLMs have been used to create marketing materials and video game levels in addition to recipes, poetry, and movie scripts. But because LLMs learn to write from examples taken from sometimes toxic communities, they can fall victim to parroting misinformation, sexism, ageism, racism, and conspiracies. Efforts have been made to combat toxicity in LLMs -- with mixed results.

gpt-3, instructgpt, openai, (15 more...)

#artificialintelligence

Country: Europe > France (0.05)

Genre: Summary/Review (0.40)

Industry:

Media > News (0.35)
Leisure & Entertainment > Games (0.35)
Law > Civil Rights & Constitutional Law (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.74)

Add feedback