AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

Create a Text Generation Web App with 100% Python (NLP)

#artificialintelligenceJun-12-2021, 01:43:17 GMT

Create a Text Generation Web App with 100% Python (NLP) - Harness GPT-Neo -- a natural language processing (NLP) text generation model. Demonstrate it with a 100% Python web app Created by Vennify Inc., Eric FillionPreview this Course - GET COUPON CODE GPT-3 is a state-of-the-art text generation natural language processing (NLP) model created by OpenAI. You can use it to generate text that resembles text generated by a human. This course will cover how to create a web app that uses an open-source version of GPT-3 called GPT-Neo with 100% Python. That's right, no HTML, Javascript, CSS or any other programming language is required.

google colab, gpt-neo, python, (12 more...)

#artificialintelligence

Country: North America > Canada (0.07)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Information Technology > Software (1.00)
Education > Educational Setting > Online (0.77)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

CoTexT: Multi-task Learning with Code-Text Transformer

Phan, Long, Tran, Hieu, Le, Daniel, Nguyen, Hieu, Anibal, James, Peltekian, Alec, Ye, Yanfang

arXiv.org Artificial IntelligenceJun-12-2021

We present CoTexT, a pre-trained, transformer-based encoder-decoder model that learns the representative context between natural language (NL) and programming language (PL). Using self-supervision, CoTexT is pre-trained on large programming language corpora to learn a general understanding of language and code. CoTexT supports downstream NL-PL tasks such as code summarizing/documentation, code generation, defect detection, and code debugging. We train CoTexT on different combinations of available PL corpus including both "bimodal" and "unimodal" data. Here, bimodal data is the combination of text and corresponding code snippets, whereas unimodal data is merely code snippets. We first evaluate CoTexT with multi-task learning: we perform Code Summarization on 6 different programming languages and Code Refinement on both small and medium size featured in the CodeXGLUE dataset. We further conduct extensive experiments to investigate CoTexT on other tasks within the CodeXGlue dataset, including Code Generation and Defect Detection. We consistently achieve SOTA results in these tasks, demonstrating the versatility of our models.

cotext, dataset, programming language, (13 more...)

arXiv.org Artificial Intelligence

2105.08645

Country:

North America > United States > Ohio (0.04)
Asia > Vietnam (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

AI and Misinformation: How Artificial Intelligence Works on Both Sides

#artificialintelligenceJun-11-2021, 06:35:16 GMT

One of the growing problems today is misinformation: the proliferation of fake news and misleading content across social media platforms. While artificial intelligence (AI) helps in its spread, there has been growing proof of how it can be used to curb this problem. However, more than just the daily news article, misinformation has far-reaching - and often fearsome - implications in more critical fields such as cybersecurity, public safety, medicine, and even science. In fact, there have been published collaborative papers, one appearing in the April 2021 issue of PNAS, tackling misinformation as a result of common human biases and prevailing practices in the critique and release of scientific papers. This even includes respected, peer-reviewed journals.

ai and misinformation, artificial intelligence work, misinformation, (8 more...)

#artificialintelligence

Country:

North America > United States > Maryland > Baltimore (0.07)
North America > United States > California > San Francisco County > San Francisco (0.06)

Industry:

Media > News (1.00)
Government > Military > Cyberwarfare (0.42)
Health & Medicine > Therapeutic Area > Immunology (0.33)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Meet Wu Dao 2.0, the Chinese AI model making the West sweat

#artificialintelligenceJun-11-2021, 03:10:06 GMT

A new artificial intelligence model developed by Chinese researchers is performing untold feats with image creation and natural language processing -- making rivals in Europe and the U.S. nervous about falling behind. The model, dubbed Wu Dao 2.0, is able to understand everything people say -- the grammar too -- but can also recognize images and generate realistic pictures based on descriptions. It can also write essays and poems in traditional Chinese, as well as predict the 3D structures of proteins, POLITICO'S AI: Decoded reported. Developed by the government-funded Beijing Academy of Artificial Intelligence and unveiled last week, Wu Dao 2.0 appears to be among the world's most sophisticated AI language models. Wu Dao 2.0's creators say it's 10 times more powerful than its closest rival GPT-3, developed by the U.S. firm OpenAI.

china, language model, wu dao 2, (12 more...)

#artificialintelligence

Country:

North America > United States (0.31)
Asia > China > Beijing > Beijing (0.25)
Europe > Germany (0.06)
(7 more...)

Industry: Government > Regional Government > Europe Government (0.32)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.56)

Add feedback

DeepMind says reinforcement learning is 'enough' to reach general AI

#artificialintelligenceJun-10-2021, 18:25:54 GMT

In their decades-long chase to create artificial intelligence, computer scientists have designed and developed all kinds of complicated mechanisms and technologies to replicate vision, language, reasoning, motor skills, and other abilities associated with intelligent life. While these efforts have resulted in AI systems that can efficiently solve specific problems in limited environments, they fall short of developing the kind of general intelligence seen in humans and animals. In a new paper submitted to the peer-reviewed Artificial Intelligence journal, scientists at U.K.-based AI lab DeepMind argue that intelligence and its associated abilities will emerge not from formulating and solving complicated problems but by sticking to a simple but powerful principle: reward maximization. Titled "Reward is Enough," the paper, which is still in pre-proof as of this writing, draws inspiration from studying the evolution of natural intelligence as well as drawing lessons from recent achievements in artificial intelligence. The authors suggest that reward maximization and trial-and-error experience are enough to develop behavior that exhibits the kind of abilities associated with intelligence.

agent, intelligence, reinforcement, (14 more...)

#artificialintelligence

Country: North America > United States > California > San Diego County > San Diego (0.04)

Genre: Research Report > New Finding (0.91)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.59)

Add feedback

OpenAI claims to have mitigated bias and toxicity in GPT-3

#artificialintelligenceJun-10-2021, 16:40:14 GMT

In a study published today, OpenAI, the lab best known for its research on large language models, claims it's discovered a way to improve the "behavior" of language models with respect to ethical, moral, and societal values. The approach, OpenAI says, can give developers the tools to dictate the tone and personality of a model depending on the prompt that the model's given. Despite the potential of natural language models like GPT-3, many blockers exist. The models can't always answer math problems correctly or respond to questions without paraphrasing training data, and it's well-established that they amplify the biases in data on which they were trained. That's problematic in the language domain, because a portion of the data is often sourced from communities with pervasive gender, race, and religious prejudices.

dataset, gpt-3, openai, (16 more...)

#artificialintelligence

Country: North America > United States > California > Alameda County > Berkeley (0.05)

Genre: Research Report (0.50)

Industry: Law > Civil Rights & Constitutional Law (0.71)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.91)

Add feedback

The Little Question I Forgot to Ask Myself to Future-Proof My Work

#artificialintelligenceJun-10-2021, 11:30:07 GMT

I've been writing a few articles in the last months where I've tackled the subject of artificial intelligence (AI) and its incorporation into digital business processes and our daily life. As I was carrying out my search, I came across some resources about the usage of AI to produce art, like painting and music. By letting machines learn from the human artistic work, Artificial Intelligence Virtual Artists like AIVA can compose classical and symphonic music. Today, AIVA's YouTube channel has over 18K subscribers. In her post "Top 10 AI Music Composers in 2021," Lisa Brown has listed more examples of non-human music composers.

future-proof, kuki, lot happier, (13 more...)

#artificialintelligence

Country:

North America > United States > California (0.05)
Asia > China (0.05)

Industry:

Leisure & Entertainment (1.00)
Media > Music (0.76)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.60)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

[NDP7] - Machine Learning and What do we do with people's comments?

#artificialintelligenceJun-10-2021, 06:25:37 GMT

We've always thought of this podcast as a dialogue between us and you, so we're shooting for more interactivity! From now on, we're not recording in advance anymore, so that we can answer all the comments you leave us here, on twitter or at [notdailypodcast@gmail.com](mailto:notdailypodcast@gmail.com). This episode is mainly focused on addressing all the comments we've received from episode 1 to 6 so we can start this format from a clean slate! But before that, Yoann introduces us to the GPT2 machine learning algorithm that he trained on a corpus of his writings. You can find all the details in his blog post, and Vlad's reactions in this episode!

machine learning, notdailypodcast, people, (1 more...)

#artificialintelligence

Industry: Media > News (0.40)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.34)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Programming Puzzles

Schuster, Tal, Kalyan, Ashwin, Polozov, Oleksandr, Kalai, Adam Tauman

arXiv.org Artificial IntelligenceJun-10-2021

We introduce a new type of programming challenge called programming puzzles, as an objective and comprehensive evaluation of program synthesis, and release an open-source dataset of Python Programming Puzzles (P3). Each puzzle is defined by a short Python program $f$, and the goal is to find an input $x$ which makes $f$ output "True". The puzzles are objective in that each one is specified entirely by the source code of its verifier $f$, so evaluating $f(x)$ is all that is needed to test a candidate solution $x$. They do not require an answer key or input/output examples, nor do they depend on natural language understanding. The dataset is comprehensive in that it spans problems of a range of difficulties and domains, ranging from trivial string manipulation problems that are immediately obvious to human programmers (but not necessarily to AI), to classic programming puzzles (e.g., Towers of Hanoi), to interview/competitive-programming problems (e.g., dynamic programming), to longstanding open problems in algorithms and mathematics (e.g., factoring). The objective nature of P3 readily supports self-supervised bootstrapping. We develop baseline enumerative program synthesis and GPT-3 solvers that are capable of solving easy puzzles -- even without access to any reference solutions -- by learning from their own past solutions. Based on a small user study, we find puzzle difficulty to correlate between human programmers and the baseline AI solvers.

bootstrap, puzzle, sol, (16 more...)

arXiv.org Artificial Intelligence

2106.05784

Country:

Asia > Vietnam > Hanoi > Hanoi (0.24)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Games (1.00)
Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.92)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Pieter Abbeel Team's Decision Transformer Abstracts RL as Sequence Modelling

#artificialintelligenceJun-9-2021, 21:50:13 GMT

Their proposed Decision Transformer outputs optimal actions by leveraging a causally masked transformer and can generate future actions with desired returns. Moreover, despite Decision Transformer's relative simplicity, the proposed framework matches or outperforms the performance of state-of-the-art model-free offline RL baselines on Atari, OpenAI Gym, and Key-to-Door tasks. Transformer architectures are able to efficiently model sequential data, and their self-attention mechanism allows the layer to assign "credit" by implicitly forming state-return associations via maximizing the dot product of the query and key vectors. Transformers can thus function effectively in the presence of sparse or distracting rewards. Previous studies have also shown that transformers can model a wide distribution of behaviours, enabling better generalization and transfer abilities.

decision transformer, sequence, transformer, (11 more...)

#artificialintelligence

Genre: Research Report (0.71)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback