AITopics

2104.10649

Country:

Asia > China > Beijing > Beijing (0.07)
Asia > China > Hong Kong (0.05)
North America > United States > Oregon > Multnomah County > Portland (0.04)
Asia > China > Liaoning Province > Dalian (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Wahle, Jan Philip, Ruas, Terry, Meuschke, Norman, Gipp, Bela

Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection

arXiv.org Artificial IntelligenceMar-23-2021

The rise of language models such as BERT allows for high-quality text paraphrasing. This is a problem to academic integrity, as it is difficult to differentiate between original and machine-generated content. We propose a benchmark consisting of paraphrased articles using recent language models relying on the Transformer architecture. Our contribution fosters future research of paraphrase detection systems as it offers a large collection of aligned original and paraphrased documents, a study regarding its structure, classification experiments with state-of-the-art systems, and we make our findings publicly available.

arxiv, dataset, language model, (13 more...)

2103.1245

Country:

Europe > Germany (0.05)
Europe > Czechia > South Moravian Region > Brno (0.04)
Asia (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Education (0.90)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.48)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

arXiv.org Artificial IntelligenceMar-22-2021

Cooperative Learning of Zero-Shot Machine Reading Comprehension

Luo, Hongyin, Li, Shang-Wen, Yu, Seunghak, Glass, James

Pretrained language models have significantly improved the performance of down-stream language understanding tasks, including extractive question answering, by providing high-quality contextualized word embeddings. However, learning question answering models still need large-scaled data annotation in specific domains. In this work, we propose a cooperative, self-play learning framework, REGEX, for question generation and answering. REGEX is built upon a masked answer extraction task with an interactive learning environment containing an answer entity REcognizer, a question Generator, and an answer EXtractor. Given a passage with a masked entity, the generator generates a question around the entity, and the extractor is trained to extract the masked entity with the generated question and raw texts. The framework allows the training of question generation and answering models on any text corpora without annotation. We further leverage a reinforcement learning technique to reward generating high-quality questions and to improve the answer extraction model's performance. Experiment results show that REGEX outperforms the state-of-the-art (SOTA) pretrained language models and zero-shot approaches on standard question-answering benchmarks, and yields the new SOTA performance under the zero-shot setting.

answer entity, artificial intelligence, natural language, (19 more...)

2103.07449

Country:

Europe (0.46)
South America > Uruguay (0.14)
North America > United States > New York (0.14)
(2 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Education > Assessment & Standards > Student Performance (0.41)
Energy > Oil & Gas (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

#artificialintelligenceMar-19-2021, 06:10:32 GMT

Okay, the GPT-3 hype seems pretty reasonable – TechCrunch

This morning TechCrunch covered an interesting round for Copy.ai, a startup that employs GPT-3 to help other companies with their writing projects. GPT-3, or Generative Pre-trained Transformer 3, is a piece of AI from the OpenAI group that takes text from the user, and writes a lot more for them. As part of the process of covering the Copy.ai I've long been more curious than afraid of automated writing. So when the Copy team described their very positive impressions of the GPT-3 AI writing tool to TechCrunch during an interview, I was intrigued.

gpt-3, headlime, techcrunch, (2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceMar-19-2021, 04:25:05 GMT

Extra Crunch roundup: Coupang and Roblox debut, driving GPT-3 adoption, startup how-tos, more – TechCrunch

Extra Crunch publishes a variety of article types, but how-tos are my favorite category. For many entrepreneurs, the startup they are trying to get off the ground might be only the second entry on their resume. As a result, they don't have much experience to draw from when it comes to basics like hiring, fundraising and growth marketing. Last week, Natasha Mascarenhas interviewed experts who had some strategic advice for finding the right time to bring a product manager on board. This afternoon, we published a guest post by growth marketer Jessica Li with tips for "how nontechnical talent can build relationships with deep tech companies."

getty image, image credit, new window, (11 more...)

Country:

North America > United States (0.15)
Europe (0.05)
Asia > South Korea (0.05)
Asia > India (0.05)

Industry:

Information Technology (0.70)
Transportation > Ground > Road (0.48)
Transportation > Electric Vehicle (0.48)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.52)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.52)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

#artificialintelligenceMar-18-2021, 02:35:11 GMT

OpenAI's Sam Altman: Artificial Intelligence will generate enough wealth to pay each adult $13,500 a year

Artificial intelligence will create so much wealth that every adult in the United States could be paid $13,500 per year from its windfall as soon as 10 years from now. So says Sam Altman, co-founder and president of San Francisco-headquartered, artificial intelligence-focused nonprofit OpenAI. "My work at OpenAI reminds me every day about the magnitude of the socioeconomic change that is coming sooner than most people believe," Altman, who posted Tuesday. "Software that can think and learn will do more and more of the work that people now do." Altman calls it an "AI revolution," and compares it in magnitude to the agricultural, industrial and computational technological revolutions.

altman, artificial intelligence, sam altman, (7 more...)

Country: North America > United States > California > San Francisco County > San Francisco (0.25)

Industry:

Government (0.51)
Banking & Finance > Trading (0.31)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.83)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.83)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.83)

#artificialintelligenceMar-18-2021, 00:50:49 GMT

Adventures with AI: Here's what happened when I ate a three course meal designed by artificial intelligence

Welcome to Adventures with AI, a column exploring what happens when artificial intelligence takes control of everyday tasks. Eating out is one of my great pleasures; cooking is not. Unfortunately, since the onset of the COVID-19 pandemic, I've been doing a lot of the latter and almost none of the former. Preparing meals has become paricularly tedious during London's latest lockdown. So like an unhappy couple in a sexless marriage, I've been trying to spice things up in my domestic life.

gpt-3, recipe, vegetable, (15 more...)

Country: North America > United States (0.30)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.50)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

#artificialintelligenceMar-18-2021, 00:50:09 GMT

The key to making AI green is quantum computing

We've painted ourselves into another corner with artificial intelligence. We're finally starting to breakthrough the usefulness barrier but we're butting up against the limits of our our ability to responsibly meet our machines' massive energy requirements. At the current rate of growth, it appears we'll have to turn Earth into Coruscant if we want to keep spending unfathomable amounts of energy training systems such as GPT-3 . The problem: Simply put, AI takes too much time and energy to train. A layperson might imagine a bunch of code on a laptop screen when they think about AI development, but the truth is that many of the systems we use today were trained on massive GPU networks, supercomputers, or both.

ai green, ai system, gpt-3, (5 more...)

Country:

North America > United States > New York (0.05)
Europe > Austria > Vienna (0.05)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.41)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.41)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.41)

arXiv.org Artificial IntelligenceMar-18-2021

All NLP Tasks Are Generation Tasks: A General Pretraining Framework

Du, Zhengxiao, Qian, Yujie, Liu, Xiao, Ding, Ming, Qiu, Jiezhong, Yang, Zhilin, Tang, Jie

There have been various types of pretraining architectures including autoregressive models (e.g., GPT), autoencoding models (e.g., BERT), and encoder-decoder models (e.g., T5). On the other hand, NLP tasks are different in nature, with three main categories being classification, unconditional generation, and conditional generation. However, none of the pretraining frameworks performs the best for all tasks, which introduces inconvenience for model development and selection. We propose a novel pretraining framework GLM (General Language Model) to address this challenge. Compared to previous work, our architecture has three major benefits: (1) it performs well on classification, unconditional generation, and conditional generation tasks with one single pretrained model; (2) it outperforms BERT-like models on classification due to improved pretrain-finetune consistency; (3) it naturally handles variable-length blank filling which is crucial for many downstream tasks. Empirically, GLM substantially outperforms BERT on the SuperGLUE natural language understanding benchmark with the same amount of pre-training data. Moreover, GLM with 1.25x parameters of BERT-Large achieves the best performance in NLU, conditional and unconditional generation at the same time, which demonstrates its generalizability to different downstream tasks.

glm, objective, span, (15 more...)

2103.1036

Country:

North America > United States > Wyoming (0.05)
North America > United States > Michigan (0.05)
North America > United States > California > Los Angeles County > Los Angeles (0.04)
(6 more...)

Genre: Research Report (0.50)

Industry:

Leisure & Entertainment > Sports > Football (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Media > Television (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

#artificialintelligenceMar-17-2021, 16:20:18 GMT

TRIC -- Transformer-based Relative Image Captioning

This blog post describes the TRIC model -- an architecture for Relative Image Captioning task that was created as a part of my Master Thesis. All of them are described in my thesis in a pretty concise way so I highly recommend it -- you can find a link right below. But if you want to check them from another source it is also covered. To each of the topics listed above, I have attached a link to my personal favorite resource concerning this particular subject. Earlier this month I defended my master's thesis in Computer Science at the Warsaw University of Technology.

architecture, caption, image captioning, (17 more...)

Country: Europe > Poland > Masovia Province > Warsaw (0.24)

Industry: Information Technology > Services (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.85)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.51)