AITopics

2107.07498

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)

#artificialintelligence, Jul-14-2021, 18:40:10 GMT

AI as New Electricity?

Until April 2020, GPT-2 was the king of AI, with its stunning 1.5B parameters. It is not easy to work with. It takes 6 GB on your disk, but that's not the problem. The problem is processing speed: you have to wait several minutes for a single inference running on the CPU. With a GPU it would be at least ten times faster, provided you have an NVIDIA GPU with at least 24 GB of video RAM.
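The 6 GB disk figure quoted in this excerpt is consistent with storing 1.5 billion parameters as 32-bit floats. A minimal sketch of that arithmetic follows, assuming 4 bytes per parameter and decimal gigabytes; the storage precision is an assumption, not something the article states, and `model_size_gb` is a hypothetical helper name.

```python
def model_size_gb(num_params: int, bytes_per_param: int = 4) -> float:
    """Approximate size of the raw weights in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


GPT2_PARAMS = 1_500_000_000  # 1.5B parameters, as quoted in the excerpt

size_fp32 = model_size_gb(GPT2_PARAMS)     # assumed float32 storage
size_fp16 = model_size_gb(GPT2_PARAMS, 2)  # hypothetical half-precision copy

print(f"float32: {size_fp32:.1f} GB, float16: {size_fp16:.1f} GB")
```

Under these assumptions the weights alone account for the quoted size; activations and framework overhead at inference time would come on top of that.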

consequence, gpt-3, new electricity, (6 more...)

Country: Asia > China (0.05)

Industry: Information Technology (0.37)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.42)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.42)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.42)

#artificialintelligence, Jul-14-2021, 13:30:16 GMT

OpenAI warns AI behind GitHub's Copilot may be susceptible to bias

Last month, GitHub and OpenAI launched Copilot, a service that provides suggestions for whole lines of code inside development environments like Microsoft Visual Studio. Copilot is powered by an AI model called Codex that's trained on billions of lines of public code, and the companies claim Copilot works with a broad set of frameworks and languages and adapts to the edits developers make, matching their coding styles. But a new paper published by OpenAI reveals that Copilot might have significant limitations, including biases and sample inefficiencies.

codex, copilot, openai, (12 more...)

Country: North America > United States > Massachusetts (0.05)

Genre: Research Report (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

arXiv.org Artificial Intelligence, Jul-14-2021

Zero-shot Visual Question Answering using Knowledge Graph

Chen, Zhuo, Chen, Jiaoyan, Geng, Yuxia, Pan, Jeff Z., Yuan, Zonggang, Chen, Huajun

Incorporating external knowledge into Visual Question Answering (VQA) has become a vital practical need. Existing methods mostly adopt pipeline approaches with different components for knowledge matching and extraction, feature learning, etc. However, such pipeline approaches suffer when some component does not perform well, which leads to error propagation and poor overall performance. Furthermore, the majority of existing approaches ignore the answer bias issue: many answers may never have appeared during training (i.e., unseen answers) in real-world applications. To bridge these gaps, in this paper, we propose a Zero-shot VQA algorithm using knowledge graphs and a mask-based learning mechanism for better incorporating external knowledge, and present new answer-based Zero-shot VQA splits for the F-VQA dataset. Experiments show that our method can achieve state-of-the-art performance in Zero-shot VQA with unseen answers, while dramatically augmenting existing end-to-end models on the normal F-VQA task.

dataset, external knowledge, knowledge, (17 more...)

2107.05348

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Norway (0.04)
Asia > China > Zhejiang Province > Hangzhou (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.72)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.64)

Bayer, Markus, Kaufhold, Marc-André, Reuter, Christian

A Survey on Data Augmentation for Text Classification

arXiv.org Artificial Intelligence, Jul-14-2021

Data augmentation, the artificial creation of training data for machine learning by transformations, is a widely studied research field across machine learning disciplines. While it is useful for increasing the generalization capabilities of a model, it can also address many other challenges and problems, from overcoming a limited amount of training data, to regularizing the objective, to limiting the amount of data used in order to protect privacy. Based on a precise description of the goals and applications of data augmentation (C1) and a taxonomy for existing works (C2), this survey is concerned with data augmentation methods for textual classification and aims to provide a concise and comprehensive overview for researchers and practitioners (C3). Derived from the taxonomy, we divide more than 100 methods into 12 different groupings and provide state-of-the-art references expounding which methods are highly promising (C4). Finally, research perspectives that may constitute a building block for future work are given (C5).
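To make "creation of training data by transformations" concrete, here is a minimal sketch of two simple word-level transformations, random deletion and random swap, applied to a labeled sentence. These are generic illustrations rather than methods taken from the survey, and the function names, probabilities, and example sentence are all hypothetical.

```python
import random


def random_deletion(tokens, p, rng):
    """Drop each token with probability p, keeping at least one token."""
    if not tokens:
        return []
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]


def random_swap(tokens, n_swaps, rng):
    """Swap two randomly chosen positions, n_swaps times."""
    out = list(tokens)
    if len(out) < 2:
        return out
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(out)), 2)
        out[i], out[j] = out[j], out[i]
    return out


def augment(sentence, label, n_copies=3, seed=0):
    """Return n_copies perturbed (text, label) pairs; the label is unchanged."""
    rng = random.Random(seed)  # seeded so runs are reproducible
    tokens = sentence.split()
    examples = []
    for _ in range(n_copies):
        perturbed = random_swap(random_deletion(tokens, 0.1, rng), 1, rng)
        examples.append((" ".join(perturbed), label))
    return examples


for text, label in augment("the film was surprisingly good", "positive"):
    print(label, "|", text)
```

Each generated pair keeps the original label, which is the working assumption behind label-preserving augmentation; whether a given transformation really preserves the label depends on the task.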

augmentation, augmentation method, data augmentation, (14 more...)

2107.03158

Country:

Europe > United Kingdom (0.14)
North America > United States > Texas (0.14)
Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)
(2 more...)

Genre:

Overview (1.00)
Summary/Review (0.92)
Research Report > New Finding (0.46)
Research Report > Promising Solution (0.45)

Industry: Information Technology > Security & Privacy (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

arXiv.org Artificial Intelligence, Jul-13-2021

How Much Can CLIP Benefit Vision-and-Language Tasks?

Shen, Sheng, Li, Liunian Harold, Tan, Hao, Bansal, Mohit, Rohrbach, Anna, Chang, Kai-Wei, Yao, Zhewei, Keutzer, Kurt

Most existing Vision-and-Language (V&L) models rely on pre-trained visual encoders, using a relatively small set of manually-annotated data (as compared to web-crawled data), to perceive the visual world. However, it has been observed that large-scale pretraining usually can result in better generalization performance, e.g., CLIP (Contrastive Language-Image Pre-training), trained on a massive amount of image-caption pairs, has shown a strong zero-shot capability on various vision tasks. To further study the advantage brought by CLIP, we propose to use CLIP as the visual encoder in various V&L models in two typical scenarios: 1) plugging CLIP into task-specific fine-tuning; 2) combining CLIP with V&L pre-training and transferring to downstream tasks. We show that CLIP significantly outperforms widely-used visual encoders trained with in-domain annotated data, such as BottomUp-TopDown. We achieve competitive or better results on diverse V&L tasks, while establishing new state-of-the-art results on Visual Question Answering, Visual Entailment, and V&L Navigation tasks. We release our code at https://github.com/clip-vil/CLIP-ViL.

dataset, proceedings, visual encoder, (13 more...)

2107.06383

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
South America > Brazil > Paraná > Curitiba (0.04)
North America > United States > North Carolina (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)

Van Vaerenbergh, Steven, Pérez-Suay, Adrián

A Classification of Artificial Intelligence Systems for Mathematics Education

arXiv.org Artificial Intelligence, Jul-13-2021

This chapter provides an overview of the different Artificial Intelligence (AI) systems that are being used in contemporary digital tools for Mathematics Education (ME). It is aimed at researchers in AI and Machine Learning (ML), for whom we shed some light on the specific technologies that are being used in educational applications; and at researchers in ME, for whom we clarify: i) what the possibilities of the current AI technologies are, ii) what is still out of reach and iii) what is to be expected in the near future. We start our analysis by establishing a high-level taxonomy of AI tools that are found as components in digital ME applications. Then, we describe in detail how these AI tools, and in particular ML, are being used in two key applications, specifically AI-based calculators and intelligent tutoring systems. We finish the chapter with a discussion about student modeling systems and their relationship to artificial general intelligence.

artificial intelligence, mathematics education, van vaerenbergh, (12 more...)

2107.06015

Country:

North America > United States > New York (0.04)
Oceania > Australia > Western Australia > Perth (0.04)
North America > Canada > Quebec > Montreal (0.04)
(3 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Education > Educational Setting (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.87)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.93)
(5 more...)

Bhatt, Gaurav, Chandhok, Shivam, Balasubramanian, Vineeth N

Learn from Anywhere: Rethinking Generalized Zero-Shot Learning with Limited Supervision

arXiv.org Artificial Intelligence, Jul-13-2021

A common problem with most zero- and few-shot learning approaches is that they suffer from bias towards seen classes, resulting in sub-optimal performance. Existing efforts aim to utilize unlabeled images from unseen classes (i.e., transductive zero-shot) during training to enable generalization. However, this limits their use in practical scenarios where data from target unseen classes is unavailable or infeasible to collect. In this work, we present a practical setting of inductive zero- and few-shot learning, where unlabeled images from other out-of-data classes, which do not belong to seen or unseen categories, can be used to improve generalization in any-shot learning. We leverage a formulation based on product-of-experts and introduce a new AUD module that enables us to use unlabeled samples from out-of-data classes, which are usually easily available and entail practically no annotation cost. In addition, we demonstrate the applicability of our model to a more practical and challenging setting, Generalized Zero-shot under limited supervision, where even base seen classes do not have sufficient annotated samples.

aud, dataset, learning, (12 more...)

2107.04952

Country: North America > United States > California (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

#artificialintelligence, Jul-12-2021, 11:16:42 GMT

Can AI learn from any public code online?

Just days after GitHub announced its new Copilot tool, which generates complementary code for programmers' projects, web developer Kyle Peacock tweeted an oddity he had noticed. "I love to learn new things and build things," the algorithm wrote, when asked to generate an About Me page. While the About Me page was supposedly generated for a fake person, that link goes to the GitHub profile of David Celis, who The Verge can confirm is not a figment of Copilot's imagination. Celis is a coder and GitHub user with popular repositories, and even formerly worked at the company. "I'm not surprised that my public repositories are a part of the training data for Copilot," Celis told The Verge, adding that he was amused by the algorithm reciting his name.

algorithm, fair use, github, (14 more...)

Country: North America > United States > Texas (0.05)

Industry: Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.51)

#artificialintelligence, Jul-12-2021, 06:10:22 GMT

GitHub's new tool uses AI to craft code. Some developers are furious

Copilot launched last week in an invite-only Technical Preview, promising to save time by responding to users' code with its own smart suggestions. Those suggestions are based on billions of lines of code that users have publicly contributed to GitHub, using an AI system called Codex from the research company OpenAI. GitHub describes Copilot as the AI equivalent of pair programming, in which two developers work together at a single computer. The idea is that one developer can bring new ideas or spot problems that the other developer might have missed, even if it requires more person-hours to do so.

developer, github, new tool use ai, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.67)