AITopics | Overview

Collaborating Authors

Overview

A Survey on Transfer Learning in Natural Language Processing

Alyafeai, Zaid, AlShaibani, Maged Saeed, Ahmad, Irfan

arXiv.org Machine LearningMay-31-2020

Deep learning models usually require a huge amount of data. However, these large datasets are not always attainable. This is common in many challenging NLP tasks. Consider Neural Machine Translation, for instance, where curating such large datasets may not be possible specially for low resource languages. Another limitation of deep learning models is the demand for huge computing resources. These obstacles motivate research to question the possibility of knowledge transfer using large trained models. The demand for transfer learning is increasing as many large models are emerging. In this survey, we feature the recent transfer learning advances in the field of NLP. We also provide a taxonomy for categorizing different transfer learning approaches from the literature.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Machine Learning

2007.04239

Country: Asia > Middle East > Saudi Arabia > Eastern Province > Dhahran (0.14)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

AI Research Considerations for Human Existential Safety (ARCHES)

Critch, Andrew, Krueger, David

arXiv.org Artificial IntelligenceMay-29-2020

Framed in positive terms, this report examines how technical AI research might be steered in a manner that is more attentive to humanity's long-term prospects for survival as a species. In negative terms, we ask what existential risks humanity might face from AI development in the next century, and by what principles contemporary technical research might be directed to address those risks. A key property of hypothetical AI technologies is introduced, called \emph{prepotence}, which is useful for delineating a variety of potential existential risks from artificial intelligence, even as AI paradigms might shift. A set of \auxref{dirtot} contemporary research \directions are then examined for their potential benefit to existential safety. Each research direction is explained with a scenario-driven motivation, and examples of existing work from which to build. The research directions present their own risks and benefits to society that could occur at various scales of impact, and in particular are not guaranteed to benefit existential safety if major developments in them are deployed without adequate forethought and oversight. As such, each direction is accompanied by a consideration of potentially negative side effects.

machine learning, reinforcement learning, simulation of human behavior, (22 more...)

arXiv.org Artificial Intelligence

2006.04948

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > Canada > Ontario > Toronto (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.13)
(14 more...)

Genre:

Overview (1.00)
Instructional Material (1.00)
Research Report > Experimental Study (0.92)
Research Report > New Finding (0.92)

Industry:

Transportation > Air (1.00)
Law Enforcement & Public Safety (1.00)
Information Technology > Security & Privacy (1.00)
(11 more...)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(10 more...)

Add feedback

Introduction to Linear Algebra for Applied Machine Learning with Python

#artificialintelligenceMay-28-2020, 04:32:39 GMT

Linear algebra is to machine learning as flour to bakery: every machine learning model is based in linear algebra, as every cake is based in flour. It is not the only ingredient, of course. Machine learning models need vector calculus, probability, and optimization, as cakes need sugar, eggs, and butter. Applied machine learning, like bakery, is essentially about combining these mathematical ingredients in clever ways to create useful (tasty?) models. This document contains introductory level linear algebra notes for applied machine learning. It is meant as a reference rather than a comprehensive review. If you ever get confused by matrix multiplication, don't remember what was the $L_2$ norm, or the conditions for linear independence, this can serve as a quick reference. It also a good introduction for people that don't need a deep understanding of linear algebra, but still want to learn about the fundamentals to read about machine learning or to use pre-packaged machine learning ...

artificial intelligence, machine learning, textit, (19 more...)

#artificialintelligence

Country: Asia > Middle East > Jordan (0.04)

Genre: Overview (0.34)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Generative Adversarial Networks Applied to Observational Health Data

Georges-Filteau, Jeremy, Cirillo, Elisa

arXiv.org Machine LearningMay-28-2020

Having been collected for its primary purpose in patient care, Observational Health Data (OHD) can further benefit patient well-being by sustaining the development of health informatics. However, the potential for secondary usage of OHD continues to be hampered by the fiercely private nature of patient-related data. Generative Adversarial Networks (GAN) have Generative Adversarial Networks (GAN) have recently emerged as a groundbreaking approach to efficiently learn generative models that produce realistic Synthetic Data (SD). However, the application of GAN to OHD seems to have been lagging in comparison to other fields. We conducted a review of GAN algorithms for OHD in the published literature, and report our findings here.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Machine Learning

doi: 10.22541/au.158921777.79483839

2005.1351

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois (0.04)
(9 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Overview (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
(10 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

A Comprehensive Survey on Outlying Aspect Mining Methods

Samariya, Durgesh, Ma, Jiangang, Aryal, Sunil

arXiv.org Machine LearningMay-27-2020

In recent years, researchers have become increasingly interested in outlying aspect mining. Outlying aspect mining is the task of finding a set of feature(s), where a given data object is different from the rest of the data objects. Remarkably few studies have been designed to address the problem of outlying aspect mining; therefore, little is known about outlying aspect mining approaches and their strengths and weaknesses among researchers. In this work, we have grouped existing outlying aspect mining approaches in three different categories. For each category, we have provided existing work that falls in that category and then provided their strengths and weaknesses in those categories. We also offer time complexity comparison of the current techniques since it is a crucial issue in the real-world scenario. The motive behind this paper is to give a better understanding of the existing outlying aspect mining techniques and how these techniques have been developed.

artificial intelligence, data mining, machine learning, (15 more...)

arXiv.org Machine Learning

2005.02637

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Oceania > Australia (0.04)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Overview (0.95)

Industry: Materials > Metals & Mining (0.34)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.70)

Add feedback

The prospects of quantum computing in computational molecular biology

Outeiral, Carlos, Strahm, Martin, Shi, Jiye, Morris, Garrett M., Benjamin, Simon C., Deane, Charlotte M.

arXiv.org Machine LearningMay-26-2020

Quantum computers can in principle solve certain problems exponentially more quickly than their classical counterparts. We have not yet reached the advent of useful quantum computation, but when we do, it will affect nearly all scientific disciplines. In this review, we examine how current quantum algorithms could revolutionize computational biology and bioinformatics. There are potential benefits across the entire field, from the ability to process vast amounts of information and run machine learning algorithms far more efficiently, to algorithms for quantum simulation that are poised to improve computational calculations in drug discovery, to quantum algorithms for optimization that may advance fields from protein structure prediction to network analysis. However, these exciting prospects are susceptible to "hype", and it is also important to recognize the caveats and challenges in this new technology. Our aim is to introduce the promise and limitations of emerging quantum computing technologies in the areas of computational molecular biology and bioinformatics.

artificial intelligence, bioinformatics, machine learning, (16 more...)

arXiv.org Machine Learning

doi: 10.1002/wcms.1481

2005.12792

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
Europe > Netherlands > North Holland > Amsterdam (0.04)
North America > United States > New York > Nassau County > Mineola (0.04)
(8 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Biomedical Informatics > Translational Bioinformatics (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Add feedback

A Bayesian-inspired, deep learning, semi-supervised domain adaptation technique for land cover mapping

Lucas, Benjamin, Pelletier, Charlotte, Schmidt, Daniel, Webb, Geoffrey I., Petitjean, François

arXiv.org Machine LearningMay-25-2020

Land cover maps are a vital input variable to many types of environmental research and management. While they can be produced automatically by machine learning techniques, these techniques require substantial training data to achieve high levels of accuracy, which are not always available. One technique researchers use when labelled training data are scarce is domain adaptation (DA) -- where data from an alternate region, known as the source domain, are used to train a classifier and this model is adapted to map the study region, or target domain. The scenario we address in this paper is known as semi-supervised DA, where some labelled samples are available in the target domain. In this paper we present Sourcerer, a Bayesian-inspired, deep learning-based, semi-supervised DA technique for producing land cover maps from SITS data. The technique takes a convolutional neural network trained on a source domain and then trains further on the available target domain with a novel regularizer applied to the model weights. The regularizer adjusts the degree to which the model is modified to fit the target data, limiting the degree of change when the target data are few in number and increasing it as target data quantity increases. Our experiments on Sentinel-2 time series images compare Sourcerer with two state-of-the-art semi-supervised domain adaptation techniques and four baseline models. We show that on two different source-target domain pairings Sourcerer outperforms all other methods for any quantity of labelled target data available. In fact, the results on the more difficult target domain show that the starting accuracy of Sourcerer (when no labelled target data are available), 74.2%, is greater than the next-best state-of-the-art method trained on 20,000 labelled target instances.

artificial intelligence, machine learning, target data, (19 more...)

arXiv.org Machine Learning

2005.1193

Country:

Europe > Denmark > Capital Region > Copenhagen (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(3 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.67)

Industry:

Food & Agriculture > Agriculture (0.46)
Government (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Predicting Online Item-choice Behavior: A Shape-restricted Regression Perspective

Nishimura, Naoki, Sukegawa, Noriyoshi, Takano, Yuichi, Iwanaga, Jiro

arXiv.org Artificial IntelligenceMay-24-2020

This paper examines the relationship between user pageview (PV) histories and their item-choice behavior on an e-commerce website. We focus on PV sequences, which represent time series of the number of PVs for each user--item pair. We propose a shape-restricted optimization model that accurately estimates item-choice probabilities for all possible PV sequences. This model imposes monotonicity constraints on item-choice probabilities by exploiting partial orders for PV sequences, according to the recency and frequency of a user's previous PVs. To improve the computational efficiency of our optimization model, we devise efficient algorithms for eliminating all redundant constraints according to the transitivity of the partial orders. Experimental results using real-world clickstream data demonstrate that our method achieves higher prediction performance than that of a state-of-the-art optimization model and common machine learning methods.

artificial intelligence, machine learning, optimization problem, (17 more...)

arXiv.org Artificial Intelligence

2004.08519

Country:

Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.04)

Genre:

Overview (1.00)
Research Report > New Finding (0.93)

Industry: Information Technology > Services > e-Commerce Services (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.90)

Add feedback

Object Detection and Image Segmentation with Deep Learning on Earth Observation Data: A Review-Part I: Evolution and Recent Trends

#artificialintelligenceMay-23-2020, 23:51:42 GMT

Deep learning (DL) has great influence on large parts of science and increasingly established itself as an adaptive method for new challenges in the field of Earth observation (EO). Nevertheless, the entry barriers for EO researchers are high due to the dense and rapidly developing field mainly driven by advances in computer vision (CV). To lower the barriers for researchers in EO, this review gives an overview of the evolution of DL with a focus on image segmentation and object detection in convolutional neural networks (CNN). The survey starts in 2012, when a CNN set new standards in image recognition, and lasts until late 2019. Thereby, we highlight the connections between the most important CNN architectures and cornerstones coming from CV in order to alleviate the evaluation of modern DL models.

artificial intelligence, deep learning, machine learning, (4 more...)

#artificialintelligence

Genre: Overview (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.98)

Add feedback

Artificial Intelligence Market: Global Trends, Opportunities And Industry Forecast To 2026

#artificialintelligenceMay-23-2020, 23:41:54 GMT

The research report on artificial intelligence market, in substance, presents an exclusive understanding of the vast expanse of the business space in question. The report comprises a gist of the industry by means of providing an executive summary, industry insights, industry ecosystem analysis, market segmentation, and global trends. Furthermore, the study also provides deliverables pertaining to the regulatory and competitive landscapes and the strategic perspectives of various industry contenders with respect to the artificial intelligence indutry . However, the major challenges faced by industry players are the low return on investment and the complexity involved in the creation of AI mechanisms and models. Lack of energy-efficient and cost-effective hardware restricts the adoption of such technology in small and medium enterprises, thereby restricting the artificial intelligence market growth during the forecast timeline.

artificial intelligence, artificial intelligence market, natural language, (14 more...)

#artificialintelligence

Country: North America (0.06)

Genre:

Research Report > Experimental Study (0.72)
Overview > Growing Problem (0.62)

Industry:

Energy (0.37)
Information Technology (0.32)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.32)

Add feedback