AITopics

Zhu, Zhuangdi, Lin, Kaixiang, Zhou, Jiayu

Transfer Learning in Deep Reinforcement Learning: A Survey

This paper surveys the field of transfer learning in the problem setting of Reinforcement Learning (RL). RL has been the key solution to sequential decision-making problems. Along with the fast advance of RL in various domains. including robotics and game-playing, transfer learning arises as an important technique to assist RL by leveraging and transferring external expertise to boost the learning process. In this survey, we review the central issues of transfer learning in the RL domain, providing a systematic categorization of its state-of-the-art techniques. We analyze their goals, methodologies, applications, and the RL frameworks under which these transfer learning techniques would be approachable. We discuss the relationship between transfer learning and other relevant topics from an RL perspective and also explore the potential challenges as well as future development directions for transfer learning in RL.

demonstration, machine learning, reinforcement learning, (14 more...)

2009.07888

Country:

North America > United States > Texas > Travis County > Austin (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Michigan > Ingham County > Lansing (0.04)
(3 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.34)

Industry:

Education (1.00)
Energy > Power Industry (0.92)
Leisure & Entertainment > Games > Computer Games (0.68)
Health & Medicine > Therapeutic Area > Immunology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Letard, Alexandre, Amghar, Tassadit, Camp, Olivier, Gutowski, Nicolas

Partial Bandit and Semi-Bandit: Making the Most Out of Scarce Users' Feedback

Recent works on Multi-Armed Bandits (MAB) and Combinatorial Multi-Armed Bandits (COM-MAB) show good results on a global accuracy metric. This can be achieved, in the case of recommender systems, with personalization. However, with a combinatorial online learning approach, personalization implies a large amount of user feedbacks. Such feedbacks can be hard to acquire when users need to be directly and frequently solicited. For a number of fields of activities undergoing the digitization of their business, online learning is unavoidable. Thus, a number of approaches allowing implicit user feedback retrieval have been implemented. Nevertheless, this implicit feedback can be misleading or inefficient for the agent's learning. Herein, we propose a novel approach reducing the number of explicit feedbacks required by Combinatorial Multi Armed bandit (COM-MAB) algorithms while providing similar levels of global accuracy and learning efficiency to classical competitive methods. In this paper we present a novel approach for considering user feedback and evaluate it using three distinct strategies. Despite a limited number of feedbacks returned by users (as low as 20% of the total), our approach obtains similar results to those of state of the art approaches.

artificial intelligence, data mining, machine learning, (19 more...)

2009.07518

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)

Genre:

Research Report > Promising Solution (1.00)
Overview > Innovation (0.74)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.89)

Efficient Transformers: A Survey

Tay, Yi, Dehghani, Mostafa, Bahri, Dara, Metzler, Donald

Transformer model architectures have garnered immense interest lately due to their effectiveness across a range of domains like language, vision and reinforcement learning. In the field of natural language processing for example, Transformers have become an indispensable staple in the modern deep learning stack. Recently, a dizzying number of "X-former" models have been proposed - Reformer, Linformer, Performer, Longformer, to name a few - which improve upon the original Transformer architecture, many of which make improvements around computational and memory efficiency. With the aim of helping the avid researcher navigate this flurry, this paper characterizes a large and thoughtful selection of recent efficiency-flavored "X-former" models, providing an organized and comprehensive overview of existing work and models across multiple domains.

artificial intelligence, machine learning, natural language, (17 more...)

2009.06732

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Genre: Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Zhang, Shiqi, Sridharan, Mohan

A Survey of Knowledge-based Sequential Decision Making under Uncertainty

Reasoning with declarative knowledge (RDK) and sequential decision-making (SDM) are two key research areas in artificial intelligence. RDK methods reason with declarative domain knowledge, including commonsense knowledge, that is either provided a priori or acquired over time, while SDM methods (probabilistic planning and reinforcement learning) seek to compute action policies that maximize the expected cumulative utility over a time horizon; both classes of methods reason in the presence of uncertainty. Despite the rich literature in these two areas, researchers have not fully explored their complementary strengths. In this paper, we survey algorithms that leverage RDK methods while making sequential decisions under uncertainty. We discuss significant developments, open problems, and directions for future work.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

2008.08548

Country:

North America > United States > New York > Broome County > Binghamton (0.04)
Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)
North America > United States > Texas (0.04)
(5 more...)

Genre: Overview (0.93)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Bojer, Casper Solheim, Meldgaard, Jens Peder

Kaggle forecasting competitions: An overlooked learning opportunity

arXiv.org Machine LearningSep-16-2020

Competitions play an invaluable role in the field of forecasting, as exemplified through the recent M4 competition. The competition received attention from both academics and practitioners and sparked discussions around the representativeness of the data for business forecasting. Several competitions featuring real-life business forecasting tasks on the Kaggle platform has, however, been largely ignored by the academic community. We believe the learnings from these competitions have much to offer to the forecasting community and provide a review of the results from six Kaggle competitions. We find that most of the Kaggle datasets are characterized by higher intermittence and entropy than the M-competitions and that global ensemble models tend to outperform local single models. Furthermore, we find the strong performance of gradient boosted decision trees, increasing success of neural networks for forecasting, and a variety of techniques for adapting machine learning models to the forecasting task.

competition, neural network, time sery, (15 more...)

arXiv.org Machine Learning

doi: 10.1016/j.ijforecast.2020.07.007

2009.07701

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Africa > Central African Republic > Ombella-M'Poko > Bimbo (0.04)
North America > United States > New York (0.04)

Genre:

Contests & Prizes (0.90)
Research Report (0.82)
Overview (0.66)

Industry:

Education (0.50)
Banking & Finance (0.46)
Retail (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

#artificialintelligenceSep-15-2020, 19:55:38 GMT

Reformer, Longformer, and ELECTRA: Key Updates To Transformer Architecture In 2020

The leading pre-trained language models demonstrate remarkable performance on different NLP tasks, making them a much-welcomed tool for a number of applications, including sentiment analysis, chatbots, text summarization, and so on. However, good performance usually comes at the cost of enormous computational resources that are not accessible by most researchers and business practitioners. To address this issue, different research groups are working on increasing the compute-efficiency and parameter-efficiency of the pre-trained language models without sacrificing their accuracy. Among the novel approaches introduced this year, at least three methods are appraised by the AI community as very promising. To help you stay aware of the latest NLP research advancements, we have summarized the corresponding research papers in an easy-to-read bullet-point format.

large language model, longformer, machine learning, (18 more...)

#artificialintelligence

Genre: Overview (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

#artificialintelligenceSep-15-2020, 00:09:29 GMT

Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

The interest in Artificial Intelligence (AI) and its applications has seen unprecedented growth in the last few years. This success can be partly attributed to the advancements made in the sub-fields of AI such as Machine Learning (ML), Computer Vision (CV), and Natural Language Processing (NLP). The largest of the growths in these fields has been made possible with deep learning, a sub-area of machine learning, which uses the principles of artificial neural networks. This has created significant interest in the integration of vision and language. The tasks are designed such that they perfectly embrace the ideas of deep learning. In this survey, we focus on ten prominent tasks that integrate language and vision by discussing their problem formulations, methods, existing datasets, evaluation measures, and compare the results obtained with corresponding state-of-the-art methods. Our efforts go beyond earlier surveys which are either task-specific or concentrate only on one type of visual content, i.e., image or video. Furthermore, we also provide some potential future directions in this field of research with an anticipation that this survey brings in innovative thoughts and ideas to address the existing challenges and build new applications.

deep learning, machine learning, vision and language research, (3 more...)

#artificialintelligence

Genre:

Overview (1.00)
Research Report (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)

arXiv.org Artificial IntelligenceSep-15-2020

A Visual Analytics Framework for Explaining and Diagnosing Transfer Learning Processes

Ma, Yuxin, Fan, Arlen, He, Jingrui, Nelakurthi, Arun Reddy, Maciejewski, Ross

Many statistical learning models hold an assumption that the training data and the future unlabeled data are drawn from the same distribution. However, this assumption is difficult to fulfill in real-world scenarios and creates barriers in reusing existing labels from similar application domains. Transfer Learning is intended to relax this assumption by modeling relationships between domains, and is often applied in deep learning applications to reduce the demand for labeled data and training time. Despite recent advances in exploring deep learning models with visual analytics tools, little work has explored the issue of explaining and diagnosing the knowledge transfer process between deep learning models. In this paper, we present a visual analytics framework for the multi-level exploration of the transfer learning processes when training deep neural networks. Our framework establishes a multi-aspect design to explain how the learned knowledge from the existing model is transferred into the new learning task when training deep neural networks. Based on a comprehensive requirement and task analysis, we employ descriptive visualization with performance measures and detailed inspections of model behaviors from the statistical, instance, feature, and model structure levels. We demonstrate our framework through two case studies on image classification by fine-tuning AlexNets to illustrate how analysts can utilize our framework.

artificial intelligence, deep learning, machine learning, (17 more...)

2009.06876

Country:

North America > United States > Illinois (0.04)
North America > United States > Arizona (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report (1.00)
Overview (0.93)

Industry:

Health & Medicine (0.93)
Government > Regional Government > North America Government > United States Government (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Machine LearningSep-15-2020

Meta-Learning for Anomaly Classification with Set Equivariant Networks: Application in the Milky Way

Oladosu, Ademola, Xu, Tony, Ekfeldt, Philip, Kelly, Brian A., Cranmer, Miles, Ho, Shirley, Price-Whelan, Adrian M., Contardo, Gabriella

We present a new meta-learning approach for supervised anomaly classification / one-class classification using set equivariant networks. We focus our experiments on an astronomy application. Our problem setting is composed of a set of classification tasks. Each task has a (small) set of positive, labeled examples and a larger set of unlabeled examples. We expect the positive instances to be much more uncommon (i.e. 'anomalies') than the negative ones ('normal' class). We propose a novel use of equivariant networks for this setting. Specifically we use Deep Sets, which was developed for point-clouds and unordered sets and is equivariant to permutation. We propose to consider the set of positive examples of a given task as a 'point-cloud'. The key idea is that the network directly takes as input the set of positive examples in addition to the current example to classify. This allows the model to predict at test-time on new tasks using only positive labeled examples (i.e 'One-Class classification' setting) by design, potentially without retraining. However, the model is trained in a meta-learning regime on a dataset of several tasks with full-supervision (positive and negative labels). This setup is motivated by our target application on stellar streams. Streams are groups of stars sharing specific properties in various features. For a detected stream, we can determine a set of stars that likely belong to the stream. We aim to characterize the membership of all other nearby stars. We build a meta-dataset of simulated streams injected onto real data and evaluate on unseen synthetic streams and one known stream. Our experiments show encouraging results to explore furthermore equivariant networks for anomaly or 'one-class' classification in a meta-learning regime.

artificial intelligence, inductive learning, machine learning, (17 more...)

arXiv.org Machine Learning

2007.04459

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > New York (0.04)

Genre:

Research Report (1.00)
Overview (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.69)