AITopics

Country:

Asia > China (0.14)
North America > United States > Florida > Broward County (0.04)
North America > United States > District of Columbia > Washington (0.04)
Asia > Indonesia (0.04)

Genre: Overview (0.48)

Industry:

Information Technology > Security & Privacy (1.00)
Transportation > Air (0.94)
Government > Immigration & Customs (0.94)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

arXiv.org Machine LearningNov-6-2019

A Comprehensive Survey on Transfer Learning

Zhuang, Fuzhen, Qi, Zhiyuan, Duan, Keyu, Xi, Dongbo, Zhu, Yongchun, Zhu, Hengshu, Xiong, Hui, He, Qing

Transfer learning aims at improving the performance of target learners on target domains by transferring the knowledge contained in different but related source domains. In this way, the dependence on a large number of target domain data can be reduced for constructing target learners. Due to the wide application prospects, transfer learning has become a popular and promising area in machine learning. Although there are already some valuable and impressive surveys on transfer learning, these surveys introduce approaches in a relatively isolated way and lack the recent advances in transfer learning. As the rapid expansion of the transfer learning area, it is both necessary and challenging to comprehensively review the relevant studies. This survey attempts to connect and systematize the existing transfer learning researches, as well as to summarize and interpret the mechanisms and the strategies in a comprehensive way, which may help readers have a better understanding of the current research status and ideas. Different from previous surveys, this survey paper reviews over forty representative transfer learning approaches from the perspectives of data and model. The applications of transfer learning are also briefly introduced. In order to show the performance of different transfer learning models, twenty representative transfer learning models are used for experiments. The models are performed on three different datasets, i.e., Amazon Reviews, Reuters-21578, and Office-31. And the experimental results demonstrate the importance of selecting appropriate transfer learning models for different applications in practice.

classifier, international conference, proc, (15 more...)

1911.02685

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(20 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.34)

Industry:

Health & Medicine (1.00)
Education (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceNov-5-2019, 10:29:48 GMT

Researchers develop machine learning-based detector that stops lateral phishing attacks - Help Net Security

Lateral phishing attacks – scams targeting users from compromised email accounts within an organization – are becoming an increasing concern in the U.S. Whereas in the past attackers would send phishing scams from email accounts external to an organization, recently there's been an explosion of email-borne scams in which an attackers compromise email accounts within organizations, and then uses those accounts to launch internal phishing emails to fellow employees – the kind of attacks known as lateral phishing. And when a phishing email comes from an internal account, the vast majority of email security systems can't stop it. Existing security systems largely detect cyber attacks that come from the outside, relying on signals like IP and domain reputation, which are ineffective when the email comes from an internal source. Lateral phishing attacks are also costly. FBI data shows that these cyberattacks caused more than $12 billion in losses between 2013-2018.

email, email account, lateral, (10 more...)

Country: North America > United States (0.25)

Genre: Overview > Growing Problem (0.36)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.57)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.32)

Ahmetoğlu, Alper, Alpaydın, Ethem

Hierarchical Mixtures of Generators for Adversarial Learning

arXiv.org Machine LearningNov-5-2019

Generative adversarial networks (GANs) are deep neural networks that allow us to sample from an arbitrary probability distribution without explicitly estimating the distribution. There is a generator that takes a latent vector as input and transforms it into a valid sample from the distribution. There is also a discriminator that is trained to discriminate such fake samples from true samples of the distribution; at the same time, the generator is trained to generate fakes that the discriminator cannot tell apart from the true samples. Instead of learning a global generator, a recent approach involves training multiple generators each responsible from one part of the distribution. In this work, we review such approaches and propose the hierarchical mixture of generators, inspired from the hierarchical mixture of experts model, that learns a tree structure implementing a hierarchical clustering with soft splits in the decision nodes and local generators in the leaves. Since the generators are combined softly, the whole model is continuous and can be trained using gradient-based optimization, just like the original GAN model. Our experiments on five image data sets, namely, MNIST, FashionMNIST, UTZap50K, Oxford Flowers, and CelebA, show that our proposed model generates samples of high quality and diversity in terms of popular GAN evaluation metrics. The learned hierarchical structure also leads to knowledge extraction.

arxiv preprint arxiv, generator, hierarchical mixture, (13 more...)

1911.02069

Country:

Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report (0.50)
Overview (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

arXiv.org Machine LearningNov-5-2019

Efficiently Learning Structured Distributions from Untrusted Batches

Chen, Sitan, Li, Jerry, Moitra, Ankur

We study the problem, introduced by Qiao and Valiant, of learning from untrusted batches. Here, we assume $m$ users, all of whom have samples from some underlying distribution $p$ over $1, \ldots, n$. Each user sends a batch of $k$ i.i.d. samples from this distribution; however an $\epsilon$-fraction of users are untrustworthy and can send adversarially chosen responses. The goal is then to learn $p$ in total variation distance. When $k = 1$ this is the standard robust univariate density estimation setting and it is well-understood that $\Omega (\epsilon)$ error is unavoidable. Suprisingly, Qiao and Valiant gave an estimator which improves upon this rate when $k$ is large. Unfortunately, their algorithms run in time exponential in either $n$ or $k$. We first give a sequence of polynomial time algorithms whose estimation error approaches the information-theoretically optimal bound for this problem. Our approach is based on recent algorithms derived from the sum-of-squares hierarchy, in the context of high-dimensional robust estimation. We show that algorithms for learning from untrusted batches can also be cast in this framework, but by working with a more complicated set of test functions. It turns out this abstraction is quite powerful and can be generalized to incorporate additional problem specific constraints. Our second and main result is to show that this technology can be leveraged to build in prior knowledge about the shape of the distribution. Crucially, this allows us to reduce the sample complexity of learning from untrusted batches to polylogarithmic in $n$ for most natural classes of distributions, which is important in many applications. To do so, we demonstrate that these sum-of-squares algorithms for robust mean estimation can be made to handle complex combinatorial constraints (e.g. those arising from VC theory), which may be of independent technical interest.

algorithm, batch, constraint, (17 more...)

1911.02035

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)
North America > United States > New York (0.04)
Asia > China (0.04)

Genre:

Overview (0.92)
Research Report (0.64)

Technology:

Information Technology > Data Science (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.66)

Rausch, Johannes, Martinez, Octavio, Bissig, Fabian, Zhang, Ce, Feuerriegel, Stefan

DocParser: Hierarchical Structure Parsing of Document Renderings

arXiv.org Machine LearningNov-5-2019

PDFs, scans) into hierarchical structures is extensively demanded in the daily routines of many real-world applications, and is often a prerequisite step of many downstream NLP tasks. Earlier attempts focused on different but simpler tasks such as the detection of table or cell locations within documents; however, a holistic, principled approach to inferring the complete hierarchical structure in documents is missing. As a remedy, we developed "Doc-Parser": an end-to-end system for parsing the complete document structure - including all text elements, figures, tables, and table cell structures. To the best of our knowledge, Doc-Parser is the first system that derives the full hierarchical document compositions. Given the complexity of the task, annotating appropriate datasets is costly. Therefore, our second contribution is to provide a dataset for evaluating hierarchical document structure parsing. Our third contribution is to propose a scalable learning framework for settings where domain-specific data is scarce, which we address by a novel approach to weak supervision. Our computational experiments confirm the effectiveness of our proposed weak supervision: Compared to the baseline without weak supervision, it improves the mean average precision for detecting document entities by 37.1 % . When classifying hierarchical relations between entity pairs, it improves the F1 score by 27.6 % . 1 Introduction

dataset, detection, weak supervision, (16 more...)

1911.01702

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
Europe > Finland > Pirkanmaa > Tampere (0.04)

Genre:

Research Report > Promising Solution (0.34)
Overview > Innovation (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.93)

arXiv.org Artificial IntelligenceNov-5-2019

Efficient Multi-robot Exploration via Multi-head Attention-based Cooperation Strategy

Liu, Shuqi, Wu, Zhaoxia

The goal of coordinated multi-robot exploration tasks is to employ a team of autonomous robots to explore an unknown environment as quickly as possible. Compared with human-designed methods, which began with heuristic and rule-based approaches, learning-based methods enable individual robots to learn sophisticated and hard-to-design cooperation strategies through deep reinforcement learning technologies. However, in decentralized multi-robot exploration tasks, learning-based algorithms are still far from being universally applicable to the continuous space due to the difficulties associated with area calculation and reward function designing; moreover, existing learning-based methods encounter problems when attempting to balance the historical trajectory issue and target area conflict problem. Furthermore, the scalability of these methods to a large number of agents is poor because of the exponential explosion problem of state space. Accordingly, this paper proposes a novel approach - Multi-head Attention-based Multi-robot Exploration in Continuous Space (MAMECS) - aimed at reducing the state space and automatically learning the cooperation strategies required for decentralized multi-robot exploration tasks in continuous space. Computational geometry knowledge is applied to describe the environment in continuous space and to design an improved reward function to ensure a superior exploration rate. Moreover, the multi-head attention mechanism employed helps to solve the historical trajectory issue in the decentralized multi-robot exploration task, as well as to reduce the quadratic increase of action space.

agent, exploration, exploration task, (12 more...)

arXiv.org Artificial Intelligence

1911.01774

Country:

Asia > China > Liaoning Province > Shenyang (0.04)
Asia > China > Hunan Province (0.04)

Genre:

Research Report > Promising Solution (0.34)
Overview > Innovation (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.35)

#artificialintelligenceNov-4-2019, 18:54:23 GMT

What's State Of The Art In AutoML in 2019?

More and more industries and organizations are leveraging artificial intelligence to delight customers and cut through the competition. However, development and deployment of deep learning models is time-consuming and costly – often prohibitively costly. That's when automated machine learning (AutoML) comes into play. AutoML solutions can significantly increase the efficiency of ML model development. Even more importantly, they lower the entry barriers for leveraging AI in business settings by allowing people without IT backgrounds to utilize the most advanced ML algorithms.

algorithm, automl, future research, (3 more...)

Country: Asia > China > Hong Kong (0.07)

Genre: Overview (0.62)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)

#artificialintelligenceNov-4-2019, 10:17:58 GMT

This Is How Machine Learning Is Changing The UK Financial Services Landscape - Hedge Think

Machine Learning applied to financial services industry has the potential to improve outcomes for both businesses and consumers. And in the UK, firms are beginning to take advantage of this. A recent survey, called'Machine Learning in UK Financial Services', carried out by the Bank of England (BoE) and the Financial Conduct Authority (FCA) has found that two thirds of respondents report they already use it in some form. The median firm uses live ML applications in two business areas and this is expected to more than double within the next three years. The Bank of England (BoE) and Financial Conduct Authority (FCA) have a keen interest in the way that ML is being deployed by financial institutions.

deployment, machine learning, ml application, (9 more...)

Country:

Europe > United Kingdom > England > Greater London > London (0.05)
Europe > Spain > Andalusia > Seville Province > Seville (0.05)

Genre:

Questionnaire & Opinion Survey (0.51)
Overview (0.35)

Industry:

Banking & Finance > Financial Services (1.00)
Government > Regional Government > Europe Government > United Kingdom Government (0.97)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial IntelligenceNov-4-2019

Algorithms and Statistical Models for Scientific Discovery in the Petabyte Era

Nord, Brian, Connolly, Andrew J., Kinney, Jamie, Kubica, Jeremy, Narayan, Gautaum, Peek, Joshua E. G., Schafer, Chad, Tollerud, Erik J., Avestruz, Camille, Babu, G. Jogesh, Birrer, Simon, Burke, Douglas, Caldeira, João, Caldwell, Douglas A., Carlberg, Joleen K., Chen, Yen-Chi, Dong, Chuanfei, Feigelson, Eric D., Golkhou, V. Zach, Kashyap, Vinay, Li, T. S., Loredo, Thomas, Lucie-Smith, Luisa, Mandel, Kaisey S., Martínez-Galarza, J. R., Miller, Adam A., Natarajan, Priyamvada, Ntampaka, Michelle, Ptak, Andy, Rapetti, David, Shamir, Lior, Siemiginowska, Aneta, Sipőcz, Brigitta M., Smith, Arfon M., Tran, Nhan, Vilalta, Ricardo, Walkowicz, Lucianne M., ZuHone, John

The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our technical and collaborative frameworks to promote efficient algorithmic development and take advantage of opportunities for scientific discovery in the petabyte era. We discuss challenges for discovery in large and complex data sets; challenges and requirements for the next stage of development of statistical methodologies and algorithmic tool sets; how we might change our paradigms of collaboration and education; and the ethical implications of scientists' contributions to widely applicable algorithms and computational modeling. We start with six distinct recommendations that are supported by the commentary following them. This white paper is related to a larger corpus of effort that has taken place within and around the Petabytes to Science Workshops (https://petabytestoscience.github.io/).

algorithm, astronomy, university, (13 more...)

arXiv.org Artificial Intelligence

1911.02479

Country:

North America > United States > Colorado > Boulder County > Boulder (0.04)
North America > United States > Kansas (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(2 more...)

Genre:

Research Report (0.50)
Overview (0.46)
Instructional Material > Course Syllabus & Notes (0.34)

Industry: Education (1.00)

Technology:

Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)