AITopics | Oceania

Collaborating Authors

Oceania

Wasserstein Adversarial Imitation Learning

Xiao, Huang, Herman, Michael, Wagner, Joerg, Ziesche, Sebastian, Etesami, Jalal, Linh, Thai Hong

arXiv.org Machine LearningJun-19-2019

Imitation Learning describes the problem of recovering an expert policy from demonstrations. While inverse reinforcement learning approaches are known to be very sample-efficient in terms of expert demonstrations, they usually require problem-dependent reward functions or a (task-)specific reward-function regularization. In this paper, we show a natural connection between inverse reinforcement learning approaches and Optimal Transport, that enables more general reward functions with desirable properties (e.g., smoothness). Based on our observation, we propose a novel approach called Wasserstein Adversarial Imitation Learning. Our approach considers the Kantorovich potentials as a reward function and further leverages regularized optimal transport to enable large-scale applications. In several robotic experiments, our approach outperforms the baselines in terms of average cumulative rewards and shows a significant improvement in sample-efficiency, by requiring just one expert demonstration.

demonstration, international conference, reward function, (13 more...)

arXiv.org Machine Learning

1906.08113

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
(12 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Solving Multiagent Planning Problems with Concurrent Conditional Effects

Furelos-Blanco, Daniel, Jonsson, Anders

arXiv.org Artificial IntelligenceJun-19-2019

In this work we present a novel approach to solving concurrent multiagent planning problems in which several agents act in parallel. Our approach relies on a compilation from concurrent multiagent planning to classical planning, allowing us to use an off-the-shelf classical planner to solve the original multiagent problem. The solution can be directly interpreted as a concurrent plan that satisfies a given set of concurrency constraints, while avoiding the exponential blowup associated with concurrent actions. Our planner is the first to handle action effects that are conditional on what other agents are doing. Theoretically, we show that the compilation is sound and complete. Empirically, we show that our compilation can solve challenging multiagent planning problems that require concurrent actions.

agent, artificial intelligence, joint action, (14 more...)

arXiv.org Artificial Intelligence

1906.08157

Country:

Oceania > Australia (0.28)
North America > United States (0.28)
Europe > United Kingdom (0.28)

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Can artificial intelligence algorithms help regulate extreme speech?

#artificialintelligenceJun-18-2019, 09:30:00 GMT

Following the attacks in Christchurch, New Zealand, in March, social media companies have once again come under growing pressure to "do …

algorithm help regulate extreme speech, artificial intelligence algorithm help regulate

#artificialintelligence

Country: Oceania > New Zealand > South Island > Canterbury Region > Christchurch (0.54)

Industry: Media > News (0.70)

Technology: Information Technology > Artificial Intelligence (0.85)

Add feedback

10 Machine Learning Startups Transforming Their Industries - Disruption Hub

#artificialintelligenceJun-18-2019, 02:37:13 GMT

Artificial intelligence is one of the technologies with the most transformative potential in business. According to research by McKinsey, 70 per cent of companies are likely to have adopted at least one form of AI by 2030. This will contribute to an additional $13tr of global economic activity. Machine learning – a subset of artificial intelligence – enables machines to get better at executing tasks without human intervention, by finding patterns in data, and learning from their experience. It's no surprise, therefore, that there has been an explosion in the number of machine learning companies worldwide.

artificial intelligence, machine learning, startup, (14 more...)

#artificialintelligence

Country:

North America > United States > New York (0.06)
South America > Brazil (0.05)
South America > Argentina (0.05)
(14 more...)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.72)
Information Technology > Security & Privacy (0.71)
Banking & Finance (0.69)
Transportation > Ground > Road (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.32)

Add feedback

Neural Replicator Dynamics

Omidshafiei, Shayegan, Hennes, Daniel, Morrill, Dustin, Munos, Remi, Perolat, Julien, Lanctot, Marc, Gruslys, Audrunas, Lespiau, Jean-Baptiste, Tuyls, Karl

arXiv.org Artificial IntelligenceJun-18-2019

In multiagent learning, agents interact in inherently nonstationary environments due to their concurrent policy updates. It is, therefore, paramount to develop and analyze algorithms that learn effectively despite these nonstationarities. A number of works have successfully conducted this analysis under the lens of evolutionary game theory (EGT), wherein a population of individuals interact and evolve based on biologically-inspired operators. These studies have mainly focused on establishing connections to value-iteration based approaches in stateless or tabular games. We extend this line of inquiry to formally establish links between EGT and policy gradient (PG) methods, which have been extensively applied in single and multiagent learning. We pinpoint weaknesses of the commonly-used softmax PG algorithm in adversarial and nonstationary settings and contrast PG's behavior to that predicted by replicator dynamics (RD), a central model in EGT. We consequently provide theoretical results that establish links between EGT and PG methods, then derive Neural Replicator Dynamics (NeuRD), a parameterized version of RD that constitutes a novel method with several advantages. First, as NeuRD reduces to the well-studied no-regret Hedge algorithm in the tabular setting, it inherits no-regret guarantees that enable convergence to equilibria in games. Second, NeuRD is shown to be more adaptive to nonstationarity, in comparison to PG, when learning in canonical games and imperfect information benchmarks including Poker. Thirdly, modifying any PG-based algorithm to use the NeuRD update rule is straightforward and incurs no added computational costs. Finally, while single-agent learning is not the main focus of the paper, we verify empirically that NeuRD is competitive in these settings with a recent baseline algorithm.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

arXiv.org Artificial Intelligence

1906.0019

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

A Survey on Neural Architecture Search

Wistuba, Martin, Rawat, Ambrish, Pedapati, Tejaswini

arXiv.org Machine LearningJun-18-2019

The growing interest in both the automation of machine learning and deep learning has inevitably led to the development of a wide variety of automated methods for neural architecture search. The choice of the network architecture has proven to be critical, and many advances in deep learning spring from its immediate improvements. However, deep learning techniques are computationally intensive and their application requires a high level of domain knowledge. Therefore, even partial automation of this process helps to make deep learning more accessible to both researchers and practitioners. With this survey, we provide a formalism which unifies and categorizes the landscape of existing methods along with a detailed analysis that compares and contrasts the different approaches. We achieve this via a comprehensive discussion of the commonly adopted architecture search spaces and architecture optimization algorithms based on principles of reinforcement learning and evolutionary algorithms along with approaches that incorporate surrogate and one-shot models. Additionally, we address the new research directions which include constrained and multi-objective architecture search as well as automated data augmentation, optimizer and activation function search.

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Machine Learning

1905.01392

Country:

Oceania > Australia > New South Wales > Sydney (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(24 more...)

Genre:

Overview (0.66)
Research Report (0.51)
Workflow (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Towards robust audio spoofing detection: a detailed comparison of traditional and learned features

BT, Balamurali, Lin, Kin Wah Edward, Lui, Simon, Chen, Jer-Ming, Herremans, Dorien

arXiv.org Machine LearningJun-18-2019

Automatic speaker verification, like every other biometric system, is vulnerable to spoofing attacks. Using only a few minutes of recorded voice of a genuine client of a speaker verification system, attackers can develop a variety of spoofing attacks that might trick such systems. Detecting these attacks using the audio cues present in the recordings is an important challenge. Most existing spoofing detection systems depend on knowing the used spoofing technique. With this research, we aim at overcoming this limitation, by examining robust audio features, both traditional and those learned through an autoencoder, that are generalizable over different types of replay spoofing. Furthermore, we provide a detailed account of all the steps necessary in setting up state-of-the-art audio feature detection, pre-, and postprocessing, such that the (non-audio expert) machine learning researcher can implement such systems. Finally, we evaluate the performance of our robust replay speaker detection system with a wide variety and different combinations of both extracted and machine learned audio features on the `out in the wild' ASVspoof 2017 dataset. This dataset contains a variety of new spoofing configurations. Since our focus is on examining which features will ensure robustness, we base our system on a traditional Gaussian Mixture Model-Universal Background Model. We then systematically investigate the relative contribution of each feature set. The fused models, based on both the known audio features and the machine learned features respectively, have a comparable performance with an Equal Error Rate (EER) of 12. The final best performing model, which obtains an EER of 10.8, is a hybrid model that contains both known and machine learned features, thus revealing the importance of incorporating both types of features when developing a robust spoofing prediction model.

artificial intelligence, audio feature, machine learning, (19 more...)

arXiv.org Machine Learning

1905.12439

Country:

Asia > Singapore (0.05)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
Asia > China > Hong Kong (0.04)
(12 more...)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.89)
Information Technology > Artificial Intelligence > Speech > Acoustic Processing (0.88)

Add feedback

Analyzing the Structure of Attention in a Transformer Language Model

Vig, Jesse, Belinkov, Yonatan

arXiv.org Machine LearningJun-18-2019

The Transformer is a fully attention-based alternative to recurrent networks that has achieved state-of-the-art results across a range of NLP tasks. In this paper, we analyze the structure of attention in a Transformer language model, the GPT-2 small pretrained model. We visualize attention for individual instances and analyze the interaction between attention and syntax over a large corpus. We find that attention targets different parts of speech at different layer depths within the model, and that attention aligns with dependency relations most strongly in the middle layers. We also find that the deepest layers of the model capture the most distant relationships. Finally, we extract exemplar sentences that reveal highly specific patterns targeted by particular attention heads.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Machine Learning

1906.04284

Country:

Europe > United Kingdom (0.14)
Europe > Germany (0.04)
Oceania > New Zealand (0.04)
(7 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Distance and Similarity Measures Effect on the Performance of K-Nearest Neighbor Classifier -- A Review

Prasath, V. B. Surya, Alfeilat, Haneen Arafat Abu, Lasassmeh, Omar, Hassanat, Ahmad B. A., Tarawneh, Ahmad S.

arXiv.org Artificial IntelligenceJun-18-2019

The K-nearest neighbor (KNN) classifier is one of the simplest and most common classifiers, yet its performance competes with the most complex classifiers in the literature. The core of this classifier depends mainly on measuring the distance or similarity between the tested example and the training examples. This raises a major question about which distance measures to be used for the KNN classifier among a large number of distance and similarity measures? This review attempts to answer the previous question through evaluating the performance (measured by accuracy, precision and recall) of the KNN using a large number of distance measures, tested on a number of real world datasets, with and without adding different levels of noise. The experimental results show that the performance of KNN classifier depends significantly on the distance used, the results showed large gaps between the performances of different distances. We found that a recently proposed non-convex distance performed the best when applied on most datasets comparing to the other tested distances. In addition, the performance of the KNN degraded only about $20\%$ while the noise level reaches $90\%$, this is true for all the distances used. This means that the KNN classifier using any of the top $10$ distances tolerate noise to a certain degree. Moreover, the results show that some distances are less affected by the added noise comparing to other distances.

artificial intelligence, dataset, machine learning, (16 more...)

arXiv.org Artificial Intelligence

1708.04321

Country:

Oceania > Australia > Australian Capital Territory > Canberra (0.04)
North America > United States > Ohio > Hamilton County > Cincinnati (0.04)
South America > Brazil > Ceará > Fortaleza (0.04)
(11 more...)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

Dependency Learning for QBF

Peitl, Tomáš, Slivovsky, Friedrich, Szeider, Stefan

Journal of Artificial Intelligence ResearchJun-18-2019

Quantified Boolean Formulas (QBFs) can be used to succinctly encode problems from domains such as formal verification, planning, and synthesis. One of the main approaches to QBF solving is Quantified Conflict Driven Clause Learning (QCDCL). By default, QCDCL assigns variables in the order of their appearance in the quantifier prefix so as to account for dependencies among variables. Dependency schemes can be used to relax this restriction and exploit independence among variables in certain cases, but only at the cost of nontrivial interferences with the proof system underlying QCDCL. We introduce dependency learning, a new technique for exploiting variable independence within QCDCL that allows solvers to learn variable dependencies on the fly. The resulting version of QCDCL enjoys improved propagation and increased flexibility in choosing variables for branching while retaining ordinary (long-distance) Q-resolution as its underlying proof system. We show that dependency learning can achieve exponential speedups over ordinary QCDCL. Experiments on standard benchmark sets demonstrate the effectiveness of this technique.

constraint, dependency, qcdcl, (13 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.11529

AI Access Foundation

11529

Journal of Artificial Intelligence Research

Country:

North America > United States > Texas > Travis County > Austin (0.14)
Europe > Austria > Vienna (0.14)
Europe > France > Nouvelle-Aquitaine > Gironde > Bordeaux (0.04)
(12 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.46)

Add feedback