Oceania
Explainable Reinforcement Learning Through a Causal Lens
Madumal, Prashan, Miller, Tim, Sonenberg, Liz, Vetere, Frank
Prevalent theories in cognitive science propose that humans understand and represent the knowledge of the world through causal relationships. In making sense of the world, we build causal models in our mind to encode cause-effect relations of events and use these to explain why new events happen. In this paper, we use causal models to derive causal explanations of behaviour of reinforcement learning agents. We present an approach that learns a structural causal model during reinforcement learning and encodes causal relationships between variables of interest. This model is then used to generate explanations of behaviour based on counterfactual analysis of the causal model. We report on a study with 120 participants who observe agents playing a real-time strategy game (Starcraft II) and then receive explanations of the agents' behaviour. We investigated: 1) participants' understanding gained by explanations through task prediction; 2) explanation satisfaction and 3) trust. Our results show that causal model explanations perform better on these measures compared to two other baseline explanation models.
FOBE and HOBE: First- and High-Order Bipartite Embeddings
Typical graph embeddings may not capture type-specific bipartite graph features that arise in such areas as recommender systems, data visualization, and drug discovery. Machine learning methods utilized in these applications would be better served with specialized embedding techniques. We propose two embeddings for bipartite graphs that decompose edges into sets of indirect relationships between node neighborhoods. When sampling higher-order relationships, we reinforce similarities through algebraic distance on graphs. We also introduce ensemble embeddings to combine both into a "best of both worlds" embedding. The proposed methods are evaluated on link prediction and recommendation tasks and compared with other state-of-the-art embeddings. Our embeddings are found to perform better on recommendation tasks and equally competitive in link prediction. While being all highly beneficial in applications, we demonstrate that none of the existing state-of-the-art or our embeddings is clearly superior (in contrast to what is claimed in many papers), and discuss the trade offs present among them.
What engineers can do to help ensure artificial intelligence is ethical - Create
Professor Toby Walsh is a top artificial intelligence (AI) researcher and a vocal advocate for a national and international ban on autonomous weapons, or killer robots. As the technology becomes a more important part of our lives, engineers also need to discuss the issues and impacts introduced by "more mundane" applications such as facial recognition and news algorithms, Walsh told create. "There are some important ethical dimensions to the way [the technology] is being used and how it will impact our lives," he said. To this end, CSIRO is seeking input on a discussion paper titled Artificial Intelligence: Australia's Ethics Framework. The paper, which Walsh helped review, identifies ways to "achieve the best possible results from AI, while keeping the well-being of Australians as the top priority".
SemEval-2019 Task 8: Fact Checking in Community Question Answering Forums
Mihaylova, Tsvetomila, Karadjov, Georgi, Atanasova, Pepa, Baly, Ramy, Mohtarami, Mitra, Nakov, Preslav
We present SemEval-2019 Task 8 on Fact Checking in Community Question Answering Forums, which features two subtasks. Subtask A is about deciding whether a question asks for factual information vs. an opinion/advice vs. just socializing. Subtask B asks to predict whether an answer to a factual question is true, false or not a proper answer. We received 17 official submissions for subtask A and 11 official submissions for Subtask B. For subtask A, all systems improved over the majority class baseline. For Subtask B, all systems were below a majority class baseline, but several systems were very close to it. The leaderboard and the data from the competition can be found at http://competitions.codalab.org/competitions/20022
Multi-view Information-theoretic Co-clustering for Co-occurrence Data
Xu, Peng, Deng, Zhaohong, Choi, Kup-Sze, Cao, Longbing, Wang, Shitong
Multi-view clustering has received much attention recently. Most of the existing multi-view clustering methods only focus on one-sided clustering. As the co-occurring data elements involve the counts of sample-feature co-occurrences, it is more efficient to conduct two-sided clustering along the samples and features simultaneously. To take advantage of two-sided clustering for the co-occurrences in the scene of multi-view clustering, a two-sided multi-view clustering method is proposed, i.e., multi-view information-theoretic co-clustering (MV-ITCC). The proposed method realizes two-sided clustering for co-occurring multi-view data under the formulation of information theory. More specifically, it exploits the agreement and disagreement among views by sharing a common clustering results along the sample dimension and keeping the clustering results of each view specific along the feature dimension. In addition, the mechanism of maximum entropy is also adopted to control the importance of different views, which can give a right balance in leveraging the agreement and disagreement. Extensive experiments are conducted on text and image multi-view datasets. The results clearly demonstrate the superiority of the proposed method.
Lifelong Neural Predictive Coding: Sparsity Yields Less Forgetting when Learning Cumulatively
Ororbia, Alexander, Mali, Ankur, Kifer, Daniel, Giles, C. Lee
In lifelong learning systems, especially those based on artificial neural networks, one of the biggest obstacles is the severe inability to retain old knowledge as new information is encountered. This phenomenon is known as catastrophic forgetting. In this paper, we present a new connectionist model, the Sequential Neural Coding Network, and its learning procedure, grounded in the neurocognitive theory of predictive coding. The architecture experiences significantly less forgetting as compared to standard neural models and outperforms a variety of previously proposed remedies and methods when trained across multiple task datasets in a stream-like fashion. The promising performance demonstrated in our experiments offers motivation that directly incorporating mechanisms prominent in real neuronal systems, such as competition, sparse activation patterns, and iterative input processing, can create viable pathways for tackling the challenge of lifelong machine learning.
AI: Smart Clothes as instructors - Innovation Origins
Until a few years ago, clothing served only to protect people and at the same time still had fashionable aspects. But meanwhile, our second skin can do more and more. The measurement of body data such as pulse value or calorie consumption by means of integrated sensors is almost an old hat. Now, however, the clothing will also take on teaching functions through artificial intelligence: On the one hand as a trainer for humans, on the other hand as a programmer for robots. The latest development comes from Turing Sense.
Learning Surrogate Losses
Grabocka, Josif, Scholz, Randolf, Schmidt-Thieme, Lars
The minimization of loss functions is the heart and soul of Machine Learning. In this paper, we propose an off-the-shelf optimization approach that can minimize virtually any non-differentiable and non-decomposable loss function (e.g. Miss-classification Rate, AUC, F1, Jaccard Index, Mathew Correlation Coefficient, etc.) seamlessly. Our strategy learns smooth relaxation versions of the true losses by approximating them through a surrogate neural network. The proposed loss networks are set-wise models which are invariant to the order of mini-batch instances. Ultimately, the surrogate losses are learned jointly with the prediction model via bilevel optimization. Empirical results on multiple datasets with diverse real-life loss functions compared with state-of-the-art baselines demonstrate the efficiency of learning surrogate losses.
Continual Reinforcement Learning in 3D Non-stationary Environments
Lomonaco, Vincenzo, Desai, Karan, Culurciello, Eugenio, Maltoni, Davide
High-dimensional always-changing environments constitute a hard challenge for current reinforcement learning techniques. Artificial agents, nowadays, are often trained off-line in very static and controlled conditions in simulation such that training observations can be thought as sampled i.i.d. from the entire observations space. However, in real world settings, the environment is often non-stationary and subject to unpredictable, frequent changes. In this paper we propose and openly release CRLMaze, a new benchmark for learning continually through reinforcement in a complex 3D non-stationary task based on ViZDoom and subject to several environmental changes. Then, we introduce an end-to-end model-free continual reinforcement learning strategy showing competitive results with respect to four different baselines and not requiring any access to additional supervised signals, previously encountered environmental conditions or observations.
Sparse Gaussian Process Modulated Hawkes Process
Zhang, Rui, Walder, Christian, Rizoiu, Marian-Andrei
The Hawkes process has been widely applied to modeling self-exciting events, including neuron spikes, earthquakes and tweets. To avoid designing parametric kernel functions and to be able to quantify the prediction confidence, non-parametric Bayesian Hawkes processes have been proposed. However the inference of such models suffers from unscalability or slow convergence. In this paper, we first propose a new non-parametric Bayesian Hawkes process whose triggering kernel is modeled as a squared sparse Gaussian process. Second, we present the variational inference scheme for the model optimization, which has the advantage of linear time complexity by leveraging the stationarity of the triggering kernel. Third, we contribute a tighter lower bound than the evidence lower bound of the marginal likelihood for the model selection. Finally, we exploit synthetic data and large-scale social media data to validate the efficiency of our method and the practical utility of our approximate marginal likelihood. We show that our approach outperforms state-of-the-art non-parametric Bayesian and non-Bayesian methods.