Oceania
Manifold Optimisation Assisted Gaussian Variational Approximation
Zhou, Bingxin, Gao, Junbin, Tran, Minh-Ngoc, Gerlach, Richard
Variational approximation methods are a way to approximate the posterior in Bayesian inference especially when the dataset has a large volume or high dimension. Factor covariance structure was introduced in previous work with three restrictions to handle the problem of computational infeasibility in Gaussian approximation. However, the three strong constraints on the covariance matrix could possibly break down during the process of the structure optimization, and the identification issue could still possibly exist within the final approximation. In this paper, we consider two types of manifold parameterization, Stiefel manifold and Grassmann manifold, to address the problems. Moreover, the Riemannian stochastic gradient descent method is applied to solve the resulting optimization problem while maintaining the orthogonal factors. Results from two experiments demonstrate that our model fixes the potential issue of the previous method with comparable accuracy and competitive converge speed even in high-dimensional problems.
Deducing Kurdyka-{\L}ojasiewicz exponent via inf-projection
Yu, Peiran, Li, Guoyin, Pong, Ting Kei
Kurdyka-{\L}ojasiewicz (KL) exponent plays an important role in estimating the convergence rate of many contemporary first-order methods. In particular, a KL exponent of $\frac12$ is related to local linear convergence. Nevertheless, KL exponent is in general extremely hard to estimate. In this paper, we show under mild assumptions that KL exponent is preserved via inf-projection. Inf-projection is a fundamental operation that is ubiquitous when reformulating optimization problems via the lift-and-project approach. By studying its operation on KL exponent, we show that the KL exponent is $\frac12$ for several important convex optimization models, including some semidefinite-programming-representable functions and functions that involve $C^2$-cone reducible structures, under conditions such as strict complementarity. Our results are applicable to concrete optimization models such as group fused Lasso and overlapping group Lasso. In addition, for nonconvex models, we show that the KL exponent of many difference-of-convex functions can be derived from that of their natural majorant functions, and the KL exponent of the Bregman envelope of a function is the same as that of the function itself. Finally, we estimate the KL exponent of the sum of the least squares function and the indicator function of the set of matrices of rank at most $k$.
Hierarchical Critics Assignment for Multi-agent Reinforcement Learning
In this paper, we investigate the use of global information to speed up the learning process and increase the cumulative rewards of multi-agent reinforcement learning (MARL) tasks. Within the actor-critic MARL, we introduce multiple cooperative critics from two levels of the hierarchy and propose a hierarchical critic-based MARL algorithm. In our approach, the agent is allowed to receive information from local and global critics in a competition task. The agent not only receives low-level details but also considers coordination from high levels to obtain global information for increasing operational performance. Here, we define multiple cooperative critics in a top-down hierarchy, called the Hierarchical Critic Assignment (HCA) framework. Our experiment, a two-player tennis competition task performed in the Unity environment, tested the HCA multi-agent framework based on the Asynchronous Advantage Actor-Critic (A3C) with Proximal Policy Optimization (PPO) algorithm. The results showed that the HCA framework outperforms the non-hierarchical critic baseline method on MARL tasks.
Automatic Bayesian Density Analysis
Vergari, Antonio, Molina, Alejandro, Peharz, Robert, Ghahramani, Zoubin, Kersting, Kristian, Valera, Isabel
Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for {exploratory data analysis} are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous data; moreover, their expressiveness generally comes at the price of intractable inference. As a result, supervision from statisticians is usually needed to find the right model for the data. However, since domain experts are not necessarily also experts in statistics, we propose Automatic Bayesian Density Analysis (ABDA) to make exploratory data analysis accessible at large. Specifically, ABDA allows for automatic and efficient missing value estimation, statistical data type and likelihood discovery, anomaly detection and dependency structure mining, on top of providing accurate density estimation. Extensive empirical evidence shows that ABDA is a suitable tool for automatic exploratory analysis of mixed continuous and discrete tabular data.
Artificial Intelligence Study of Human Genome Finds Unknown Human Ancestor
Can the minds of machines teach us something new about what it means to be human? When it comes to the intricate story of our species' complex origins and evolution, it appears that they can. A recent study used machine learning technology to analyze eight leading models of human origins and evolution, and the program identified evidence in the human genome of a "ghost population" of human ancestors. The analysis suggests that a previously unknown and long-extinct group of hominins interbred with Homo sapiens in Asia and Oceania somewhere along the long, winding road of human evolutionary history, leaving behind only fragmented traces in modern human DNA. The study, published in Nature Communications, is one of the first examples of how machine learning can help reveal clues to our own origins.
Machine learning heats up the contest for human talent
"Graduates at IBM are coming into roles where you provide services to a range of different industries as opposed to just working in one," she says. "From a career perspective, they can move across different industries but they also move across different functions. You retain your core expertise but you also get to do different jobs because of the diversity of our clients." IBM works with dozens of Australia's biggest companies using Watson to drive machine learning and data analysis within business, and has the advantage of being able to supply whole teams of experts as challenges arise. In contrast, even big industry employers often have only a few specialists in key areas.
Yes, we GAN: Applying Adversarial Techniques for Autonomous Driving
Uricar, Michal, Krizek, Pavel, Hurych, David, Sobh, Ibrahim, Yogamani, Senthil, Denny, Patrick
Generative Adversarial Networks (GAN) have gained a lot of popularity from their introduction in 2014 till present. Research on GAN is rapidly growing and there are many variants of the original GAN focusing on various aspects of deep learning. GAN are perceived as the most impactful direction of machine learning in the last decade. This paper focuses on the application of GAN in autonomous driving including topics such as advanced data augmentation, loss function learning, semi-supervised learning, etc. We formalize and review key applications of adversarial techniques and discuss challenges and open problems to be addressed.
Pure Exploration with Multiple Correct Answers
Degenne, Rémy, Koolen, Wouter M.
We determine the sample complexity of pure exploration bandit problems with multiple good answers. We derive a lower bound using a new game equilibrium argument. We show how continuity and convexity properties of single-answer problems ensures that the Track-and-Stop algorithm has asymptotically optimal sample complexity. However, that convexity is lost when going to the multiple-answer setting. We present a new algorithm which extends Track-and-Stop to the multiple-answer case and has asymptotic sample complexity matching the lower bound.
Passing Tests without Memorizing: Two Models for Fooling Discriminators
Bousquet, Olivier, Livni, Roi, Moran, Shay
We introduce two mathematical frameworks for foolability in the context of generative distribution learning. In a nuthsell, fooling is an algorithmic task in which the input sample is drawn from some target distribution and the goal is to output a synthetic distribution that is indistinguishable from the target w.r.t to some fixed class of tests. This framework received considerable attention in the context of Generative Adversarial Networks (GANs), a recently proposed approach which achieves impressive empirical results. From a theoretical viewpoint this problem seems difficult to model. This is due to the fact that in its basic form, the notion of foolability is susceptible to a type of overfitting called memorizing. This raises a challenge of devising notions and definitions that separate between fooling algorithms that generate new synthetic data vs. algorithms that merely memorize or copy the training set. The first model we consider is called GAM--Foolability and is inspired by GANs. Here the learner has only an indirect access to the target distribution via a discriminator. The second model, called DP--Foolability, exploits the notion of differential privacy as a candidate criterion for non-memorization. We proceed to characterize foolability within these two models and study their interrelations. We show that DP--Foolability implies GAM--Foolability and prove partial results with respect to the converse. It remains, though, an open question whether GAM--Foolability implies DP--Foolability. We also present an application in the context of differentially private PAC learning. We show that from a statistical perspective, for any class H, learnability by a private proper learner is equivalent to the existence of a private sanitizer for H. This can be seen as an analogue of the equivalence between uniform convergence and learnability in classical PAC learning.
Sydney Machine Learning (Sydney, Australia)
PLEASE NOTE: that RSVPing to this page DOES NOT GRANT YOU ACCESS to this meetup, Spaces are limited! DESCRIPTION How do we design Ai systems that we trust? Algorithmic Bias, Algorithmic Transparency, Technological Unemployment, Data Privacy & Algorithmic Misinformation (fake news) are just some of the issues facing the fair and ethical use of Machine Learning. In collaboration with Microsoft for this DSAi special edition Ethics & Interpretability event - come along to learn from industry leaders how issues such as Algorithmic Bias might affect you & what is being done to address the ethical use of Machine Learning in 2019. 'Ethics for Artificial Intelligence' In this 20 minute presentation, Aurelie will provide a formal introduction as to what ethical and responsible AI is.