Bayesian Learning
Fast Distribution Grid Line Outage Identification with $\mu$PMU
Liao, Yizheng, Weng, Yang, Tan, Chin-Woo, Rajagopal, Ram
The growing integration of distributed energy resources (DERs) in urban distribution grids raises various reliability issues due to DER's uncertain and complex behaviors. With a large-scale DER penetration, traditional outage detection methods, which rely on customers making phone calls and smart meters' "last gasp" signals, will have limited performance, because the renewable generators can supply powers after line outages and many urban grids are mesh so line outages do not affect power supply. To address these drawbacks, we propose a data-driven outage monitoring approach based on the stochastic time series analysis from micro phasor measurement unit ($\mu$PMU). Specifically, we prove via power flow analysis that the dependency of time-series voltage measurements exhibits significant statistical changes after line outages. This makes the theory on optimal change-point detection suitable to identify line outages via $\mu$PMUs with fast and accurate sampling. However, existing change point detection methods require post-outage voltage distribution unknown in distribution systems. Therefore, we design a maximum likelihood-based method to directly learn the distribution parameters from $\mu$PMU data. We prove that the estimated parameters-based detection still achieves the optimal performance, making it extremely useful for distribution grid outage identifications. Simulation results show highly accurate outage identification in eight distribution grids with 14 configurations with and without DERs using $\mu$PMU data.
The Deep Weight Prior
Atanov, Andrei, Ashukha, Arsenii, Struminsky, Kirill, Vetrov, Dmitry, Welling, Max
Bayesian inference is known to provide a general framework for incorporating prior knowledge or specific properties into machine learning models via carefully choosing a prior distribution. In this work, we propose a new type of prior distributions for convolutional neural networks, deep weight prior, that in contrast to previously published techniques, favors empirically estimated structure of convolutional filters e.g., spatial correlations of weights. We define deep weight prior as an implicit distribution and propose a method for variational inference with such type of implicit priors. In experiments, we show that deep weight priors can improve the performance of Bayesian neural networks on several problems when training data is limited. Also, we found that initialization of weights of conventional networks with samples from deep weight prior leads to faster training.
Arm Leads Project to Develop an Armpit-Sniffing Plastic AI Chip
Body odor is a stubborn problem. Sensors and the computing attached to them struggle to perceive armpit odors in the way humans do, because B.O. is really a complex mix of dozens of gaseous chemicals. The UK's PlasticArmPit project is designing the first machine learningโenabled flexible plastic sensor chip. Its target audience: those who think they might stink. The prototype chip will be manufactured and tested in 2019.
Machine Learning Tutorial for Beginners - Learn Machine Learning - DataFlair
In this machine learning tutorial, we are going to discuss the detailed what is Machine Learning and the difference between data mining and machine learning. Moreover, we will discuss different types of Machine Learning and different approaches to Machine Learning. Machine Learning is a science to make the machine capable of taking the decision itself. These systems also have the ability to learn from past experience or analyze historical data. It provides results according to its experience. So, let's start the Machine Learning Tutorial.
Bayesian Reinforcement Learning in Factored POMDPs
Katt, Sammie, Oliehoek, Frans, Amato, Christopher
Bayesian approaches provide a principled solution to the exploration-exploitation trade-off in Reinforcement Learning. Typical approaches, however, either assume a fully observable environment or scale poorly. This work introduces the Factored Bayes-Adaptive POMDP model, a framework that is able to exploit the underlying structure while learning the dynamics in partially observable systems. We also present a belief tracking method to approximate the joint posterior over state and model variables, and an adaptation of the Monte-Carlo Tree Search solution method, which together are capable of solving the underlying problem near-optimally. Our method is able to learn efficiently given a known factorization or also learn the factorization and the model parameters at the same time. We demonstrate that this approach is able to outperform current methods and tackle problems that were previously infeasible.
Bayesian State Estimation for Unobservable Distribution Systems via Deep Learning
Mestav, Kursat Rasim, Luengo-Rozas, Jaime, Tong, Lang
Abstract--The problem of state estimation for unobservable distribution systems is considered. A Bayesian approach is proposed that implements Bayesian inference with a deep neural network to achieve the minimum mean squared error estimation of network states for real-time applications. The proposed technique consists of distribution learning for stochastic power injection, a Monte Carlo technique for the training of a deep neural network for state estimation, and a Bayesian bad data detection and cleansing algorithm. Structural characteristics of the deep neural networks are investigated. Simulations illustrate the accuracy of Bayesian state estimation for unobservable systems and demonstrate the benefit of employing a deep neural network. Numerical results show the robustness of Bayesian state estimation against modeling and estimation errors of power injection distributions and the presence of bad data. Comparing with pseudo-measurement techniques, direct Bayesian state estimation with deep neural networks outperforms existing benchmarks. We consider the problem of state estimation for distribution systems that have limited measurements. This problem is motivated by the need of coping with the rising presence of distributed energy resources (DER) in distribution systems.
Comparison of Feature Extraction Methods and Predictors for Income Inference
Fixman, Martin, Minnoni, Martin, Sarraute, Carlos
Patterns of mobile phone communications, coupled with the information of the social network graph and financial behavior, allow us to make inferences of users' socio-economic attributes such as their income level. We present here several methods to extract features from mobile phone usage (calls and messages), and compare different combinations of supervised machine learning techniques and sets of features used as input for the inference of users' income. Our experimental results show that the Bayesian method based on the communication graph outperforms standard machine learning algorithms using node-based features.
Benchmarking datasets for Anomaly-based Network Intrusion Detection: KDD CUP 99 alternatives
Divekar, Abhishek, Parekh, Meet, Savla, Vaibhav, Mishra, Rudra, Shirole, Mahesh
Abstract--Machine Learning has been steadily gaining traction for its use in Anomaly-based Network Intrusion Detection Systems (A-NIDS). Research into this domain is frequently performed using the KDD CUP 99 dataset as a benchmark. Several studies question its usability while constructing a contemporary NIDS, due to the skewed response distribution, nonstationarity, and failure to incorporate modern attacks. In this paper, we compare the performance for KDD-99 alternatives when trained using classification models commonly found in literature: Neural Network, Support Vector Machine, Decision Tree, Random Forest, Naive Bayes and K-Means. Applying the SMOTE oversampling technique and random undersampling, we create a balanced version of NSL-KDD and prove that skewed target classes in KDD-99 and NSL-KDD hamper the efficacy of classifiers on minority classes (U2R and R2L), leading to possible security risks. We explore UNSW-NB15, a modern substitute to KDD-99 with greater uniformity of pattern distribution. We benchmark this dataset before and after SMOTE oversampling to observe the effect on minority performance. Our results indicate that classifiers trained on UNSW-NB15 match or better the Weighted F1-Score of those trained on NSL-KDD and KDD-99 in the binary case, thus advocating UNSW-NB15 as a modern substitute to these datasets. Network security is an ever-evolving discipline where new types of attacks manifest and must be mitigated on a daily basis.
Learning Temporal Point Processes via Reinforcement Learning
Li, Shuang, Xiao, Shuai, Zhu, Shixiang, Du, Nan, Xie, Yao, Song, Le
Social goods, such as healthcare, smart city, and information networks, often produce ordered event data in continuous time. The generative processes of these event data can be very complex, requiring flexible models to capture their dynamics. Temporal point processes offer an elegant framework for modeling event data without discretizing the time. However, the existing maximum-likelihood-estimation (MLE) learning paradigm requires hand-crafting the intensity function beforehand and cannot directly monitor the goodness-of-fit of the estimated model in the process of training. To alleviate the risk of model-misspecification in MLE, we propose to generate samples from the generative model and monitor the quality of the samples in the process of training until the samples and the real data are indistinguishable. We take inspiration from reinforcement learning (RL) and treat the generation of each event as the action taken by a stochastic policy. We parameterize the policy as a flexible recurrent neural network and gradually improve the policy to mimic the observed event distribution. Since the reward function is unknown in this setting, we uncover an analytic and nonparametric form of the reward function using an inverse reinforcement learning formulation. This new RL framework allows us to derive an efficient policy gradient algorithm for learning flexible point process models, and we show that it performs well in both synthetic and real data.
A Bayesian Perspective of Statistical Machine Learning for Big Data
Sambasivan, Rajiv, Das, Sourish, Sahu, Sujit K
Statistical Machine Learning (SML) refers to a body of algorithms and methods by which computers are allowed to discover important features of input data sets which are often very large in size. The very task of feature discovery from data is essentially the meaning of the keyword `learning' in SML. Theoretical justifications for the effectiveness of the SML algorithms are underpinned by sound principles from different disciplines, such as Computer Science and Statistics. The theoretical underpinnings particularly justified by statistical inference methods are together termed as statistical learning theory. This paper provides a review of SML from a Bayesian decision theoretic point of view -- where we argue that many SML techniques are closely connected to making inference by using the so called Bayesian paradigm. We discuss many important SML techniques such as supervised and unsupervised learning, deep learning, online learning and Gaussian processes especially in the context of very large data sets where these are often employed. We present a dictionary which maps the key concepts of SML from Computer Science and Statistics. We illustrate the SML techniques with three moderately large data sets where we also discuss many practical implementation issues. Thus the review is especially targeted at statisticians and computer scientists who are aspiring to understand and apply SML for moderately large to big data sets.