Energy
Anomaly Detection via Graphical Lasso
Liu, Haitao, Paffenroth, Randy C., Zou, Jian, Zhou, Chong
Anomalies and outliers are common in real-world data, and they can arise from many sources, such as sensor faults. Accordingly, anomaly detection is important both for analyzing the anomalies themselves and for cleaning the data for further analysis of its ambient structure. Nonetheless, a precise definition of anomalies is important for automated detection and herein we approach such problems from the perspective of detecting sparse latent effects embedded in large collections of noisy data. Standard Graphical Lasso-based techniques can identify the conditional dependency structure of a collection of random variables based on their sample covariance matrix. However, classic Graphical Lasso is sensitive to outliers in the sample covariance matrix. In particular, several outliers in a sample covariance matrix can destroy the sparsity of its inverse. Accordingly, we propose a novel optimization problem that is similar in spirit to Robust Principal Component Analysis (RPCA) and splits the sample covariance matrix $M$ into two parts, $M=F+S$, where $F$ is the cleaned sample covariance whose inverse is sparse and computable by Graphical Lasso, and $S$ contains the outliers in $M$. We accomplish this decomposition by adding an additional $ \ell_1$ penalty to classic Graphical Lasso, and name it "Robust Graphical Lasso (Rglasso)". Moreover, we propose an Alternating Direction Method of Multipliers (ADMM) solution to the optimization problem which scales to large numbers of unknowns. We evaluate our algorithm on both real and synthetic datasets, obtaining interpretable results and outperforming the standard robust Minimum Covariance Determinant (MCD) method and Robust Principal Component Analysis (RPCA) regarding both accuracy and speed.
Sample-Efficient Policy Learning based on Completely Behavior Cloning
Zou, Qiming, Wang, Ling, Lu, Ke, Li, Yu
Sample-E fficient Policy Learning based on Completely Behavior Cloning Qiming Zou a,, Ling Wang a,, Ke Lu b,, Y u Li b, a Department of Computer Science and T echnology, Harbin Institute of T echnology, China b Department of Management Science and Engineering, Anhui University of T echnology, ChinaAbstract Direct policy search is one of the most important algorithm of reinforcement learning. However, learning from scratch needs a large amount of experience data and can be easily prone to poor local optima. In addition to that, a partially trained policy tends to perform dangerous action to agent and environment. In order to overcome these challenges, this paper proposed a policy initialization algorithm called Policy Learning based on Completely Behavior Cloning (PLCBC). PLCBC first transforms the Model Predictive Control (MPC) controller into a piecewise a ffine (PW A) function using multi-parametric programming, and uses a neural network to express this function. By this way, PLCBC can completely clone the MPC controller without any performance loss, and is totally training-free. The experiments show that this initialization strategy can help agent learn at the high reward state region, and converge faster and better. Keywords: Deep Reinforcement Learning, Model Predictive Control, Sample E fficiency 1. Introduction Deep reinforcement learning is becoming increasingly popular for tackling challenging sequential decision making problems, and has been shown to be successful in solving a range of di fficult problems, such as games [1, 2], robotic control [3] and locomotion [4, 5]. One particular appealing prospect is to use deep neural network parametrization to minimize the burden for manual policy engineering [6].
Adversarial Uncertainty Quantification in Physics-Informed Neural Networks
We present a deep learning framework for quantifying and propagating uncertainty in systems governed by non-linear differential equations using physics-informed neural networks. Specifically, we employ latent variable models to construct probabilistic representations for the system states, and put forth an adversarial inference procedure for training them on data, while constraining their predictions to satisfy given physical laws expressed by partial differential equations. Such physics-informed constraints provide a regularization mechanism for effectively training deep generative models as surrogates of physical systems in which the cost of data acquisition is high, and training data-sets are typically small. This provides a flexible framework for characterizing uncertainty in the outputs of physical systems due to randomness in their inputs or noise in their observations that entirely bypasses the need for repeatedly sampling expensive experiments or numerical simulators. We demonstrate the effectiveness of our approach through a series of examples involving uncertainty propagation in non-linear conservation laws, and the discovery of constitutive laws for flow through porous media directly from noisy data.
Block Belief Propagation for Parameter Learning in Markov Random Fields
Lu, You, Liu, Zhiyuan, Huang, Bert
Traditional learning methods for training Markov random fields require doing inference over all variables to compute the likelihood gradient. The iteration complexity for those methods therefore scales with the size of the graphical models. In this paper, we propose \emph{block belief propagation learning} (BBPL), which uses block-coordinate updates of approximate marginals to compute approximate gradients, removing the need to compute inference on the entire graphical model. Thus, the iteration complexity of BBPL does not scale with the size of the graphs. We prove that the method converges to the same solution as that obtained by using full inference per iteration, despite these approximations, and we empirically demonstrate its scalability improvements over standard training methods.
Biologically-plausible learning algorithms can scale to large datasets
Xiao, Will, Chen, Honglin, Liao, Qianli, Poggio, Tomaso
The backpropagation (BP) algorithm is often thought to be biologically implausible in the brain. One of the main reasons is that BP requires symmetric weight matrices in the feedforward and feedback pathways. To address this "weight transport problem" (Grossberg, 1987), two more biologically plausible algorithms, proposed by Liao et al. (2016) and Lillicrap et al. (2016), relax BP's weight symmetry requirements and demonstrate comparable learning capabilities to that of BP on small datasets. However, a recent study by Bartunov et al. (2018) evaluate variants of target-propagation (TP) and feedback alignment (FA) on MINIST, CIFAR, and ImageNet datasets, and find that although many of the proposed algorithms perform well on MNIST and CIFAR, they perform significantly worse than BP on ImageNet. Here, we additionally evaluate the sign-symmetry algorithm (Liao et al., 2016), which differs from both BP and FA in that the feedback and feedforward weights share signs but not magnitudes. We examine the performance of sign-symmetry and feedback alignment on ImageNet and MS COCO datasets using different network architectures (ResNet-18 and AlexNet for ImageNet, RetinaNet for MS COCO). Surprisingly, networks trained with sign-symmetry can attain classification performance approaching that of BP-trained networks. These results complement the study by Bartunov et al. (2018), and establish a new benchmark for future biologically plausible learning algorithms on more difficult datasets and more complex architectures. This material is based upon work supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF-1231216.
Manifold Learning of Four-dimensional Scanning Transmission Electron Microscopy
Li, Xin, Dyck, Ondrej E., Oxley, Mark P., Lupini, Andrew R., McInnes, Leland, Healy, John, Jesse, Stephen, Kalinin, Sergei V.
Four-dimensional scanning transmission electron microscopy (4D-STEM) of local atomic diffraction patterns is emerging as a powerful technique for probing intricate details of atomic structure and atomic electric fields. However, efficient processing and interpretation of large volumes of data remain challenging, especially for two-dimensional or light materials because the diffraction signal recorded on the pixelated arrays is weak. Here we employ data-driven manifold leaning approaches for straightforward visualization and exploration analysis of the 4D-STEM datasets, distilling real-space neighboring effects on atomically resolved deflection patterns from single-layer graphene, with single dopant atoms, as recorded on a pixelated detector. These extracted patterns relate to both individual atom sites and sublattice structures, effectively discriminating single dopant anomalies via multi-mode views. We believe manifold learning analysis will accelerate physics discoveries coupled between data-rich imaging mechanisms and materials such as ferroelectric, topological spin and van der Waals heterostructures.
Unveiling Swarm Intelligence with Network Science$-$the Metaphor Explained
Oliveira, Marcos, Pinheiro, Diego, Macedo, Mariana, Bastos-Filho, Carmelo, Menezes, Ronaldo
Self-organization is a natural phenomenon that emerges in systems with a large number of interacting components. Self-organized systems show robustness, scalability, and flexibility, which are essential properties when handling real-world problems. Swarm intelligence seeks to design nature-inspired algorithms with a high degree of self-organization. Yet, we do not know why swarm-based algorithms work well and neither we can compare the different approaches in the literature. The lack of a common framework capable of characterizing these several swarm-based algorithms, transcending their particularities, has led to a stream of publications inspired by different aspects of nature without much regard as to whether they are similar to already existing approaches. We address this gap by introducing a network-based framework$-$the interaction network$-$to examine computational swarm-based systems via the optics of social dynamics. We discuss the social dimension of several swarm classes and provide a case study of the Particle Swarm Optimization. The interaction network enables a better understanding of the plethora of approaches currently available by looking at them from a general perspective focusing on the structure of the social interactions.
New Quantum Machine Learning Algorithm Could Crush Big Data - CTOvision.com
Strategically deployed across the grid, Phasor Measurement Units (PMUs) are sensor devices that measure currents and voltages at a particular time. PMUs generate huge amounts of data that help utility companies monitor the power grid. The storage of these large datasets is already a challenge. Accessing and analyzing these datasets is also an issue that is only getting more complicated over time. In the U.S, the network of PMUs collects a cumulative total of 3 million gigabytes of data every two seconds.
Solar Enablement Initiative in Australia: Report on Efficiently Identifying Critical Cases for Evaluating the Voltage Impact of Large PV Investment
Shafiei, Mehdi, Liu, Aaron, Ledwich, Gerard, Walker, Geoffery, Morosini, Gian-Marco, Terry, Jack
The increasing quantity of PV generation connected to distribution networks is creating challenges in maintaining and controlling voltages in those distribution networks. Determining the maximum hosting capacity for new PV installations based on the historical data is an essential task for distribution networks. Analyzing all historical data in large distribution networks is impractical. Therefore, this paper focuses on how to time efficiently identify the critical cases for evaluating the voltage impacts of the new large PV applications in medium voltage (MV) distribution networks. A systematic approach is proposed to cluster medium voltage nodes based on electrical adjacency and time blocks. MV nodes are clustered along with the voltage magnitudes and time blocks. Critical cases of each cluster can be used for further power flow study. This method is scalable and can time efficiently identify cases for evaluating PV investment on medium voltage networks.
Robust identification of thermal models for in-production High-Performance-Computing clusters with machine learning-based data selection
Pittino, Federico, Diversi, Roberto, Benini, Luca, Bartolini, Andrea
Power and thermal management are critical components of High-Performance-Computing (HPC) systems, due to their high power density and large total power consumption. The assessment of thermal dissipation by means of compact models directly from the thermal response of the final device enables more robust and precise thermal control strategies as well as automated diagnosis. However, when dealing with large scale systems "in production", the accuracy of learned thermal models depends on the dynamics of the power excitation, which depends also on the executed workload, and measurement nonidealities, such as quantization. In this paper we show that, using an advanced system identification algorithm, we are able to generate very accurate thermal models (average error lower than our sensors quantization step of 1{\deg}C) for a large scale HPC system on real workloads for very long time periods. However, we also show that: 1) not all real workloads allow for the identification of a good model; 2) starting from the theory of system identification it is very difficult to evaluate if a trace of data leads to a good estimated model. We then propose and validate a set of techniques based on machine learning and deep learning algorithms for the choice of data traces to be used for model identification. We also show that deep learning techniques are absolutely necessary to correctly choose such traces up to 96% of the times.