AITopics | Energy

Energy

Latent Bayesian melding for integrating individual and population models

Zhong, Mingjun, Goddard, Nigel, Sutton, Charles

Neural Information Processing Systems, Dec-31-2015

In many statistical problems, a more coarse-grained model may be suitable for population-level behaviour, whereas a more detailed model is appropriate for accurate modelling of individual behaviour. This raises the question of how to integrate both types of models. Methods such as posterior regularization follow the idea of generalized moment matching, in that they allow matching expectations between two models, but sometimes both models are most conveniently expressed as latent variable models. We propose latent Bayesian melding, which is motivated by averaging the distributions over population statistics of both the individual-level and the population-level models under a logarithmic opinion pool framework. In a case study on electricity disaggregation, which is a type of single-channel blind source separation problem, we show that latent Bayesian melding leads to significantly more accurate predictions than an approach based solely on generalized moment matching.
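
As a rough illustration of the pooling idea mentioned in the abstract above, the sketch below computes a logarithmic opinion pool of two Gaussian densities, p(x) ∝ p1(x)^w · p2(x)^(1-w). It is not the authors' model: the Gaussian form, the function name `log_opinion_pool_gaussian`, and the weight `w` are assumptions chosen only because the pooled density then has a closed form.

```python
def log_opinion_pool_gaussian(m1, v1, m2, v2, w=0.5):
    """Pool N(m1, v1) and N(m2, v2) as p(x) proportional to p1(x)**w * p2(x)**(1 - w).

    Hypothetical helper for illustration. Returns the (mean, variance) of the
    pooled Gaussian. Variances must be positive and w must lie in [0, 1].
    """
    if v1 <= 0 or v2 <= 0:
        raise ValueError("variances must be positive")
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie in [0, 1]")
    # A weighted product of Gaussians is Gaussian: precisions add with weights.
    precision = w / v1 + (1.0 - w) / v2
    mean = (w * m1 / v1 + (1.0 - w) * m2 / v2) / precision
    return mean, 1.0 / precision


# Equal weights and equal variances: the pooled mean is the midpoint.
mean, var = log_opinion_pool_gaussian(0.0, 1.0, 2.0, 1.0, w=0.5)
print(mean, var)
```

With `w=1.0` the pool returns the first distribution unchanged, and with `w=0.0` the second, so `w` acts as the relative trust placed in each model.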

artificial intelligence, bayesian, machine learning, (15 more...)

Country: Europe > United Kingdom (0.93)

Industry: Energy (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

Large-Scale Bayesian Multi-Label Learning via Topic-Based Label Embeddings

Rai, Piyush, Hu, Changwei, Henao, Ricardo, Carin, Lawrence

Neural Information Processing Systems, Dec-31-2015

We present a scalable Bayesian multi-label learning model based on learning low-dimensional label embeddings. Our model assumes that each label vector is generated as a weighted combination of a set of topics (each topic being a distribution over labels), where the combination weights (i.e., the embeddings) for each label vector are conditioned on the observed feature vector. This construction, coupled with a Bernoulli-Poisson link function for each label of the binary label vector, leads to a model with a computational cost that scales with the number of positive labels in the label matrix. This makes the model particularly appealing for real-world multi-label learning problems where the label matrix is usually massive but highly sparse. Using a data-augmentation strategy leads to full local conjugacy in our model, facilitating simple and very efficient Gibbs sampling, as well as an Expectation Maximization algorithm for inference. Also, predicting the label vector at test time does not require inference for the label embeddings and can be done in closed form. We report results on several benchmark data sets, comparing our model with various state-of-the-art methods.
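
The Bernoulli-Poisson link named in the abstract above can be sketched briefly. Under the usual definition, a binary label is y = 1 when a latent count n ~ Poisson(rate) is at least one, so P(y = 1) = 1 - exp(-rate) and P(y = 0) = exp(-rate). The function names below are hypothetical, and the dense loop is written for clarity only; it does not reproduce the paper's inference procedure.

```python
import math


def bernoulli_poisson_prob(rate):
    """P(y = 1) when y = 1 if a Poisson(rate) count is >= 1, else 0."""
    if rate < 0:
        raise ValueError("rate must be non-negative")
    # -expm1(-rate) equals 1 - exp(-rate) and stays accurate for small rates.
    return -math.expm1(-rate)


def label_vector_loglik(rates, positives):
    """Log-likelihood of one binary label vector (illustrative helper).

    rates:     per-label Poisson rates (must be > 0 for positive labels)
    positives: indices of labels equal to 1; every other label is 0
    """
    positives = set(positives)
    loglik = 0.0
    for j, rate in enumerate(rates):
        if j in positives:
            loglik += math.log(bernoulli_poisson_prob(rate))
        else:
            # log P(y = 0) = -rate: a zero label contributes only its rate.
            loglik -= rate
    return loglik


print(bernoulli_poisson_prob(math.log(2)))
```

Because each zero label contributes only `-rate`, the zero entries enter the likelihood through a plain sum of rates, which is one way to see why sparsity in the label matrix helps.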

artificial intelligence, bayesian inference, machine learning, (15 more...)

Genre:

Research Report (0.67)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Energy > Power Industry (0.46)
Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.90)

Bayesian Optimization with Exponential Convergence

Kawaguchi, Kenji, Kaelbling, Leslie Pack, Lozano-Pérez, Tomás

Neural Information Processing Systems, Dec-31-2015

This paper presents a Bayesian optimization method with exponential convergence that needs neither auxiliary optimization nor delta-cover sampling. Most Bayesian optimization methods require auxiliary optimization: an additional non-convex global optimization problem, which can be time-consuming and hard to implement in practice. Also, the existing Bayesian optimization method with exponential convergence requires access to delta-cover sampling, which was considered to be impractical. Our approach eliminates both requirements and achieves an exponential convergence rate.

algorithm, artificial intelligence, optimization problem, (14 more...)

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Genre: Research Report (0.68)

Industry: Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Distinguishing cause from effect using observational data: methods and benchmarks

Mooij, Joris M., Peters, Jonas, Janzing, Dominik, Zscheischler, Jakob, Schölkopf, Bernhard

arXiv.org Artificial Intelligence, Dec-24-2015

The discovery of causal relationships from purely observational data is a fundamental problem in science. The most elementary form of such a causal discovery problem is to decide whether X causes Y or, alternatively, Y causes X, given joint observations of two variables X, Y. An example is to decide whether altitude causes temperature, or vice versa, given only joint measurements of both variables. Even under the simplifying assumptions of no confounding, no feedback loops, and no selection bias, such bivariate causal discovery problems are challenging. Nevertheless, several approaches for addressing those problems have been proposed in recent years. We review two families of such methods: Additive Noise Methods (ANM) and Information Geometric Causal Inference (IGCI). We present the benchmark CauseEffectPairs that consists of data for 100 different cause-effect pairs selected from 37 datasets from various domains (e.g., meteorology, biology, medicine, engineering, economy, etc.) and motivate our decisions regarding the "ground truth" causal directions of all pairs. We evaluate the performance of several bivariate causal discovery methods on these real-world benchmark data and in addition on artificially simulated data. Our empirical results on real-world data indicate that certain methods are indeed able to distinguish cause from effect using only purely observational data, although more benchmark data would be needed to obtain statistically significant conclusions. One of the best performing methods overall is the additive-noise method originally proposed by Hoyer et al. (2009), which obtains an accuracy of 63 ± 10% and an AUC of 0.74 ± 0.05 on the real-world benchmark. As the main theoretical contribution of this work we prove the consistency of that method.

janzing, upstream oil & gas, vascular disease, (26 more...)

arXiv: 1412.3773

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Netherlands (0.14)
(6 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.67)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance (0.92)
Materials (0.92)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.67)

Information-Theoretic Bounded Rationality

Ortega, Pedro A., Braun, Daniel A., Dyer, Justin, Kim, Kee-Eung, Tishby, Naftali

arXiv.org Machine Learning, Dec-21-2015

Bounded rationality, that is, decision-making and planning under resource limitations, is widely regarded as an important open problem in artificial intelligence, reinforcement learning, computational neuroscience and economics. This paper offers a consolidated presentation of a theory of bounded rationality based on information-theoretic ideas. We provide a conceptual justification for using the free energy functional as the objective function for characterizing bounded-rational decisions. This functional possesses three crucial properties: it controls the size of the solution space; it has Monte Carlo planners that are exact, yet bypass the need for exhaustive search; and it captures model uncertainty arising from lack of evidence or from interacting with other agents having unknown intentions. We discuss the single-step decision-making case, and show how to extend it to sequential decisions using equivalence transformations. This extension yields a very general class of decision problems that encompass classical decision rules (e.g.
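
One common way to write a free energy objective for a bounded-rational decision maker is F[p] = Σ_a p(a) U(a) - (1/β) KL(p ‖ p0), whose maximizer is p(a) ∝ p0(a) exp(β U(a)). The sketch below assumes that form for the single-step case; the function name `bounded_rational_policy` and its parameters are illustrative and are not taken from the paper.

```python
import math


def bounded_rational_policy(utilities, prior, beta):
    """Return p(a) proportional to prior[a] * exp(beta * utilities[a]).

    beta is an inverse temperature: beta = 0 returns the prior (no resources
    spent on deliberation), and large beta approaches the utility maximizer.
    All prior probabilities are assumed to be strictly positive.
    """
    if len(utilities) != len(prior):
        raise ValueError("utilities and prior must have the same length")
    if any(p <= 0 for p in prior):
        raise ValueError("prior probabilities must be strictly positive")
    logits = [math.log(p) + beta * u for u, p in zip(utilities, prior)]
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(logits)
    weights = [math.exp(x - top) for x in logits]
    total = sum(weights)
    return [w / total for w in weights]


# A moderate beta shifts mass toward the best action without ignoring the prior.
print(bounded_rational_policy([1.0, 2.0, 3.0], [0.2, 0.3, 0.5], beta=1.0))
```

Varying `beta` traces out the trade-off between expected utility and the information cost of moving away from the prior.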

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv: 1512.06789

Country:

North America > United States > Massachusetts (0.28)
North America > United States > Pennsylvania (0.28)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.68)
Education (0.48)
Energy (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
(4 more...)

Facility Deployment Decisions through Warp Optimizaton of Regressed Gaussian Processes

Scopatz, Anthony

arXiv.org Machine Learning, Dec-21-2015

Affiliation: University of South Carolina, Department of Mechanical Engineering, Nuclear Engineering Program, Columbia, SC 29201

Keywords: nuclear fuel cycle, Gaussian process, dynamic time warping

Abstract: A method for quickly determining deployment schedules that meet a given fuel cycle demand is presented here. This algorithm is fast enough to perform in situ within low-fidelity fuel cycle simulators. It uses Gaussian process regression models to predict the production curve as a function of time and the number of deployed facilities. Each of these predictions is measured against the demand curve using the dynamic time warping distance. The minimum distance deployment schedule is evaluated in a full fuel cycle simulation, whose generated production curve then informs the model on the next optimization iteration. The method converges within five to ten iterations to a distance that is less than one percent of the total deployable production. A representative once-through fuel cycle is used to demonstrate the methodology for reactor deployment.

I. Introduction: With the recent advent of agent-based nuclear fuel cycle simulators, such as Cyclus [1, 2], there comes the possibility to make in situ, dynamic facility deployment decisions. This would more fully model real-world fuel cycles where institutions (such as utility companies) predict future demand and choose their future deployment schedules appropriately. However, one of the major challenges to making in situ deployment decisions is the speed at which "good enough" decisions can be made. This paper proposes three related deployment-specific optimization algorithms that can be used for any demand curve and facility type. The demands of a fuel cycle scenario can often be simply stated, e.g.

Here, the dynamic time warping (DTW) [3] distance is minimized between the demand curve and the regression of a Gaussian process (GP) model [4] of prior simulations. This minimization produces a guess for a deployment schedule, which is subsequently tested using an actual simulator. This process is repeated until an optimal deployment schedule for the given demand is found. Importantly, by using the Gaussian process surrogates, the number of simulation realizations that must be executed as part of the optimization may be reduced to only a handful. Furthermore, it is at least two orders of magnitude faster to test the model than it is to run a single low-fidelity fuel cycle simulation. Because it is computationally cheap, the method is suitable for use inside a fuel cycle simulation.
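
For readers unfamiliar with the distance used above, here is a minimal sketch of textbook dynamic time warping between two sampled curves. The absolute-difference local cost and the function name `dtw_distance` are assumptions for illustration; the paper's own implementation and cost function may differ.

```python
def dtw_distance(a, b):
    """Classic O(len(a) * len(b)) dynamic time warping distance.

    Uses |a[i] - b[j]| as the local cost and allows the three standard
    moves (advance in a, advance in b, or advance in both).
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    inf = float("inf")
    # cost[i][j] is the best alignment cost of a[:i] against b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b[j-1] is matched again
                cost[i][j - 1],      # b advances, a[i-1] is matched again
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Hypothetical demand and production curves sampled at different rates.
demand = [1.0, 2.0, 3.0]
production = [1.0, 2.0, 2.0, 3.0]
print(dtw_distance(demand, production))
```

In the workflow described above, a distance of this kind would be evaluated between the demand curve and each candidate production curve predicted by the surrogate model, and the candidate with the smallest distance would be passed to the full simulator.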

artificial intelligence, machine learning, modeling & simulation, (17 more...)

arXiv: 1512.06929

Country: North America > United States > South Carolina > Richland County > Columbia (0.44)

Genre: Research Report (0.40)

Industry: Energy > Power Industry > Utilities > Nuclear (0.68)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Asymptotic Behavior of Minimal-Exploration Allocation Policies: Almost Sure, Arbitrarily Slow Growing Regret

Cowan, Wesley, Katehakis, Michael N.

arXiv.org Machine Learning, Dec-17-2015

The purpose of this paper is to provide further insight into the structure of the sequential allocation ("stochastic multi-armed bandit", or MAB) problem by establishing probability one finite horizon bounds and convergence rates for the sample (or "pseudo") regret associated with two simple classes of allocation policies $\pi$. For any slowly increasing function $g$, subject to mild regularity constraints, we construct two policies (the $g$-Forcing, and the $g$-Inflated Sample Mean) that achieve a measure of regret of order $O(g(n))$ almost surely as $n \to \infty$, bounded from above and below. Additionally, almost sure upper and lower bounds on the remainder term are established. In the constructions herein, the function $g$ effectively controls the "exploration" of the classical "exploration/exploitation" tradeoff.

artificial intelligence, bandit, big data, (18 more...)

arXiv: 1505.02865

Country: North America > United States (0.28)

Genre: Research Report (0.40)

Industry: Energy > Oil & Gas (0.36)

Technology:

Information Technology > Artificial Intelligence (0.68)
Information Technology > Data Science > Data Mining > Big Data (0.66)

Bayesian Policy Reuse

Rosman, Benjamin, Hawasly, Majd, Ramamoorthy, Subramanian

arXiv.org Artificial Intelligence, Dec-14-2015

A long-lived autonomous agent should be able to respond online to novel instances of tasks from a familiar domain. Acting online requires 'fast' responses, in terms of rapid convergence, especially when the task instance has a short duration, such as in applications involving interactions with humans. These requirements can be problematic for many established methods for learning to act. In domains where the agent knows that the task instance is drawn from a family of related tasks, albeit without access to the label of any given instance, it can choose to act through a process of policy reuse from a library, rather than policy learning from scratch. In policy reuse, the agent has prior knowledge of the class of tasks in the form of a library of policies that were learnt from sample task instances during an offline training phase. We formalise the problem of policy reuse, and present an algorithm for efficiently responding to a novel task instance by reusing a policy from the library of existing policies, where the choice is based on observed 'signals' which correlate with policy performance. We achieve this by posing the problem as a Bayesian choice problem with a corresponding notion of an optimal response, but the computation of that response is in many cases intractable. Therefore, to reduce the computation cost of the posterior, we follow a Bayesian optimisation approach and define a set of policy selection functions, which balance exploration in the policy library against exploitation of previously tried policies, together with a model of expected performance of the policy library on their corresponding task instances. We validate our method in several simulated domains of interactive, short-duration episodic tasks, showing rapid convergence in unknown task variations.

agent, bayesian inference, upstream oil & gas, (20 more...)

arXiv: 1505.00284

Country: Africa (0.14)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment > Sports > Golf (0.68)
Education (0.67)
Energy > Oil & Gas > Upstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.87)

Optimal strategies for the control of autonomous vehicles in data assimilation

McDougall, Damon, Moore, Richard

arXiv.org Machine Learning, Dec-7-2015

We propose a method to compute optimal control paths for autonomous vehicles deployed for the purpose of inferring a velocity field. In addition to being advected by the flow, the vehicles are able to effect a fixed relative speed with arbitrary control over direction. It is this direction that is used as the basis for the locally optimal control algorithm presented here, with the objective formed from the variance trace of the expected posterior distribution. We present results for linear flows near hyperbolic fixed points.

Keywords: Bayesian inverse problem, Lagrangian data assimilation, Optimal control, Ocean glider

2010 MSC: 49M, 62F, 62L, 93C, 65C

1. Introduction: The need for a more accurate and better resolved estimate of oceanic flows is being driven by a number of pressing global issues, including the crisis facing many species of fish and waterborne organisms, the mitigation of pollutants resulting from spills and offshore contamination, and the important role played by ocean dynamics in climate change. Scientific efforts to estimate ocean flow began in the 1980s with the work of Robinson [1], but have enjoyed limited success due to a lack of observational data. In an effort to improve the current state of understanding of the world's oceans, autonomous vehicles (AVs) are being deployed for the collection of physical oceanography data in a growing number of projects around the globe.

assimilation, glider, optimal control, (16 more...)

arXiv: 1512.02271

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > Ohio (0.04)
(4 more...)

Genre: Research Report (0.50)

Industry:

Energy (0.46)
Transportation > Passenger (0.41)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.90)

Iteratively reweighted adaptive lasso for conditional heteroscedastic time series with applications to AR-ARCH type processes

Ziel, Florian

arXiv.org Machine Learning, Dec-5-2015

Shrinkage algorithms are of great importance in almost every area of statistics due to the increasing impact of big data. Time series analysis in particular benefits from efficient and rapid estimation techniques such as the lasso. However, current lasso-type estimators for autoregressive time series models still focus on models with homoscedastic residuals. Therefore, an iteratively reweighted adaptive lasso algorithm for the estimation of time series models under conditional heteroscedasticity is presented in a high-dimensional setting. The asymptotic behaviour of the resulting estimator is analysed. It is found that the proposed estimation procedure performs substantially better than its homoscedastic counterpart. A special case of the algorithm is suitable for efficiently estimating multivariate AR-ARCH type models. Extensions of the model, such as periodic AR-ARCH, threshold AR-ARCH or ARMA-GARCH, are discussed. Finally, different simulation results and applications to electricity market data and returns of metal prices are shown.

artificial intelligence, assumption, machine learning, (18 more...)

doi: 10.1016/j.csda.2015.11.016

arXiv: 1502.06557

Genre: Research Report (0.40)

Industry:

Energy > Power Industry (0.69)
Materials > Metals & Mining (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
