Energy
AI's carbon footprint problem
For all the advances enabled by artificial intelligence, from speech recognition to self-driving cars, AI systems consume a lot of power and can generate high volumes of climate-changing carbon emissions. A study last year found that training an off-the-shelf AI language-processing system produced 1,400 pounds of emissions -- about the amount produced by flying one person roundtrip between New York and San Francisco. The full suite of experiments needed to build and train that AI language system from scratch can generate even more: up to 78,000 pounds, depending on the source of power. But there are ways to make machine learning cleaner and greener, a movement that has been called "Green AI." Some algorithms are less power-hungry than others, for example, and many training sessions can be moved to remote locations that get most of their power from renewable sources.
Information Theory in Density Destructors
Johnson, J. Emmanuel, Laparra, Valero, Camps-Valls, Gustau, Santos-Rodrรญguez, Raul, Malo, Jesรบs
Density destructors are differentiable and invertible transforms that map multivariate PDFs of arbitrary structure (low entropy) into non-structured PDFs (maximum entropy). Multivariate Gaussianization and multivariate equalization are specific examples of this family, which break down the complexity of the original PDF through a set of elementary transforms that progressively remove the structure of the data. We demonstrate how this property of density destructive flows is connected to classical information theory, and how density destructors can be used to get more accurate estimates of information theoretic quantities. Experiments with total correlation and mutual information inmultivariate sets illustrate the ability of density destructors compared to competing methods. These results suggest that information theoretic measures may be an alternative optimization criteria when learning density destructive flows.
Deep Spectral CNN for Laser Induced Breakdown Spectroscopy
Castorena, Juan, Oyen, Diane, Ollila, Ann, Legget, Carey, Lanza, Nina
This work proposes a spectral convolutional neural network (CNN) operating on laser induced breakdown spectroscopy (LIBS) signals to learn to (1) disentangle spectral signals from the sources of sensor uncertainty (i.e., pre-process) and (2) get qualitative and quantitative measures of chemical content of a sample given a spectral signal (i.e., calibrate). Once the spectral CNN is trained, it can accomplish either task through a single feed-forward pass, with real-time benefits and without any additional side information requirements including dark current, system response, temperature and detector-to-target range. Our experiments demonstrate that the proposed method outperforms the existing approaches used by the Mars Science Lab for pre-processing and calibration for remote sensing observations from the Mars rover, 'Curiosity'.
The temporal overfitting problem with applications in wind power curve modeling
Prakash, Abhinav, Tuo, Rui, Ding, Yu
This paper is concerned with a nonparametric regression problem in which the independence assumption of the input variables and the residuals is no longer valid. Using existing model selection methods, like cross validation, the presence of temporal autocorrelation in the input variables and the error terms leads to model overfitting. This phenomenon is referred to as temporal overfitting, which causes loss of performance while predicting responses for a time domain different from the training time domain. We propose a new method to tackle the temporal overfitting problem. Our nonparametric model is partitioned into two parts -- a time-invariant component and a time-varying component, each of which is modeled through a Gaussian process regression. The key in our inference is a thinning-based strategy, an idea borrowed from Markov chain Monte Carlo sampling, to estimate the two components, respectively. Our specific application in this paper targets the power curve modeling in wind energy. In our numerical studies, we compare extensively our proposed method with both existing power curve models and available ideas for handling temporal overfitting. Our approach yields significant improvement in prediction both in and outside the time domain covered by the training data.
Deep learning based numerical approximation algorithms for stochastic partial differential equations and high-dimensional nonlinear filtering problems
Beck, Christian, Becker, Sebastian, Cheridito, Patrick, Jentzen, Arnulf, Neufeld, Ariel
In this article we introduce and study a deep learning based approximation algorithm for solutions of stochastic partial differential equations (SPDEs). In the proposed approximation algorithm we employ a deep neural network for every realization of the driving noise process of the SPDE to approximate the solution process of the SPDE under consideration. We test the performance of the proposed approximation algorithm in the case of stochastic heat equations with additive noise, stochastic heat equations with multiplicative noise, stochastic Black--Scholes equations with multiplicative noise, and Zakai equations from nonlinear filtering. In each of these SPDEs the proposed approximation algorithm produces accurate results with short run times in up to 50 space dimensions.
Residuals-based distributionally robust optimization with covariate information
Kannan, Rohit, Bayraksan, Gรผzin, Luedtke, James R.
We consider data-driven approaches that integrate a machine learning prediction model within distributionally robust optimization (DRO) given limited joint observations of uncertain parameters and covariates. Our framework is flexible in the sense that it can accommodate a variety of learning setups and DRO ambiguity sets. We investigate the asymptotic and finite sample properties of solutions obtained using Wasserstein, sample robust optimization, and phi-divergence-based ambiguity sets within our DRO formulations, and explore cross-validation approaches for sizing these ambiguity sets. Through numerical experiments, we validate our theoretical results, study the effectiveness of our approaches for sizing ambiguity sets, and illustrate the benefits of our DRO formulations in the limited data regime even when the prediction model is misspecified.
Welcome
AI can benefit society in many ways but, given the energy needed to support the computing behind AI, these benefits can come at a high environmental price. CodeCarbon is a lightweight software package that seamlessly integrates into your Python codebase. It estimates the amount of carbon dioxide (CO2) produced by the cloud or personal computing resources used to execute the code.
Tantech Subsidiary Launches Newest Driverless and Autonomous Street Sweeper
Tantech Holdings Ltd (NASDAQ: TANH) ("Tantech" or the "Company"), a clean energy company in China, today announced the launch by its subsidiary, Shangchi Automobile Co., Ltd. The Shangchi SC-100A follows the launch last month of the SC-120A model featuring unmanned, automatic sweeping. Shangchi Automobile's innovative driverless and autonomous street sweepers are designed for quieter operation and improved cleaning performance, with the ability to reduce or eliminate the 7 to 8 humans required for typical sweeper vehicle operation. Lidar-based, machine vision technology provides long-distance detection and obstacle identification, with sensors for short-distance obstacle detection and avoidance. This enables the driverless model to safely and accurately operate in common environments.
Artificial Intelligence Will Revolutionize Energy, Earning Billions For Investors
As the world is anticipating the end of the COVID-19 pandemic, energy consumption in industry and services is likely to grow. In the longer term, the developing world will increase its energy utilization, leading to growth of global primary energy demand by of 0.4% - 0.6% per year, or a 25% increase by 2050. According to scenarios calculated by energy giant Total SE, massive electrification of transportation will lead to decarbonization, and will require a rapid growth in renewables as a source of electricity. This energy transformation will see an explosion of growth in Artificial Intelligence (AI) utilization in the sector โ up 50% between 2020 and 2024 โ to allow smart, 21st century grids to become the gold standard, gradually replacing the "dumb" grids laid down in the late 19th โ early 20th century in Europe, North America, Japan, China and beyond. The grid is a meta-system of generation facilities, be it nuclear, gas, coal, solar, wind, and hydro, connected by high voltage wire networks to transformers, and then to sub-stations and individual buildings, households, and apartments.
Reinforcement Learning to Reduce Building Energy Consumption
The need for Energy Savings has become increasily foundamental to fight Climate Change. We have been working on a cloud-based RL algorithm that can retrofit existing HVAC controls to obtain substantial results. In the last decade, a new class of controls which relies on Artificial Intelligence have been proposed. In particular, we are going to highlight data-driven controls based on Reinforcement Learning (RL), since they showed from the very beginning promising results as HVAC controls [2]. There are two main ways to upgrade with RL the air conditioning systems: to implement RL on new systems or to retrofit the existing ones.