Energy
Block-Sparse Recurrent Neural Networks
Narang, Sharan, Undersander, Eric, Diamos, Gregory
Recurrent Neural Networks (RNNs) are used in state-of-the-art models in domains such as speech recognition, machine translation, and language modelling. Sparsity is a technique to reduce compute and memory requirements of deep learning models. Sparse RNNs are easier to deploy on devices and high-end server processors. Even though sparse operations need less compute and memory relative to their dense counterparts, the speed-up observed by using sparse operations is less than expected on different hardware platforms. In order to address this issue, we investigate two different approaches to induce block sparsity in RNNs: pruning blocks of weights in a layer and using group lasso regularization to create blocks of weights with zeros. Using these techniques, we demonstrate that we can create block-sparse RNNs with sparsity ranging from 80% to 90% with small loss in accuracy. This allows us to reduce the model size by roughly 10x. Additionally, we can prune a larger dense network to recover this loss in accuracy while maintaining high block sparsity and reducing the overall parameter count. Our technique works with a variety of block sizes up to 32x32. Block-sparse RNNs eliminate overheads related to data storage and irregular memory accesses while increasing hardware efficiency compared to unstructured sparsity.
Deep Fault Analysis and Subset Selection in Solar Power Grids
Bhattacharya, Biswarup, Sinha, Abhishek
Non-availability of reliable and sustainable electric power is a major problem in the developing world. Renewable energy sources like solar are not very lucrative in the current stage due to various uncertainties like weather, storage, land use among others. There also exists various other issues like mis-commitment of power, absence of intelligent fault analysis, congestion, etc. In this paper, we propose a novel deep learning-based system for predicting faults and selecting power generators optimally so as to reduce costs and ensure higher reliability in solar power systems. The results are highly encouraging and they suggest that the approaches proposed in this paper have the potential to be applied successfully in the developing world.
Online Tool Condition Monitoring Based on Parsimonious Ensemble+
Pratama, Mahardhika, Dimla, Eric, Lughofer, Edwin, Pedrycz, Witold, Tjahjowidowo, Tegoeh
Accurate diagnosis of tool wear in metal turning process remains an open challenge for both scientists and industrial practitioners because of inhomogeneities in workpiece material, nonstationary machining settings to suit production requirements, and nonlinear relations between measured variables and tool wear. Common methodologies for tool condition monitoring still rely on batch approaches which cannot cope with a fast sampling rate of metal cutting process. Furthermore they require a retraining process to be completed from scratch when dealing with a new set of machining parameters. This paper presents an online tool condition monitoring approach based on Parsimonious Ensemble+, pENsemble+. The unique feature of pENsemble+ lies in its highly flexible principle where both ensemble structure and base-classifier structure can automatically grow and shrink on the fly based on the characteristics of data streams. Moreover, the online feature selection scenario is integrated to actively sample relevant input attributes. The paper presents advancement of a newly developed ensemble learning algorithm, pENsemble+, where online active learning scenario is incorporated to reduce operator labelling effort. The ensemble merging scenario is proposed which allows reduction of ensemble complexity while retaining its diversity. Experimental studies utilising real-world manufacturing data streams and comparisons with well known algorithms were carried out. Furthermore, the efficacy of pENsemble was examined using benchmark concept drift data streams. It has been found that pENsemble+ incurs low structural complexity and results in a significant reduction of operator labelling effort.
Estimating Cosmological Parameters from the Dark Matter Distribution
Ravanbakhsh, Siamak, Oliva, Junier, Fromenteau, Sebastien, Price, Layne C., Ho, Shirley, Schneider, Jeff, Poczos, Barnabas
A grand challenge of the 21st century cosmology is to accurately estimate the cosmological parameters of our Universe. A major approach to estimating the cosmological parameters is to use the large-scale matter distribution of the Universe. Galaxy surveys provide the means to map out cosmic large-scale structure in three dimensions. Information about galaxy locations is typically summarized in a "single" function of scale, such as the galaxy correlation function or power-spectrum. We show that it is possible to estimate these cosmological parameters directly from the distribution of matter. This paper presents the application of deep 3D convolutional networks to volumetric representation of dark-matter simulations as well as the results obtained using a recently proposed distribution regression framework, showing that machine learning techniques are comparable to, and can sometimes outperform, maximum-likelihood point estimates using "cosmological models". This opens the way to estimating the parameters of our Universe with higher accuracy.
Copyright Law Makes Artificial Intelligence Bias Worse
Last week, Motherboard discovered that one of Google's machine learning algorithms was biased against certain racial and religious groups, as well as LGBT people. The Cloud Natural Language API analyzes paragraphs of text and then determines whether they have a positive or negative "sentiment." The algorithm rated statements like "I'm a homosexual," and "I'm a gay black woman," as negative. After we ran our story, Google apologized. The incident marks the latest in a series in which artificial intelligence algorithms have been found to be biased.
Artificial Intelligence and the future of energy โ WePower โ Medium
With the rise of cloud computing and the ever-decreasing costs associated with computations, now and in the future this technology will be more and more widely available. One of the most process heavy steps in AI systems is model training and validation. Being able to pay per minute or even second for the use of computing power removes the need for large upfront investment and data centre maintenance costs. With Google Cloud, IBM Bluemix and Amazon Cloud the power to perform highly complex computations is readily available for everyone today [11]. The systems architecture for machine learning which underpins artificial intelligence is also seamlessly provided by cloud solutions.
Inference Emerges As Next AI Challenge
As developers flock to artificial intelligence frameworks in response to the explosion of intelligence machines, training deep learning models has emerged as a priority along with synching them to a growing list of neural and other network designs. All are being aligned to confront some of the next big AI challenges, including training deep learning models to make inferences from the fire hose of unstructured data. These and other AI developer challenges were highlighted during this week's Nvidia GPU technology conference in Washington. The GPU leader uses the events to bolster its contention that GPUs--some with up to 5,000 cores--are filling the computing gap created by the decline of Moore's Law. The other driving force behind the "era of AI" is the emergence of algorithm-driven deep learning that is forcing developers to move beyond mere coding to apply AI to a growing range of automated processes and predictive analytics.
Can Blockchain And AI Accelerate The Arrival Of The IoT Economy?
Advancement in AI and Bolckchain may create a new IOT economy. The collective network of connected devices known as the Internet-of-Things (IoT) is growing. Gartner estimates that there will be 8.4B IoT devices by 2020. The bulk of this growth is expected to come from consumer devices as more consumers acquire smart devices and appliances. Businesses are also expected to ramp up adoption to improve workplace productivity and maximize industrial applications.
Generalized Probabilistic Bisection for Stochastic Root-Finding
Rodriguez, Sergio, Ludkovski, Michael
We consider numerical schemes for root finding of noisy responses through generalizing the Probabilistic Bisection Algorithm (PBA) to the more practical context where the sampling distribution is unknown and location-dependent. As in standard PBA, we rely on a knowledge state for the approximate posterior of the root location. To implement the corresponding Bayesian updating, we also carry out inference of oracle accuracy, namely learning the probability of correct response. To this end we utilize batched querying in combination with a variety of frequentist and Bayesian estimators based on majority vote, as well as the underlying functional responses, if available. For guiding sampling selection we investigate both Information Directed sampling, as well as Quantile sampling. Our numerical experiments show that these strategies perform quite differently; in particular we demonstrate the efficiency of randomized quantile sampling which is reminiscent of Thompson sampling. Our work is motivated by the root-finding sub-routine in pricing of Bermudan financial derivatives, illustrated in the last section of the paper.
Revolutionization of Artificial Intelligence DataScience.US
In this digital age, AI has shaped the way we are living, it is seeping its way into our lives, affecting how we live, work and entertain ourselves. From voice-powered personal assistants like Siri and Alexa to more underlying and fundamental technologies such as behavioral algorithms, suggestive searches and autonomously-powered self-driving vehicles boasting powerful predictive capabilities, there are several examples and applications of artificial intelligence in use today. Artificial Intelligence implementation can greatly enhance productivity in all aspect of life, but it can also be disruptive if not properly managed. Today, Artificial Intelligence technology allows non-living things like robots & computers to think for themselves to an extent. Artificial Intelligence and Man Artificial Neural Systems (ANS) A neural network is an electronic model of the brain consisting of many interconnected simple processors.