Goto

Collaborating Authors

 Energy


Reinforcement Learning with Probabilistically Complete Exploration

arXiv.org Machine Learning

Balancing exploration and exploitation remains a key challenge in reinforcement learning (RL). State-of-the-art RL algorithms suffer from high sample complexity, particularly in the sparse reward case, where they can do no better than to explore in all directions until the first positive rewards are found. To mitigate this, we propose Rapidly Randomly-exploring Reinforcement Learning (R3L). We formulate exploration as a search problem and leverage widely-used planning algorithms such as Rapidly-exploring Random Tree (RRT) to find initial solutions. These solutions are used as demonstrations to initialize a policy, then refined by a generic RL algorithm, leading to faster and more stable convergence. We provide theoretical guarantees of R3L exploration finding successful solutions, as well as bounds for its sampling complexity. We experimentally demonstrate the method outperforms classic and intrinsic exploration techniques, requiring only a fraction of exploration samples and achieving better asymptotic performance.


CNN-based InSAR Denoising and Coherence Metric

arXiv.org Machine Learning

-- Interferometric Synthetic Aperture Radar (InSAR) imagery for estimating ground movement, based on micro waves reflected off ground targets is gaining increasing importance in remote sensing. However, noise corrupts microwave reflections received at satellite and contaminates the signal's wrapped phase. We introduce Convolutional Neural Networks (CNNs) to thi s problem domain and show the effectiveness of autoencoder CNN architectures to learn InSAR image denoising filters in the absence of clean ground truth images, and for artefact reduction in estimated coherence through intelligent preprocessing of training data. We compare our results with four established methods to illustrate superiority of proposed method . Remote sensing using activate microwave, especially in t he form of Synthetic Aperture Radar Interferometry (InSAR), has been extensively used in decades .


Why digital transformation is now on the CEOโ€™s shoulders

#artificialintelligence

When science and technology meet social and economic systems, you tend to see something akin to what the late Stephen Jay Gould called "punctuated equilibrium" in his description of evolutionary biology. Something that has been stable for a long period is suddenly disrupted radically--and then settles into a new equilibrium.1 1.See Stephen Jay Gould, Punctuated Equilibrium, Cambridge, MA: Harvard University Press, 2007. Gould pointed out that fossil records show that species change does not advance gradually but often massively and disruptively. After the mass extinctions that have occurred several times across evolutionary eras, a minority of species survived and the voids in the ecosystem rapidly filled with massive speciation. Gould's theory addresses the discontinuity in fossil records that puzzled Charles Darwin.


Technology firms vie for billions in data-analytics contracts

#artificialintelligence

SOMEBODY LESS driven than Tom Siebel would have long since thrown in the towel. In 2006 the entrepreneur, then 53 years old, sold his first firm, Siebel Systems, which made computer programs to track customer relations, to Oracle, a giant of business software. That left him a billionaire--but a restless one. In 2009, a few months after Mr Siebel had launched a new startup, he was trampled by an elephant while on safari in Tanzania. When, a dozen surgeries later, he could work again, the enterprise almost went bankrupt.


Intelligent Road Inspection with Advanced Machine Learning; Hybrid Prediction Models for Smart Mobility and Transportation Maintenance Systems

arXiv.org Machine Learning

School of the Built Environment, Oxford Brookes University, Oxford OX3 0BP, UK; a. mosavi@brookes.ac.uk Abstract: Prediction models in mobility and transportation maintenance systems have been dramatically improved through using machine learning methods . The traditional road inspecti on systems based on the pavement condition index (PCI) are often associated with the critical safety, energy and cost issues. Alternatively, t he proposed models utilize surface deflection data from falling weight deflectometer (FWD) test s to predict the PC I. Machine learning methods are the single multi - layer perceptron (MLP) and radial basis function (RBF) neural networks as well their hybrids, i.e., L eve nberg - M arquardt (MLP - LM), scaled conjugate gradient (MLP - SCG), imperialist competitive (RBF - ICA), and g enetic algorithms (RBF - GA). Furthermore, the committee machine intelligent systems (CMIS) method was adopted to combine the results and improve the accur acy of the modeling. The results of the analysis have been verified through using four criteria of aver age percent relative error (APRE), average absolute percent relative error (AAPRE), root mean square error (RMSE), and standard error (SD). The CMIS mode l outperforms other models with the promising results of APRE 2.3303, AAPRE 11.6768, RMSE 12.0056, and SD 0.0210. Introduction In road transportation, pavement plays a vital role as th e part of the road that is in direct contact with vehicles . U sers' judgment about the quality of road service is primarily predicated upon pavement conditions. The Maintena nce, Rehabilitation, and Reconstruction (MR&R) program of pavement network is a multidimensional decision - making process that takes into account several consideration s.


A survey on Machine Learning-based Performance Improvement of Wireless Networks: PHY, MAC and Network layer

arXiv.org Machine Learning

This paper provides a systematic and comprehensive survey that reviews the latest research efforts focused on machine learning (ML) based performance improvement of wireless networks, while considering all layers of the protocol stack (PHY, MAC and network). First, the related work and paper contributions are discussed, followed by providing the necessary background on data-driven approaches and machine learning for non-machine learning experts to understand all discussed techniques. Then, a comprehensive review is presented on works employing ML-based approaches to optimize the wireless communication parameters settings to achieve improved network quality-of-service (QoS) and quality-of-experience (QoE). We first categorize these works into: radio analysis, MAC analysis and network prediction approaches, followed by subcategories within each. Finally, open challenges and broader perspectives are discussed.


Up to two billion times acceleration of scientific simulations with deep neural architecture search

arXiv.org Machine Learning

Computer simulations are invaluable tools for scientific discovery. However, accurate simulations are often slow to execute, which limits their applicability to extensive parameter exploration, large-scale data analysis, and uncertainty quantification. A promising route to accelerate simulations by building fast emulators with machine learning requires large training datasets, which can be prohibitively expensive to obtain with slow simulations. Here we present a method based on neural architecture search to build accurate emulators even with a limited number of training data. The method successfully accelerates simulations by up to 2 billion times in 10 scientific cases including astrophysics, climate science, biogeochemistry, high energy density physics, fusion energy, and seismology, using the same super-architecture, algorithm, and hyperparameters. Our approach also inherently provides emulator uncertainty estimation, adding further confidence in their use. We anticipate this work will accelerate research involving expensive simulations, allow more extensive parameters exploration, and enable new, previously unfeasible computational discovery.


FedVision: An Online Visual Object Detection Platform Powered by Federated Learning

arXiv.org Machine Learning

Visual object detection is a computer vision-based artificial intelligence (AI) technique which has many practical applications (e.g., fire hazard monitoring). However, due to privacy concerns and the high cost of transmitting video data, it is highly challenging to build object detection models on centrally stored large training datasets following the current approach. Federated learning (FL) is a promising approach to resolve this challenge. Nevertheless, there currently lacks an easy to use tool to enable computer vision application developers who are not experts in federated learning to conveniently leverage this technology and apply it in their systems. In this paper, we report FedVision - a machine learning engineering platform to support the development of federated learning powered computer vision applications. The platform has been deployed through a collaboration between WeBank and Extreme Vision to help customers develop computer vision-based safety monitoring solutions in smart city applications. Over four months of usage, it has achieved significant efficiency improvement and cost reduction while removing the need to transmit sensitive data for three major corporate customers. To the best of our knowledge, this is the first real application of FL in computer vision-based tasks.


FlexiBO: Cost-Aware Multi-Objective Optimization of Deep Neural Networks

arXiv.org Machine Learning

One of the key challenges in designing machine learning systems is to determine the right balance amongst several objectives, which also oftentimes are incommensurable and conflicting. For example, when designing deep neural networks (DNNs), one often has to trade-off between multiple objectives, such as accuracy, energy consumption, and inference time. Typically, there is no single configuration that performs equally well for all objectives. Consequently, one is interested in identifying Pareto-optimal designs. Although different multi-objective optimization algorithms have been developed to identify Pareto-optimal configurations, state-of-the-art multi-objective optimization methods do not consider the different evaluation costs attending the objectives under consideration. This is particularly important for optimizing DNNs: the cost arising on account of assessing the accuracy of DNNs is orders of magnitude higher than that of measuring the energy consumption of pre-trained DNNs. We propose FlexiBO, a flexible Bayesian optimization method, to address this issue. We formulate a new acquisition function based on the improvement of the Pareto hyper-volume weighted by the measurement cost of each objective. Our acquisition function selects the next sample and objective that provides maximum information gain per unit of cost. We evaluated FlexiBO on 7 state-of-the-art DNNs for object detection, natural language processing, and speech recognition. Our results indicate that, when compared to other state-of-the-art methods across the 7 architectures we tested, the Pareto front obtained using FlexiBO has, on average, a 28.44% higher contribution to the true Pareto front and achieves 25.64% better diversity.


Neighborhood Structure Assisted Non-negative Matrix Factorization and its Application in Unsupervised Point Anomaly Detection

arXiv.org Machine Learning

Dimensionality reduction is considered as an important step for ensuring competitive performance in unsupervised learning such as anomaly detection. Non-negative matrix factorization (NMF) is a popular and widely used method to accomplish this goal. But NMF, together with its recent, enhanced version, like graph regularized NMF or symmetric NMF, do not have the provision to include the neighborhood structure information and, as a result, may fail to provide satisfactory performance in presence of nonlinear manifold structure. To address that shortcoming, we propose to consider and incorporate the neighborhood structural similarity information within the NMF framework by modeling the data through a minimum spanning tree. What motivates our choice is the understanding that in the presence of complicated data structure, a minimum spanning tree can approximate the intrinsic distance between two data points better than a simple Euclidean distance does, and consequently, it constitutes a more reasonable basis for differentiating anomalies from the normal class data. We label the resulting method as the neighborhood structure assisted NMF. By comparing the formulation and properties of the neighborhood structure assisted NMF with other versions of NMF including graph regularized NMF and symmetric NMF, it is apparent that the inclusion of the neighborhood structure information using minimum spanning tree makes a key difference. We further devise both offline and online algorithmic versions of the proposed method. Empirical comparisons using twenty benchmark datasets as well as an industrial dataset extracted from a hydropower plant demonstrate the superiority of the neighborhood structure assisted NMF and support our claim of merit.