Energy
Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge
Gallagher, Ryan J., Reing, Kyle, Kale, David, Steeg, Greg Ver
While generative models such as Latent Dirichlet Allocation (LDA) have proven fruitful in topic modeling, they often require detailed assumptions and careful specification of hyperparameters. Such model complexity issues only compound when trying to generalize generative models to incorporate human input. We introduce Correlation Explanation (CorEx), an alternative approach to topic modeling that does not assume an underlying generative model, and instead learns maximally informative topics through an information-theoretic framework. This framework naturally generalizes to hierarchical and semi-supervised extensions with no additional modeling assumptions. In particular, word-level domain knowledge can be flexibly incorporated within CorEx through anchor words, allowing topic separability and representation to be promoted with minimal human intervention. Across a variety of datasets, metrics, and experiments, we demonstrate that CorEx produces topics that are comparable in quality to those produced by unsupervised and semi-supervised variants of LDA.
Gaussian Process Regression for Arctic Coastal Erosion Forecasting
Kupilik, Matthew, Witmer, Frank, MacLeod, Euan-Angus, Wang, Caixia, Ravens, Tom
Arctic coastal morphology is governed by multiple factors, many of which are affected by climatological changes. As the season length for shorefast ice decreases and temperatures warm permafrost soils, coastlines are more susceptible to erosion from storm waves. Such coastal erosion is a concern, since the majority of the population centers and infrastructure in the Arctic are located near the coasts. Stakeholders and decision makers increasingly need models capable of scenario-based predictions to assess and mitigate the effects of coastal morphology on infrastructure and land use. Our research uses Gaussian process models to forecast Arctic coastal erosion along the Beaufort Sea near Drew Point, AK. Gaussian process regression is a data-driven modeling methodology capable of extracting patterns and trends from data-sparse environments such as remote Arctic coastlines. To train our model, we use annual coastline positions and near-shore summer temperature averages from existing datasets and extend these data by extracting additional coastlines from satellite imagery. We combine our calibrated models with future climate models to generate a range of plausible future erosion scenarios. Our results show that the Gaussian process methodology substantially improves yearly predictions compared to linear and nonlinear least squares methods, and is capable of generating detailed forecasts suitable for use by decision makers.
New-gen technologies make IoT transformational
Over the last few years, many people--myself included--have been touting the Internet of Things (IoT) as a driving force behind digital transformation. But is IoT by itself truly that transformational? Well, I would argue that it is not. IoT focuses mainly on securely connecting devices that generate data. It is a key element of disruption and change, but it needs to partner with other technologies such as artificial intelligence (AI), blockchain and fog computing to create billions--some say trillions--of dollars in value and transform industries.
Scaling deep learning for science
Deep neural networks--a form of artificial intelligence--have demonstrated mastery of tasks once thought uniquely human. Their triumphs have ranged from identifying animals in images, to recognizing human speech, to winning complex strategy games, among other successes. Now, researchers are eager to apply this computational technique--commonly referred to as deep learning--to some of science's most persistent mysteries. But because scientific data often looks much different from the data used for animal photos and speech, developing the right artificial neural network can feel like an impossible guessing game for nonexperts. To expand the benefits of deep learning for science, researchers need new tools to build high-performing neural networks that don't require specialized knowledge.
Holiday Gift Guide 2017
Circuit Breaker Labs recycles old printed circuit boards (complete with components) into bracelets, earrings, necklaces, cuff links, tie bars, and even retractable badge holders. Prices range from $21 to $345 for individual pieces, and gift sets are available. Another electronics-pride jewelry option is provided by Lumen Electronic Jewelry. The company has created a range of solar-powered, LED-equipped necklaces, earrings, and tie clips that can blink long into the night, with charge stored in capacitors rather than bulky batteries (pieces cost between $85 and $225).
How Artificial Intelligence Is Powering Everyday Tasks
To fans of science fiction, artificial intelligence may remind them of robots like C-3PO, the loquacious but harmless golden droid in Star Wars, or Skynet in the Terminator movies, a calculating sentient computer that subjugated mankind. But AI is more than just a machine with human-level intelligence scientists hope they could one day create. It is a set of algorithms and technologies that is already powering many tasks in everyday life. Get our free ebook on how the Soviet Union became Putin's Russia. Chatbots that converse with you in Yahoo, Facebook and other sites use AI.
On reducing the communication cost of the diffusion LMS algorithm
Harrane, Ibrahim El Khalil, Flamary, Rรฉmi, Richard, Cรฉdric
The rise of digital and mobile communications has recently made the world more connected and networked, resulting in an unprecedented volume of data flowing between sources, data centers, or processes. While these data may be processed in a centralized manner, it is often more suitable to consider distributed strategies such as diffusion as they are scalable and can handle large amounts of data by distributing tasks over networked agents. Although it is relatively simple to implement diffusion strategies over a cluster, it appears to be challenging to deploy them in an ad-hoc network with limited energy budget for communication. In this paper, we introduce a diffusion LMS strategy that significantly reduces communication costs without compromising the performance. Then, we analyze the proposed algorithm in the mean and mean-square sense. Next, we conduct numerical experiments to confirm the theoretical findings. Finally, we perform large scale simulations to test the algorithm efficiency in a scenario where energy is limited.
A Multi-Horizon Quantile Recurrent Forecaster
Wen, Ruofeng, Torkkola, Kari, Narayanaswamy, Balakrishnan
We propose a framework for general probabilistic multi-step time series regression. Specifically, we exploit the expressiveness and temporal nature of Recurrent Neural Networks, the nonparametric nature of Quantile Regression and the efficiency of Direct Multi-Horizon Forecasting. A new training scheme for recurrent nets is designed to boost stability and performance. We show that the approach accommodates both temporal and static covariates, learning across multiple related series, shifting seasonality, future planned event spikes and cold-starts in real life large-scale forecasting. The performance of the framework is demonstrated in an application to predict the future demand of items sold on Amazon.com, and in a public probabilistic forecasting competition to predict electricity price and load.
Forest-based methods and ensemble model output statistics for rainfall ensemble forecasting
Taillardat, Maxime, Fougรจres, Anne-Laure, Naveau, Philippe, Mestre, Olivier
Rainfall ensemble forecasts have to be skillful for both low precipitation and extreme events. We present statistical post-processing methods based on Quantile Regression Forests (QRF) and Gradient Forests (GF) with a parametric extension for heavy-tailed distributions. Our goal is to improve ensemble quality for all types of precipitation events, heavy-tailed included, subject to a good overall performance. Our hybrid proposed methods are applied to daily 51-h forecasts of 6-h accumulated precipitation from 2012 to 2015 over France using the M{\'e}t{\'e}o-France ensemble prediction system called PEARP. They provide calibrated pre-dictive distributions and compete favourably with state-of-the-art methods like Analogs method or Ensemble Model Output Statistics. In particular, hybrid forest-based procedures appear to bring an added value to the forecast of heavy rainfall.
Large-scale power grid hierarchical segmentation based on power-flow affinities
Marot, Antoine, Tazi, Sami, Donnot, Benjamin, Panciatici, Patrick
The segmentation of large scale power grids into zones allows a better understanding of its structure, as the control room operators will naturally but manually do for any study. In this paper we provide a new automatic hierarchical method based on the community detection algorithm \textit{Infomap}. Our main contribution is to offer as input a new representation of the power grid, called the security analysis, that represents power flow affinities beyond the connectivity of the grid, a point that will become even more relevant for tomorrow's cyber-physical system. Indeed we already discover few relevant and important clusters that are not connected in the actual grid topology. To better describe and investigate the method, we apply it here on the well-studied IEEE-RTS-96 and IEEE-118. We further applied our method on the large-scale French Power Grid which showed promising results given its puzzling resemblance with the historical RTE regional segmentation.