Goto

Collaborating Authors

 Rule-Based Reasoning


Machine learning offers shortcut to optimal HVAC operation

#artificialintelligence

Control mechanisms for heating, ventilation and air conditioning in buildings follow set parameters to make conditions in a building more comfortable, but what they save on time can reduce efficiency and increase energy costs, according to Gregory Pavlak, assistant professor of architectural engineering. More sophisticated control models, known as model predictive controllers, can optimize multiple variables to save on energy, operating costs and carbon emissions but can require much more time to find solutions. Penn State researchers developed a method that leverages machine learning to create controls that balance building energy cost, comfort and efficiency while computing at a fast pace. They published their findings in Energy in February. "Detailed model predictive controllers may not be able to compute solutions fast enough for real-time operations in some buildings," Pavlak said.


NetRCA: An Effective Network Fault Cause Localization Algorithm

arXiv.org Machine Learning

Localizing the root cause of network faults is crucial to network operation and maintenance. However, due to the complicated network architectures and wireless environments, as well as limited labeled data, accurately localizing the true root cause is challenging. In this paper, we propose a novel algorithm named NetRCA to deal with this problem. Firstly, we extract effective derived features from the original raw data by considering temporal, directional, attribution, and interaction characteristics. Secondly, we adopt multivariate time series similarity and label propagation to generate new training data from both labeled and unlabeled data to overcome the lack of labeled samples. Thirdly, we design an ensemble model which combines XGBoost, rule set learning, attribution model, and graph algorithm, to fully utilize all data information and enhance performance. Finally, experiments and analysis are conducted on the real-world dataset from ICASSP 2022 AIOps Challenge to demonstrate the superiority and effectiveness of our approach.


Making use of supercomputers in financial machine learning

arXiv.org Machine Learning

This article is the result of a collaboration between Fujitsu and Advestis. This collaboration aims at refactoring and running an algorithm based on systematic exploration producing investment recommendations on a high-performance computer of the Fugaku type [11], to see whether a very high number of cores could allow for a deeper exploration of the data compared to a cloud machine, hopefully resulting in better predictions. We found that an increase in the number of explored rules results in a net increase in the predictive performance of the final ruleset. Also, in the particular case of this study, we found that using more than around 40 cores does not bring a significant computation time gain. However, the origin of this limitation is explained by a threshold-based search heuristic used to prune the search space. We have evidence that for similar data sets with less restrictive thresholds, the number of cores actually used could very well be much higher, allowing parallelization to have a much greater effect.


Rule-based Evolutionary Bayesian Learning

arXiv.org Machine Learning

In our previous work, we introduced the rule-based Bayesian Regression, a methodology that leverages two concepts: (i) Bayesian inference, for the general framework and uncertainty quantification and (ii) rule-based systems for the incorporation of expert knowledge and intuition. The resulting method creates a penalty equivalent to a common Bayesian prior, but it also includes information that typically would not be available within a standard Bayesian context. In this work, we extend the aforementioned methodology with grammatical evolution, a symbolic genetic programming technique that we utilise for automating the rules' derivation. Our motivation is that grammatical evolution can potentially detect patterns from the data with valuable information, equivalent to that of expert knowledge. We illustrate the use of the rule-based Evolutionary Bayesian learning technique by applying it to synthetic as well as real data, and examine the results in terms of point predictions and associated uncertainty.


Council Post: Comparing Legacy Rules-Based Cybersecurity Platforms And AI-Based Platforms

#artificialintelligence

For example, this could include highly intensive data-crunching operations and pattern extraction. Intense analyst work of sifting through the massive amounts of data can be reduced by applying a generative model that learns patterns from the data and reduces the false positives associated with rule-based and supervised machine learning approaches.


Netflix tests its TikTok-like comedy feed on TVs

Engadget

You didn't think Netflix would leave its TikTok-style comedy feed on phones, did you? Sure enough, the company is launching a test that brings the Fast Laughs feature to TVs. Opt in and you'll get a flurry of hopefully funny clips from Netflix shows, movies and (of course) comedy specials. Find something you enjoy and you can watch the whole affair or add it to your watch list. The addition is "slowly" deploying to subscribers in English-speaking countries including the US, Canada, UK, Ireland, Australia and New Zealand.


Fast Interpretable Greedy-Tree Sums (FIGS)

arXiv.org Machine Learning

Modern machine learning has achieved impressive prediction performance, but often sacrifices interpretability, a critical consideration in many problems. Here, we propose Fast Interpretable Greedy-Tree Sums (FIGS), an algorithm for fitting concise rule-based models. Specifically, FIGS generalizes the CART algorithm to simultaneously grow a flexible number of trees in a summation. The total number of splits across all the trees can be restricted by a pre-specified threshold, thereby keeping both the size and number of its trees under control. When both are small, the fitted tree-sum can be easily visualized and written out by hand, making it highly interpretable. A partially oracle theoretical result hints at the potential for FIGS to overcome a key weakness of single-tree models by disentangling additive components of generative additive models, thereby reducing redundancy from repeated splits on the same feature. Furthermore, given oracle access to optimal tree structures, we obtain L2 generalization bounds for such generative models in the case of C1 component functions, matching known minimax rates in some cases. Extensive experiments across a wide array of real-world datasets show that FIGS achieves state-of-the-art prediction performance (among all popular rule-based methods) when restricted to just a few splits (e.g. less than 20). We find empirically that FIGS is able to avoid repeated splits, and often provides more concise decision rules than fitted decision trees, without sacrificing predictive performance. All code and models are released in a full-fledged package on Github \url{https://github.com/csinva/imodels}.


LIMREF: Local Interpretable Model Agnostic Rule-based Explanations for Forecasting, with an Application to Electricity Smart Meter Data

arXiv.org Artificial Intelligence

Accurate electricity demand forecasts play a crucial role in sustainable power systems. To enable better decision-making especially for demand flexibility of the end-user, it is necessary to provide not only accurate but also understandable and actionable forecasts. To provide accurate forecasts Global Forecasting Models (GFM) trained across time series have shown superior results in many demand forecasting competitions and real-world applications recently, compared with univariate forecasting approaches. We aim to fill the gap between the accuracy and the interpretability in global forecasting approaches. In order to explain the global model forecasts, we propose Local Interpretable Model-agnostic Rule-based Explanations for Forecasting (LIMREF), a local explainer framework that produces k-optimal impact rules for a particular forecast, considering the global forecasting model as a black-box model, in a model-agnostic way. It provides different types of rules that explain the forecast of the global model and the counterfactual rules, which provide actionable insights for potential changes to obtain different outputs for given instances. We conduct experiments using a large-scale electricity demand dataset with exogenous features such as temperature and calendar effects. Here, we evaluate the quality of the explanations produced by the LIMREF framework in terms of both qualitative and quantitative aspects such as accuracy, fidelity, and comprehensibility and benchmark those against other local explainers.


Measurably Stronger Explanation Reliability via Model Canonization

arXiv.org Artificial Intelligence

While rule-based attribution methods have proven useful for providing local explanations for Deep Neural Networks, explaining modern and more varied network architectures yields new challenges in generating trustworthy explanations, since the established rule sets might not be sufficient or applicable to novel network structures. As an elegant solution to the above issue, network canonization has recently been introduced. This procedure leverages the implementation-dependency of rule-based attributions and restructures a model into a functionally identical equivalent of alternative design to which established attribution rules can be applied. However, the idea of canonization and its usefulness have so far only been explored qualitatively. In this work, we quantitatively verify the beneficial effects of network canonization to rule-based attributions on VGG-16 and ResNet18 models with BatchNorm layers and thus extend the current best practices for obtaining reliable neural network explanations.


The Definition of Machine Learning

#artificialintelligence

A clear example of a scenario in which the manual process would fail is in the detection of faces in images. Today, a phone can detect a face in an image. But face detection was an unsolved problem in 2001. The main problem is the way pixels (which build an image on a computer) are perceived by the computer is very different from how humans perceive faces. This difference in representation makes it virtually impossible for a human to create a good set of rules.