Corani, Giorgio
Forecasting intermittent time series with Gaussian Processes and Tweedie likelihood
Damato, Stefano, Azzimonti, Dario, Corani, Giorgio
We introduce the use of Gaussian Processes (GPs) for the probabilistic forecasting of intermittent time series. The model is trained in a Bayesian framework that accounts for the uncertainty about the latent function and marginalizes it out when making predictions. We couple the latent GP variable with two types of forecast distributions: the negative binomial (NegBinGP) and the Tweedie distribution (TweedieGP). While the negative binomial has already been used in forecasting intermittent time series, this is the first time that a fully parameterized Tweedie density is used for intermittent time series. We properly evaluate the Tweedie density, which is both zero-inflated and heavy-tailed, avoiding simplifying assumptions made in existing models. We test our models on thousands of intermittent count time series. Results show that our models provide consistently better probabilistic forecasts than the competitors. In particular, TweedieGP obtains the best estimates of the highest quantiles, thus showing that it is more flexible than NegBinGP.
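As a rough sketch of the idea of marginalizing the latent GP when producing a count forecast distribution (shown here with the negative binomial; the latent posterior moments, the softplus link and the dispersion value are illustrative assumptions, not the paper's exact specification):

    import numpy as np
    from scipy import stats

    # Hypothetical posterior of the latent GP at one forecast horizon
    # (mean and variance are illustrative; in the paper they come from the fitted GP).
    f_mean, f_var = 0.3, 0.4
    r = 2.0                                               # assumed negative binomial dispersion

    rng = np.random.default_rng(0)
    f = rng.normal(f_mean, np.sqrt(f_var), size=10_000)   # marginalize the latent function
    mu = np.log1p(np.exp(f))                              # softplus link -> positive mean
    p = r / (r + mu)                                      # scipy's (n, p) parameterization
    y = stats.nbinom.rvs(r, p, random_state=1)            # predictive count samples

    print("P(y = 0) =", np.mean(y == 0), " q90 =", np.quantile(y, 0.9))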
Efficient probabilistic reconciliation of forecasts for real-valued and count time series
Zambon, Lorenzo, Azzimonti, Dario, Corani, Giorgio
Hierarchical time series are common in several applied fields. The forecasts for these time series are required to be coherent, that is, to satisfy the constraints given by the hierarchy. The most popular technique to enforce coherence is called reconciliation, which adjusts the base forecasts computed for each time series. However, recent works on probabilistic reconciliation present several limitations. In this paper, we propose a new approach based on conditioning to reconcile any type of forecast distribution. We then introduce a new algorithm, called Bottom-Up Importance Sampling, to efficiently sample from the reconciled distribution. It can be used for any base forecast distribution: discrete, continuous, or in the form of samples, providing a major speedup compared to the current methods. Experiments on several temporal hierarchies show a significant improvement over base probabilistic forecasts.
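A minimal sketch of the conditioning idea behind importance-sampling reconciliation, for a toy hierarchy with two bottom series and their total (the Gaussian base forecasts and the plain importance resampling below are illustrative simplifications, not the actual Bottom-Up Importance Sampling algorithm):

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(0)
    n = 20_000

    # Base forecasts (assumed Gaussian for illustration): two bottom series and their total.
    b1 = rng.normal(10.0, 2.0, n)
    b2 = rng.normal(5.0, 1.0, n)
    total_base = stats.norm(16.0, 2.5)           # base forecast of the aggregated series

    # Condition the bottom samples on the base forecast of the total:
    # weight each joint bottom sample by the density of its implied total.
    w = total_base.pdf(b1 + b2)
    w /= w.sum()
    idx = rng.choice(n, size=n, p=w)             # importance resampling
    b1_rec, b2_rec = b1[idx], b2[idx]

    # Reconciled forecasts are coherent by construction: total = b1 + b2.
    print(np.mean(b1_rec + b2_rec), np.mean(b1_rec), np.mean(b2_rec))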
Probabilistic Reconciliation of Count Time Series
Corani, Giorgio, Azzimonti, Dario, Rubattu, Nicolò
Hierarchical time series are collections of time series linked by summation constraints. For example, the total sales of a product in a country can be divided into regions and then into sub-regions. Forecasts of hierarchical time series should be coherent; for instance, the sum of the forecasts of the different regions should be equal to the forecast for the entire country. Forecasts are incoherent if they do not satisfy such constraints. Reconciliation methods [13, 31] compute coherent forecasts by combining the base forecasts generated independently for each time series, possibly incorporating non-negativity constraints [32]. Reconciled forecasts are generally more accurate than the base forecasts; indeed, forecast reconciliation is related to forecast combination [9, 6]. A special case of reconciliation is given by temporal hierarchies [1], which reconcile base forecasts computed for the same variable at different frequencies (e.g., monthly, quarterly and yearly); they generally improve the forecasts [19] of both smooth and intermittent time series. As for probabilistic reconciliation, [25] proposed a seminal framework which interprets reconciliation as a projection.
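A toy numerical illustration of the coherence constraint via a summing matrix (all numbers are made up):

    import numpy as np

    # Illustrative hierarchy: country = region_1 + region_2.
    # The summing matrix S maps bottom-level forecasts to the full hierarchy.
    S = np.array([[1, 1],    # country
                  [1, 0],    # region 1
                  [0, 1]])   # region 2

    bottom_forecasts = np.array([120.0, 80.0])
    coherent = S @ bottom_forecasts                      # [200, 120, 80]: total equals the sum

    base_forecasts = np.array([190.0, 120.0, 80.0])      # independently generated base forecasts
    print("coherent:", np.isclose(base_forecasts[0], base_forecasts[1:].sum()))  # False -> reconciliation needed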
Automatic Forecasting using Gaussian Processes
Corani, Giorgio, Benavoli, Alessio, Augusto, Joao, Zaffalon, Marco
Automatic forecasting is the task of receiving a time series and returning a forecast for the next time steps without any human intervention. We propose an approach for automatic forecasting based on Gaussian Processes (GPs). So far, the main limits of GPs on this task have been the lack of a criterion for selecting the kernel and the long time required to train different competing kernels. We design a fixed additive kernel, which contains the components needed to model most time series. During training, the unnecessary components are made irrelevant by automatic relevance determination. We assign priors to each hyperparameter; we design the priors by analyzing a separate set of time series through a hierarchical GP. The resulting model performs very well on different types of time series, being competitive with or outperforming state-of-the-art approaches. Thanks to the priors, we reliably estimate the parameters with a single restart; this speedup makes the model efficient to train and suitable for processing a large number of time series.
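A simplified sketch of forecasting with an additive GP kernel, using scikit-learn as a stand-in (the kernel components and hyperparameter values are illustrative; the paper's model additionally relies on automatic relevance determination and hyperparameter priors learned through a hierarchical GP, which are not reproduced here):

    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF, ExpSineSquared, WhiteKernel, ConstantKernel

    # Additive kernel with trend, periodic and noise components.
    kernel = (ConstantKernel() * RBF(length_scale=10.0)
              + ConstantKernel() * ExpSineSquared(length_scale=1.0, periodicity=12.0)
              + WhiteKernel(noise_level=0.1))

    # Synthetic monthly series with trend and yearly seasonality, for illustration only.
    t = np.arange(48, dtype=float).reshape(-1, 1)
    y = 0.1 * t.ravel() + np.sin(2 * np.pi * t.ravel() / 12) \
        + 0.1 * np.random.default_rng(0).normal(size=48)

    gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True).fit(t, y)
    t_new = np.arange(48, 60, dtype=float).reshape(-1, 1)
    mean, std = gp.predict(t_new, return_std=True)       # probabilistic forecast for the next steps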
Structure Learning from Related Data Sets with a Hierarchical Bayesian Score
Azzimonti, Laura, Corani, Giorgio, Scutari, Marco
Score functions in the literature for learning the structure of Bayesian networks assume that data are a homogeneous set of observations, whereas they often comprise different related, but not homogeneous, data sets collected in different ways. In this paper we propose a new Bayesian Dirichlet score, which we call Bayesian Hierarchical Dirichlet (BHD). The proposed score is based on a hierarchical model that pools information across data sets to learn a single encompassing network structure, while taking into account the differences in their probabilistic structures. We derive a closed-form expression for BHD using a variational approximation of the marginal likelihood and we study its performance using simulated data. We find that, when data comprise multiple related data sets, BHD outperforms the Bayesian Dirichlet equivalent uniform (BDeu) score in terms of reconstruction accuracy, as measured by the Structural Hamming Distance, and that it is as accurate as BDeu when data are homogeneous. Moreover, the estimated networks are sparser and therefore more interpretable than those obtained with BDeu, thanks to a lower number of false positive arcs.
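For reference, the standard BDeu marginal likelihood that BHD is compared against, in the usual notation (n variables, q_i parent configurations and r_i states for variable X_i, counts N_ijk with N_ij = sum_k N_ijk, equivalent sample size alpha); BHD replaces the fixed Dirichlet prior with a hierarchical prior pooled across the related data sets, whose closed form is derived in the paper:

    \mathrm{BDeu}(G \mid D) \;=\; \prod_{i=1}^{n} \prod_{j=1}^{q_i}
      \frac{\Gamma\!\left(\alpha/q_i\right)}{\Gamma\!\left(\alpha/q_i + N_{ij}\right)}
      \prod_{k=1}^{r_i}
      \frac{\Gamma\!\left(\alpha/(r_i q_i) + N_{ijk}\right)}{\Gamma\!\left(\alpha/(r_i q_i)\right)}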
Entropy-based Pruning for Learning Bayesian Networks using BIC
de Campos, Cassio P., Scanagatta, Mauro, Corani, Giorgio, Zaffalon, Marco
For decomposable score-based structure learning of Bayesian networks, existing approaches first compute a collection of candidate parent sets for each variable and then optimize over this collection by choosing one parent set for each variable without creating directed cycles while maximizing the total score. We target the task of constructing the collection of candidate parent sets when the score of choice is the Bayesian Information Criterion (BIC). We provide new non-trivial results that can be used to prune the search space of candidate parent sets of each node. We analyze how these new results relate to previous ideas in the literature both theoretically and empirically. We show in experiments with UCI data sets that gains can be significant. Since the new pruning rules are easy to implement and have low computational costs, they can be promptly integrated into all state-of-the-art methods for structure learning of Bayesian networks.
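A minimal sketch of the quantity being optimized, the BIC of one node given a candidate parent set on integer-coded discrete data (the paper's entropy-based pruning rules themselves are not reproduced here; the function and variable names are ours):

    import numpy as np

    def bic_score(data, child, parents):
        """BIC of `child` given a candidate parent set (list of column indices).

        data: (N, n_vars) integer array of category codes."""
        N = data.shape[0]
        r = data[:, child].max() + 1                        # number of child states
        if parents:
            _, j_index = np.unique(data[:, parents], axis=0, return_inverse=True)
            q = j_index.max() + 1                           # number of parent configurations
        else:
            j_index, q = np.zeros(N, dtype=int), 1
        loglik = 0.0
        for j in range(q):
            rows = data[j_index == j, child]
            counts = np.bincount(rows, minlength=r)
            nonzero = counts[counts > 0]
            loglik += np.sum(nonzero * np.log(nonzero / len(rows)))
        penalty = 0.5 * np.log(N) * q * (r - 1)             # number of free parameters
        return loglik - penalty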
Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis
Benavoli, Alessio, Corani, Giorgio, Demsar, Janez, Zaffalon, Marco
The machine learning community has adopted null hypothesis significance testing (NHST) to ensure the statistical validity of results. Many scientific fields, however, have realized the shortcomings of frequentist reasoning, and in the most radical cases have even banned its use in publications. We should do the same: just as we have embraced the Bayesian paradigm in the development of new machine learning methods, so we should also use it in the analysis of our own results. We argue for the abandonment of NHST by exposing its fallacies and, more importantly, by offering better (more sound and useful) alternatives to it.
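As one concrete flavour of the Bayesian alternatives discussed in the tutorial, here is a minimal sketch in the spirit of a Bayesian correlated t-test on the per-fold accuracy differences of two classifiers on a single data set (the numbers, the ROPE width and the test-set-fraction correlation heuristic are illustrative assumptions):

    import numpy as np
    from scipy import stats

    # Per-fold accuracy differences (classifier B minus A) from a 10-fold cross-validation.
    diffs = np.array([0.02, 0.01, 0.03, 0.00, 0.02, 0.01, 0.04, 0.02, 0.01, 0.03])
    n = len(diffs)
    rho = 1 / 10                   # correlation heuristic: fraction of data used as test set
    rope = 0.01                    # region of practical equivalence: |difference| < 1%

    scale = np.sqrt((1 / n + rho / (1 - rho)) * diffs.var(ddof=1))
    posterior = stats.t(df=n - 1, loc=diffs.mean(), scale=scale)

    print("P(B better)   :", 1 - posterior.cdf(rope))
    print("P(equivalent) :", posterior.cdf(rope) - posterior.cdf(-rope))
    print("P(A better)   :", posterior.cdf(-rope))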
Learning Treewidth-Bounded Bayesian Networks with Thousands of Variables
Scanagatta, Mauro, Corani, Giorgio, Campos, Cassio P. de, Zaffalon, Marco
We present a method for learning treewidth-bounded Bayesian networks from data sets containing thousands of variables. Bounding the treewidth of a Bayesian network greatly reduces the complexity of inferences. Yet, being a global property of the graph, the treewidth bound considerably increases the difficulty of the learning process. Our novel algorithm accomplishes this task, scaling both to large domains and to large treewidths, and consistently outperforms the state of the art in experiments with up to thousands of variables.
Statistical comparison of classifiers through Bayesian hierarchical modelling
Corani, Giorgio, Benavoli, Alessio, Demšar, Janez, Mangili, Francesca, Zaffalon, Marco
Usually one compares the accuracy of two competing classifiers via null hypothesis significance tests (NHST). Yet NHST suffers from important shortcomings, which can be overcome by switching to Bayesian hypothesis testing. We propose a Bayesian hierarchical model which jointly analyzes the cross-validation results obtained by two classifiers on multiple data sets. It returns the posterior probability of the accuracies of the two classifiers being practically equivalent or significantly different. A further strength of the hierarchical model is that, by jointly analyzing the results obtained on all data sets, it reduces the estimation error compared to the usual approach of averaging the cross-validation results obtained on each data set.
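A simplified stand-in for the hierarchical idea, written with PyMC (this toy model pools the mean accuracy difference across data sets but, among other simplifications, ignores the correlation induced by overlapping cross-validation folds, which the paper models explicitly; data and priors are illustrative):

    import numpy as np
    import pymc as pm

    rng = np.random.default_rng(0)
    x = rng.normal(0.01, 0.02, size=(20, 10))    # fake data: 20 data sets x 10 CV differences

    with pm.Model():
        delta0 = pm.Normal("delta0", 0.0, 0.1)               # overall mean difference
        sigma0 = pm.HalfNormal("sigma0", 0.1)                 # spread across data sets
        delta_i = pm.Normal("delta_i", delta0, sigma0, shape=x.shape[0])
        sigma_i = pm.HalfNormal("sigma_i", 0.1, shape=x.shape[0])
        pm.Normal("obs", delta_i[:, None], sigma_i[:, None], observed=x)
        trace = pm.sample(1000, tune=1000, progressbar=False)

    delta0_samples = trace.posterior["delta0"].values.ravel()
    rope = 0.01
    print("P(practically equivalent):", np.mean(np.abs(delta0_samples) <= rope))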
Learning Bayesian Networks with Thousands of Variables
Scanagatta, Mauro, Campos, Cassio P. de, Corani, Giorgio, Zaffalon, Marco
We present a method for learning Bayesian networks from data sets containing thousands of variables without the need for structure constraints. Our approach consists of two parts. The first is a novel algorithm that effectively explores the space of possible parent sets of a node; it guides the exploration towards the most promising parent sets on the basis of an approximate score function that is computed in constant time. The second is an improvement of an existing ordering-based algorithm for structure optimization; the new algorithm provably achieves a higher score than the original formulation. Our approach consistently outperforms the state of the art on very large data sets.
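A generic sketch of the ordering-based idea: given a topological ordering, each variable independently picks its best-scoring parent set among its predecessors, so the resulting graph is acyclic by construction (the brute-force candidate enumeration and the local_score function below are placeholders for the paper's scored candidate lists and improved search):

    import itertools

    def best_network_for_ordering(ordering, local_score):
        """Given an ordering (list of variables) and a decomposable score
        local_score(child, parents), pick the best-scoring parent set of each
        variable among its predecessors (here over parent sets of size <= 2)."""
        structure, total = {}, 0.0
        for pos, child in enumerate(ordering):
            predecessors = ordering[:pos]
            candidates = [()] + [ps for k in (1, 2)
                                 for ps in itertools.combinations(predecessors, k)]
            best = max(candidates, key=lambda ps: local_score(child, ps))
            structure[child] = best
            total += local_score(child, best)
        return structure, total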