Goto

Collaborating Authors

 cgbn


Learning Bayesian Networks with Heterogeneous Agronomic Data Sets via Mixed-Effect Models and Hierarchical Clustering

arXiv.org Machine Learning

Maize, a crucial crop globally cultivated across vast regions, especially in sub-Saharan Africa, Asia, and Latin America, occupies 197 million hectares as of 2021. Various statistical and machine learning models, including mixed-effect models, random coefficients models, random forests, and deep learning architectures, have been devised to predict maize yield. These models consider factors such as genotype, environment, genotype-environment interaction, and field management. However, the existing models often fall short of fully exploiting the complex network of causal relationships among these factors and the hierarchical structure inherent in agronomic data. This study introduces an innovative approach integrating random effects into Bayesian networks (BNs), leveraging their capacity to model causal and probabilistic relationships through directed acyclic graphs. Rooted in the linear mixed-effects models framework and tailored for hierarchical data, this novel approach demonstrates enhanced BN learning. Application to a real-world agronomic trial produces a model with improved interpretability, unveiling new causal connections. Notably, the proposed method significantly reduces the error rate in maize yield prediction from 28% to 17%. These results advocate for the preference of BNs in constructing practical decision support tools for hierarchical agronomic data, facilitating causal inference.


Using Mixed-Effects Models to Learn Bayesian Networks from Related Data Sets

arXiv.org Artificial Intelligence

We commonly assume that data are a homogeneous set of observations when learning the structure of Bayesian networks. However, they often comprise different data sets that are related but not homogeneous because they have been collected in different ways or from different populations. In our previous work (Azzimonti, Corani and Scutari, 2021), we proposed a closed-form Bayesian Hierarchical Dirichlet score for discrete data that pools information across related data sets to learn a single encompassing network structure, while taking into account the differences in their probabilistic structures. In this paper, we provide an analogous solution for learning a Bayesian network from continuous data using mixed-effects models to pool information across the related data sets. We study its structural, parametric, predictive and classification accuracy and we show that it outperforms both conditional Gaussian Bayesian networks (that do not perform any pooling) and classical Gaussian Bayesian networks (that disregard the heterogeneous nature of the data). The improvement is marked for low sample sizes and for unbalanced data sets.


Comments on: "Hybrid Semiparametric Bayesian Networks"

arXiv.org Artificial Intelligence

This is an interesting paper that distils structure learning in Bayesian networks (BNs) and kernel methods in a quest to produce more flexible distributional assumptions. Conditional (linear) Gaussian Bayesian networks (CGBNs) have been well explored in the literature for some time, to the point that they now appear in many recent textbooks [1-3]. The authors address one of the key limitations of CGBNS, that they can only capture linear dependencies between the continuous variables they contain, and remove it by replacing (mixtures of) linear regression models with more general kernel densities. Dependencies between discrete variables were already flexible, because the conditional probability tables that parametrise them essentially act as a saturated model[4]. It is not obvious that more flexibility will produce better models for whatever task we have in mind: it can also lead to overfitting, instability and hyperparameter tuning problems.


Identifiability and Consistency of Bayesian Network Structure Learning from Incomplete Data

arXiv.org Machine Learning

Bayesian network (BN) structure learning from complete data has been extensively studied in the literature. However, fewer theoretical results are available for incomplete data, and most are based on the use of the Expectation-Maximisation (EM) algorithm. Balov (2013) proposed an alternative approach called Node-Average Likelihood (NAL) that is competitive with EM but computationally more efficient; and proved its consistency and model identifiability for discrete BNs. In this paper, we give general sufficient conditions for the consistency of NAL; and we prove consistency and identifiability for conditional Gaussian BNs, which include discrete and Gaussian BNs as special cases. Hence NAL has a wider applicability than originally stated in Balov (2013).