Learning Bayesian Networks with Heterogeneous Agronomic Data Sets via Mixed-Effect Models and Hierarchical Clustering
Abstract: Maize, a crucial crop globally cultivated across vast regions, especially in sub-Saharan Africa, Asia, and Latin America, occupies 197 million hectares as of 2021. Various statistical and machine learning models, including mixed-effect models, random coefficients models, random forests, and deep learning architectures, have been devised to predict maize yield. These models consider factors such as genotype, environment, genotype-environment interaction, and field management. However, the existing models often fall short of fully exploiting the complex network of causal relationships among these factors and the hierarchical structure inherent in agronomic data. This study introduces an innovative approach integrating random effects into Bayesian networks (BNs), leveraging their capacity to model causal and probabilistic relationships through directed acyclic graphs. Rooted in the linear mixed-effects models framework and tailored for hierarchical data, this novel approach demonstrates enhanced BN learning. Application to a real-world agronomic trial produces a model with improved interpretability, unveiling new causal connections. Notably, the proposed method significantly reduces the error rate in maize yield prediction from 28% to 17%. These results advocate for the preference of BNs in constructing practical decision support tools for hierarchical agronomic data, facilitating causal inference.
- World Agriculture: Towards 2030/2050. ESA Working Paper No. 12–03; FAO: Rome, Italy.
- Prediction of Maize Grain Yield before Maturity Using Improved Temporal Height Estimates of Unmanned Aerial Systems. The Plant Phenome Journal 2, 1–15. doi:10.2135/tppj2019.02.0004.
- Hierarchical Estimation of Parameters in Bayesian Networks. Computational Statistics & Data Analysis 137, 67–91. doi:10.1016/j.csda.2019.02.004.
- Fitting Linear Mixed-Effects Models using lme4. Journal of Statistical Software 67, 1–48. doi:10.18637/jss.v067.i01.
- The Impact of Agricultural Landscape Diversification on U.S. Crop Production. Agriculture, Ecosystems & Environment 285, 106615. doi:10.1016/j.agee.2019.106615.
- Weather During Key Growth Stages Explains Grain Quality and Yield of Maize. Agronomy 9, 16. doi:10.3390/agronomy9010016.
- Urban Risks due to Climate Change in the Andean Municipality of Pasto, Colombia: A Bayesian Network Approach. Risk Analysis 43, 2017–2032. doi:10.1111/risa.14086.
- Characterizing Canopy Height with UAS Structure-From-Motion Photogrammetry—Results Analysis of a Maize Field Trial with Respect to Multiple Factors. Remote Sensing Letters 9, 753–762. doi:10.1080/2150704X.2018.1475771.
- Deep Neural Networks with Transfer Learning in Millet Crop Images. Computers in Industry 108, 115–120. doi:10.1016/j.compind.2019.02.003.
- Bayesian Network Learning with Cutting Planes, in: Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence, pp. 153–160.
- Modeling and reasoning with Bayesian networks. Cambridge University Press.
- Comparing Predictive Accuracy. Journal of Business & Economic Statistics 20, 134–144.
- Introduction to Graphical Modelling. 2nd ed., Springer.
- The State of Food Security and Nutrition in the World. URL: https://policycommons.net/artifacts/1850109/the-state-of-food-security-and-nutrition-in-the-world-2021/2596732/. FAO: Food and Agriculture Organization of the United Nations.
- FAOSTAT, 2019. Food Balance Sheets. URL: http://www.fao.org/faostat/en/data/FBS.
- Bayesian Data Analysis. 3rd ed., CRC Press.
- Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press.
- Learning Big Gaussian Bayesian Networks: Partition, Estimation and Fusion. The Journal of Machine Learning Research 21, 1–31.
- Measurement and Calibration of Plant-Height from Fixed-Wing UAV Images. Sensors 18, 4092. doi:10.3390/s18124092.
- Learning Bayesian Networks: a Unification for Discrete and Gaussian Domains, in: UAI, pp. 274–284.
- Update of the nlme Package to Allow a Fixed Standard Deviation of the Residual Error. The R Journal 9, 239–251. doi:10.32614/RJ-2017-010.
- Structural Equation Modeling of Cover Crop Effects on Soil Nitrogen and Dry Bean. Agronomy Journal 109, 2781–2788. doi:10.2134/agronj2016.12.0712.
- Prediction of Maize Grain Yield before Maturity Using Improved Temporal Height Estimates of Unmanned Aerial Systems. The Plant Phenome Journal 2, 190004. doi:10.2135/tppj2019.02.0004.
- Irrigation Water Fitness Assessment Based on Bayesian Network and FAO Guidelines. Irrigation and Drainage 71, 665–675. doi:10.1002/ird.2676.
- Statistical Modelling of Crop Yield in Central Europe Using Climate Data and Remote Sensing Vegetation Indices. Agricultural and Forest Meteorology 260–261, 300–320. doi:10.1016/j.agrformet.2018.06.009.
- Combining Randomized Field Experiments with Observational Satellite Data to Assess the Benefits of Crop Rotations on Yields. Environmental Research Letters 17, 044066. doi:10.1088/1748-9326/ac6083.
- Probabilistic Graphical Models: Principles and Techniques. MIT Press.
- Crop Yield Prediction in India Based on Mayfly Optimization Empowered Attention-Bi-Directional Long Short-Term Memory (LSTM). Multimedia Tools and Applications Online first, 1–28. doi:10.1007/s11042-023-16807-7.
- Maize Yield Estimation in West Africa from Crop Process-Induced Combinations of Multi-Domain Remote Sensing Indices. European Journal of Agronomy 108, 11–26. doi:10.1016/j.eja.2019.04.007.
- Toward Building a Transparent Statistical Model for Improving Crop Yield Prediction: Modeling Rainfed Corn in the U.S. Field Crops Research 234, 55–65. doi:10.1016/j.fcr.2019.02.005.
- A Hierarchical Interannual Wheat Yield and Grain Protein Prediction Model Using Spectral Vegetative Indices and Meteorological Data. Field Crops Research 248, 107711. doi:10.1016/j.fcr.2019.107711.
- Disease Risk Forecasting with Bayesian Learning Networks: Application to Grape Powdery Mildew (Erysiphe necator) in Vineyards. Agronomy 10, 622. doi:10.3390/agronomy10050622.
- Genetic Correlation among Various Quantitative Characters in Maize (Zea mays L.) Hybrids. Journal of Agriculture & Social Sciences 1, 262–265.
- Genome-Wide Analysis of Yield in Europe: Allelic Effects as Functions of Drought and Heat Scenarios. Plant Physiology 172, 749–764. doi:10.1104/pp.16.00621.
- Genomic Prediction of Maize Yield Across European Environmental Conditions. Nature Genetics 51, 952–956. doi:10.1038/s41588-019-0414-y.
- A Multi-Site Experiment in a Network of European Fields for Assessing the Maize Yield Response to Environmental Scenarios. doi:10.15454/IASSTN.
- Evaluating Machine Learning Algorithms for Predicting Maize Yield Under Conservation Agriculture in Eastern and Southern Africa. SN Applied Sciences 2, 952. doi:10.1007/s42452-020-2711-6.
- Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion? Journal of Classification 31, 274–295. doi:10.1007/s00357-014-9161-z.
- Genome-Wide Association Studies of Grain Yield and Quality Traits under Optimum and Low-Nitrogen Stress in Tropical Maize (Zea mays L.). Theoretical and Applied Genetics 135, 4351–4370. doi:10.1007/s00122-022-04224-7.
- High Temperatures around Flowering in Maize: Effects on Photosynthesis and Grain Yield in Three Genotypes. Crop Science 56, 2702–2712. doi:10.2135/cropsci2015.12.0755.
- Transfer Learning for Bayesian Discovery of Multiple Bayesian Networks. Knowledge and Information Systems 43, 1–28. doi:10.1007/s10115-014-0775-6.
- A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering 22, 1345–1359. doi:10.1109/TKDE.2009.191.
- A New Class of Resolvable Incomplete Block Designs. Biometrika 63, 83–92. doi:10.1093/biomet/63.1.83.
- Transfer Learning for Multi-Crop Leaf Disease Image Classification using Convolutional Neural Network VGG. Artificial Intelligence in Agriculture 6, 23–33. doi:10.1016/j.aiia.2021.12.002.
- Causality: Models, Reasoning and Inference. 2nd ed., Cambridge University Press.
- Incident Analysis and Prediction Using Clustering and Bayesian Network, in: 2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI, pp. 1–8. doi:10.1109/UIC-ATC.2017.8397587.
- Pew Research Center, 2019. World’s Population is Projected to Nearly Stop Growing by the End of the Century. URL: https://www.pewresearch.org/short-reads/2019/06/17/.
- Mixed-Effects Models in S and S-PLUS. Springer.
- Temporal Estimates of Crop Growth in Sorghum and Maize Breeding Enabled by Unmanned Aerial Systems. The Plant Phenome Journal 1, 1–10. doi:10.2135/tppj2017.08.0006.
- On the application of multilevel modeling in environmental and ecological studies. Ecology 91, 355–361. doi:10.1890/09-1043.1.
- Multipartition Clustering of Mixed Data with Bayesian Networks. International Journal of Intelligent Systems 37, 2188–2218. doi:10.1002/int.22770.
- Untangling Genotype x Management Interactions in Multi-Environment On-Farm Experimentation. Field Crops Research 255, 107900. doi:10.1016/j.fcr.2020.107900.
- Bayesian Computing with INLA: A Review. Annual Review of Statistics and Its Application 4, 395–421. doi:10.1146/annurev-statistics-060116-054045.
- Artificial Intelligence: A Modern Approach. 3rd ed., Prentice Hall.
- Forecasting Maize Yield at Field Scale Based on High-Resolution Satellite Imagery. Biosystems Engineering 171, 179–192. doi:10.1016/j.biosystemseng.2018.04.020.
- Estimating the Dimension of a Model. The Annals of Statistics 6, 461–464.
- Learning Bayesian Networks with the bnlearn R Package. Journal of Statistical Software 35, 1–22.
- Who Learns Better Bayesian Network Structures: Accuracy and Speed of Structure Learning Algorithms. International Journal of Approximate Reasoning 115, 235–253.
- Using Mixed-Effects Models to Learn Bayesian Networks from Related Data Sets. Proceedings of Machine Learning Research 186, 73–84.
- Impact of Climate Extreme Events and Their Causality on Maize Yield in South Africa. Scientific Reports 13, 12462. doi:10.1038/s41598-023-38921-0.
- Bayesian Approaches to Clinical Trials and Health-Care Evaluation. Wiley.
- Causation, Prediction, and Search. MIT Press.
- Genomic Prediction and Association Mapping of Maize Grain Yield in Multi-Environment Trials Based on Reaction Norm Models. Frontiers in Genetics 14, 1221751.
- Thermal Stresses in Maize: Effects and Management Strategies. Plants 10, 293. doi:10.3390/plants10020293.
- Operational Adjustment Modeling Approach Based on Bayesian Network Transfer Learning for New Flotation Process Under Scarce Data. Journal of Process Control 128, 103000. doi:10.1016/j.jprocont.2023.103000.
- The Optimal Phenological Phase of Maize for Yield Prediction with High-Frequency UAV Remote Sensing. Remote Sensing 14, 1559. doi:10.3390/rs14071559.
- Estimation of Corn Yield Based on Hyperspectral Imagery and Convolutional Neural Network. Computers and Electronics in Agriculture 184, 106092. doi:10.1016/j.compag.2021.106092.
- In-Season Prediction of Corn Yield Using Plant Height under Major Production Systems. Agronomy Journal 103, 923–929. doi:10.2134/agronj2010.0450.
- Integrating Satellite-Derived Climatic and Vegetation Indices to Predict Smallholder Maize Yield Using Deep Learning. Agricultural and Forest Meteorology 311, 108666. doi:10.1016/j.agrformet.2021.108666.
- A Regional Maize Yield Hierarchical Linear Model Combining Landsat 8 Vegetative Indices and Meteorological Data: Case Study in Jilin Province. Remote Sensing 13, 356. doi:10.3390/rs13030356.
- Best Linear Unbiased Predictions of Environmental Effects on Grain Yield in Maize Variety Trials of Different Maturity Groups. Agronomy 12, 922. doi:10.3390/agronomy12040922.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.