Location: Adaptive Cropping Systems Laboratory
Title: Machine learning-based prediction of cereal rye cover crop biomass across diverse agroecosystemsAuthor
![]() |
GHIMIRE, UTSAB - University Of Florida |
![]() |
MITRA, ALAKANANDA - University Of Nebraska |
![]() |
Fleisher, David |
![]() |
PARK, JOHN - Oak Ridge Institute For Science And Education (ORISE) |
![]() |
Barnaby, Jinyoung |
![]() |
KIM, YONGHYUN - George Washington University |
![]() |
Han, Eun Jin |
|
Submitted to: Agricultural & Environmental Letters
Publication Type: Peer Reviewed Journal Publication Acceptance Date: 12/12/2025 Publication Date: 1/9/2026 Citation: Ghimire, U., Mitra, A., Fleisher, D.H., Park, J., Barnaby, J.Y., Kim, Y., Han, E. 2026. Machine learning-based prediction of cereal rye cover crop biomass across diverse agroecosystems. Agricultural & Environmental Letters. https://doi.org/10.1002/ael2.70055. DOI: https://doi.org/10.1002/ael2.70055 Interpretive Summary: Cover crops like cereal rye provide many benefits to farms, such as improving soil health, reducing erosion, and supporting the next cash crop. To manage these benefits, farmers and advisors need reliable ways to estimate how much cover crop biomass (plant growth) will be produced. In this study, we developed machine learning models that use weather, soil, and basic management information to predict cereal rye growth. We trained these models using data collected from 24 states across the eastern U.S. The models were fairly accurate when using weather data from early spring, and adding later-season weather data sometimes improved the results but also made the models less reliable. We also tested a version of the model that provides a range of possible outcomes instead of a single number, which helps account for uncertainty. Our results show that with just public weather and soil data plus a little management information, it is possible to make useful and practical predictions about cover crop growth. Technical Abstract: Accurate operational predictions of cereal rye biomass are essential for quantifying the agroecosystem services provided by cover crops and for informing growers’ management decisions for subsequent cash crops. In this study, we developed machine learning (ML)-based biomass prediction models using two advanced gradient-boosted tree algorithms (CatBoost and XGBoost). A comprehensive dataset of cereal rye biomass and management information from 24 states in the eastern United States, combined with publicly available soil and weather data, was used to train the models. Models relying on early spring weather inputs achieved moderate predictive skill (R² ˜ 0.74). Incorporating later-season weather data modestly improved mid-season fits but led to overfitting in late-spring models. Extending CatBoost to quantile regression enabled estimation of 10–90% prediction intervals with moderate pinball loss. Overall, our findings demonstrate that public soil and weather data, supplemented with limited management inputs, can support interpretable, uncertainty-aware biomass predictions suitable for cover crop management. |
