Skip to main content
ARS Home » Northeast Area » Ithaca, New York » Robert W. Holley Center for Agriculture & Health » Plant, Soil and Nutrition Research » Research » Publications at this Location » Publication #398855

Research Project: Database Tools for Managing and Analyzing Big Data Sets to Enhance Small Grains Breeding

Location: Plant, Soil and Nutrition Research

Title: Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations

Author
item HU, HAIXIAO - Cornell University
item CAMPBELL, MALACHY - Cornell University
item YEATS, TREVOR - Cornell University
item ZHENG, XUYING - Cornell University
item RUNCIE, DANIEL - University Of California, Davis
item COVARRUBIAS-PAZARAN, GIOVANNY - International Maize & Wheat Improvement Center (CIMMYT)
item BROECKLING, COREY - Colorado State University
item YAO, LINXING - Colorado State University
item CAFFE-TREML, MELANIE - South Dakota State University
item Jannink, Jean-Luc
item GUTIERREZLUCIA - University Of Wisconsin
item SMITH, KEVIN - University Of Minnesota
item TANAKA, JAMES - Cornell University
item HOEKENGA, OWEN - Cayuga Genetics Consulting Group, Llc
item SORRELLS, MARK - Cornell University
item GORE, MICHAEL - Cornell University

Submitted to: Theoretical and Applied Genetics
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 9/5/2021
Publication Date: 10/13/2021
Citation: Hu, H., Campbell, M.T., Yeats, T.H., Zheng, X., Runcie, D.E., Covarrubias-Pazaran, G., Broeckling, C., Yao, L., Caffe-Treml, M., Jannink, J., Gutierrezlucia, Smith, K., Tanaka, J., Hoekenga, O., Sorrells, M.E., Gore, M.A. 2021. Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations. Theoretical and Applied Genetics. 134:4043-4054. https://doi.org/10.1007/s00122-021-03946-4.
DOI: https://doi.org/10.1007/s00122-021-03946-4

Interpretive Summary: Genomic prediction involves using DNA markers across the whole genome to make predictions about the future performance of an individual. It is now also possible to measure the full metabolite profile, the metabolize and the full RNA transcript profile, the transcriptome. These measurements make "multi-omic" prediction possible. Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. We designed a systematic experiment to collect -omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic, metabolomic, and genomic models, as well as their combinations, showed greater prediction accuracy than genomic alone models, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with -omics data outperformed both multi-trait genomic and single-environment multi-omics models. The highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We demonstrated that omics data can be used to prioritize loci from one population to improve genomic prediction in a distantly related population.

Technical Abstract: Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G'+'T, G'+'M, and G'+'T'+'M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.