Skip to main content
ARS Home » Northeast Area » Beltsville, Maryland (BARC) » Beltsville Agricultural Research Center » Animal Genomics and Improvement Laboratory » Research » Publications at this Location » Publication #350332

Research Project: Improving Dairy Animals by Increasing Accuracy of Genomic Prediction, Evaluating New Traits, and Redefining Selection Goals

Location: Animal Genomics and Improvement Laboratory

Title: Methods for discovering and validating relationships among genotyped animals

Author
item Wiggans, G - Collaborator
item Vanraden, Paul
item Bacheller, Lillian - Lil

Submitted to: Interbull Annual Meeting Proceedings
Publication Type: Proceedings
Publication Acceptance Date: 2/26/2018
Publication Date: 2/26/2018
Citation: Wiggans, G.R., Van Raden, P.M., Bacheller, L.R. 2018. Methods for discovering and validating relationships among genotyped animals. Interbull Bulletin. 53:27-30.

Interpretive Summary: Genomic selection based on information from thousands of DNA markers has led to the collection of genotypes for more than 2.2 million animals by the Council on Dairy Cattle Breeding in the United States. Genotypes must be assigned to the correct animal, and also can be used to find and correct errors in pedigrees. As of January 2018, sires were validated using genotype information for 97% of the 2.2 million animalsgenotyped, and dams for 39%. A genotype is compared with all other genotypes to detect unknown parent-progeny relationships. If a parent is not confirmed, the grandsire is checked. If a grandsire is unknown or designated as unlikely, possible “true” grandsires are proposed. The comparison of each genotype with all others is a major and increasing consumer of computer resources. Because processing time has continued to increase, ways to reduce the time have been investigated. In 2012, a set of 1,000 SNPs that are present on nearly all chips was selected for preliminary screening. To further speed up processing, a set of 100 SNPs recently was selected based on minor allele frequency, call rate, and Mendelian consistency. Tests with the 100-SNP set showed that excluding cases with more than three opposite homozygotes could eliminate 99.7% of genotypes without eliminating any confirmed parent-progeny relationships. The assignment of correct ancestry is important because it results in faster rates of genetic gain and lower bias.

Technical Abstract: Genomic selection based on single-nucleotide polymorphisms (SNPs) has led to the collection of genotypes for over 2.2 million animals by the Council on Dairy Cattle Breeding in the United States. To assure that a genotype is assigned to the correct animal and that the animal’s pedigree is correct, the pedigree parents are checked. As of January 2018, the sire was validated for 97% and the dam for 39% of the 2.2 million genotyped animals that passed edits. The genotype is compared with all other genotypes to detect unknown parent-progeny relationships or identical genotypes. If a parent is not confirmed, the grandsire is checked. If a grandsire is unknown or designated as unlikely, possible grandsires are proposed. If SNP conflicts for a parent-progeny pair are concentrated on a single chromosome, a chromosomal deletion or other abnormality is considered; 102 such cases have been detected. All comparisons consider the SNPs in common between the genotypes from the current 30 chip types. Comparison of each genotype with all others is a major and increasing consumer of computer resources. Because processing time has continued to increase, ways to reduce the time have been investigated. In 2012, a set of 1,000 SNPs that are present on nearly all chips was selected for preliminary screening. To further speed up processing, a set of 100 SNPs recently was selected based on minor allele frequency, call rate, and Mendelian consistency. Tests with the 100-SNP set showed that excluding cases with more than three opposite homozygotes could eliminate 99.7% of genotypes without eliminating any confirmed parent-progeny relationships. A continuing effort is required to maintain extensive checking and pedigree correction within the time available for processing incoming genotypes and applying updates caused by pedigree changes. In addition to grandsire checking when genotypes are loaded, maternal grandsire and maternal great-grandsire are checked and discovered using haplotypes from the imputation process as part of the genomic evaluation. For dams with unknown sires, the discovered maternal grandsire is assigned as her sire. The genotypes provide a rich source of information for validation and discovery of genetic relationships.