Skip to main content
ARS Home » Northeast Area » Beltsville, Maryland (BARC) » Beltsville Agricultural Research Center » Animal Genomics and Improvement Laboratory » Research » Publications at this Location » Publication #427356

Research Project: Improving Dairy Cow Feed Efficiency and Environmental Sustainability Using Genomics and Novel Technologies to Identify Physiological Contributions and Adaptations

Location: Animal Genomics and Improvement Laboratory

Title: Identifying data anomalies in milk component measurements from partial-day milking records

Author
item WU, XIAO-LIN - Council On Dairy Cattle Breeding
item CAPUTO, MALIA - Council On Dairy Cattle Breeding
item DONATONE, CHIP - Council On Dairy Cattle Breeding
item Miles, Asha
item Baldwin, Ransom
item SIEVERT, STEVEN - Collaborator
item MATTISON, JAY - Collaborator
item COLE, JOHN - Council On Dairy Cattle Breeding
item BURCHARD, JAVIER - Council On Dairy Cattle Breeding
item DURR, JOAO - Council On Dairy Cattle Breeding

Submitted to: Journal of Dairy Science Communications
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: 11/14/2025
Publication Date: 12/13/2025
Citation: Wu, X., Caputo, M.J., Donatone, C., Miles, A.M., Baldwin, R.L., Sievert, S., Mattison, J., Cole, J.B., Burchard, J., Durr, J. 2025. Identifying data anomalies in milk component measurements from partial-day milking records. Journal of Dairy Science Communications. https://doi.org/10.3168/jdsc.2025-0825.
DOI: https://doi.org/10.3168/jdsc.2025-0825

Interpretive Summary: The quality of milk is based on the primary constituents (water, fat protein, lactose, minerals and other minor components. The collection and evaluation of milk component records and their correlation with quality are crucial for accurate genetic evaluation and effective daily herd management to produce high-quality milk. However, potential errors in the measurement or assessments of the individual constituents can compromise the validity of downstream management and genetic selection decisions. To evaluate the quality and consistency of records from a single milking from herds across different farms, we have used a statistical method called intraclass evaluation to identify farms where there are concerns about the quality milk component records. Beyond the herd, we also need to know the quality of records for each cow in the herd. In this study we determined that a new statistical metric, individual-level intraclass correlations (I-ICC) can be used. Using I-ICC was superior to other methods for identifying anomalies. This research will enhance the ability of dairy producers to identify suspect milk samples and improve milk component records at the individual cow level allowing dairy producers to make accurate choices in breeding and management strategies on farm.

Technical Abstract: High-quality milk and milk component data are crucial for accurate genetic evaluations and effective daily herd management. However, potential errors can compromise the validity of downstream decisions. In a recent study, we proposed using intraclass correlation (ICC) as a herd-level metric to assess the consistency of milk components from single milkings, thereby effectively identifying farms with potential data quality concerns. Yet a key challenge remains: can we effectively identify potentially erroneous records at the cow-day level? In the present study, we introduce a novel metric – individual-level intraclass correlations (I-ICC) – to assess data consistency at the cow-day level, and we evaluate its performance in comparison with three commonly used methods. We also introduce a two-step approach to estimate percentile thresholds for flagging outliers. The results demonstrate the superior performance of this new metric over the conventional univariate and multivariate methods in identifying anomalies in correlated partial daily milk component data. The Impact of data shuffling was also examined. Together, these methods provide robust and practical tools for detecting suspect milk component records at the individual cow-day level.