Skip to main content
ARS Home » Northeast Area » Beltsville, Maryland (BARC) » Beltsville Agricultural Research Center » Animal Parasitic Diseases Laboratory » Research » Publications at this Location » Publication #427201

Research Project: Immune and Genomic Determinants of Host Responses Associated with Porcine Reproductive and Respiratory Syndrome Virus Infections

Location: Animal Parasitic Diseases Laboratory

Title: Verification and comparison of pig, mouse, and human genome similarities: use of manual assembly and analyses

Author
item Dawson, Harry
item Chen, Celine
item Ragonese, Jack
item Smith, Allen
item Lunney, Joan

Submitted to: Research Square
Publication Type: Pre-print Publication
Publication Acceptance Date: 6/11/2025
Publication Date: 6/11/2025
Citation: Dawson, H.D., Chen, C.T., Ragonese, J.S., Smith, A.D., Lunney, J.K. 2025. Verification and comparison of pig, mouse, and human genome similarities: use of manual assembly and analyses. Research Square. https://doi.org/10.21203/rs.3.rs-6856588/v1.
DOI: https://doi.org/10.21203/rs.3.rs-6856588/v1

Interpretive Summary:

Technical Abstract: Background Recently there have been numerous attempts to improve the genome of the pig. Despite these efforts, there is a substantial amount of work remaining to obtain a “finished version” of the genome; analysis of incomplete versions can lead to incorrect biological interpretations. To that end, we manually assembled and annotated a non-redundant, 16,146 RNA and 15,613 pig protein sequence library. We used it to assess the assembly and annotation status of the 3 latest builds of the genome and to the mouse and human genomes. Results Our analysis of 3,333 protein coding genes reveals that the percentage of error-free assembled and annotated genes in NCBI and Ensembl builds 11.1 and MARC build 1.0 are 69.4, 50.1, and 40.0%, respectively. An examination of these errors revealed nine predominant sources that are detailed in the Results. Using our protein library, we determined 1:1 orthology to 16,496 mouse and 15,770 human proteins. 73.8% of these proteins were conserved among the 3 species; however, when a gene was missing from one of the three genomes, pigs were 5.0X more likely to have the human gene than mice. REACTOME, KEGG, GO BP Direct and Ingenuity Pathway Analysis functional enrichment analyses of pig-human orthologous genes revealed 8, 3, 14 and 32 conserved pathways, and 0, 3, 0, and 29 for human-mouse pathways, respectively. Last, we conducted an analysis of functional domain preservation for 3,465 proteins and discovered when a functional domain is missing from a protein in 1 of the 3 species, pigs are 1.5X more likely to have the human domain than mice. Conclusions These data strongly indicate that, overall, swine are a scientifically important intermediate species (rodent-human) for conducting scientific research on human health.