Skip to main content
ARS Home » Northeast Area » Beltsville, Maryland (BARC) » Beltsville Agricultural Research Center » Animal Genomics and Improvement Laboratory » Research » Publications at this Location » Publication #323683

Title: Progress toward a low budget reference grade genome assembly

Author
item Rosen, Benjamin - Ben
item Bickhart, Derek
item KOREN, SERGEY - National Institutes Of Health (NIH)
item HASTIE, ALEX - Bionano Genomics, Inc
item Schroeder, Steven - Steve
item Smith, Timothy - Tim
item SULLIVAN, SHAWN - US Department Of Agriculture (USDA)
item LIACHKO, IVAN - University Of Washington
item BURTON, JOSHUA - University Of Washington
item DUNHAM, MAITREYA - University Of Washington
item SENDURE, JAY - University Of Washington
item SAYRE, BRIAN - Virginia State University
item HUSON, HEATHER - Cornell University
item Liu, Ge - George
item Connor, Erin
item SONSTEGARD, TAD - Former ARS Employee
item PHILLIPPY, ADAM - National Institutes Of Health (NIH)
item Van Tassell, Curtis - Curt

Submitted to: Plant and Animal Genome Conference Proceedings
Publication Type: Abstract Only
Publication Acceptance Date: 1/8/2016
Publication Date: 1/8/2016
Publication URL: http://handle.nal.usda.gov/10113/62392
Citation: Rosen, B.D., Bickhart, D.M., Koren, S., Hastie, A.R., Schroeder, S.G., Smith, T.P., Sullivan, S.T., Liachko, I., Burton, J.N., Dunham, M.J., Sendure, J., Sayre, B.L., Huson, H.J., Liu, G., Connor, E.E., Sonstegard, T.S., Phillippy, A., Van Tassell, C.P. 2016. Progress toward a low budget reference grade genome assembly. Plant and Animal Genome Conference Proceedings. San Diego, CA, Jan. 9–13, P0612.

Interpretive Summary:

Technical Abstract: Reference quality de novo genome assemblies were once solely the domain of large, well-funded genome projects. While next-generation short read technology removed some of the cost barriers, accurate chromosome-scale assembly remains a real challenge. Here we present efforts to de novo assemble the goat (Capra hircus) genome. Through the combination of single-molecule technologies from Pacific Biosciences (sequencing) and BioNano Genomics (optical mapping) coupled with high-throughput chromosome conformation capture sequencing (Hi-C), an inbred San Clemente goat genome has been sequenced and assembled to a high degree of completeness at a relatively modest cost. Starting with 38 million PacBio reads, we integrated the MinHash Alignment Process (MHAP) with the Celera Assembler (CA) to produce an assembly comprised of 3110 contigs with a contig N50 size of 4.7 Mb. This assembly was scaffolded with BioNano genome maps derived from a single IrysChip into 333 scaffolds with an N50 of 23.1 Mb including the complete scaffolding of chromosome 20. Next, cis-chromosome associations were determined by Hi-C, yielding complete reconstruction of all autosomes into single scaffolds with a final N50 of 91.7 Mb. Finally, we hope to demonstrate that our methods are not only cost effective, but improve our ability to annotate challenging genomic regions such as highly repetitive immune gene clusters.