Author
Rosen, Benjamin - Ben | |
Bickhart, Derek | |
KOREN, SERGEY - National Institutes Of Health (NIH) | |
HASTIE, ALEX - Bionano Genomics, Inc | |
Schroeder, Steven - Steve | |
Smith, Timothy - Tim | |
SULLIVAN, SHAWN - US Department Of Agriculture (USDA) | |
LIACHKO, IVAN - University Of Washington | |
BURTON, JOSHUA - University Of Washington | |
DUNHAM, MAITREYA - University Of Washington | |
SENDURE, JAY - University Of Washington | |
SAYRE, BRIAN - Virginia State University | |
HUSON, HEATHER - Cornell University | |
Liu, Ge - George | |
Connor, Erin | |
SONSTEGARD, TAD - Former ARS Employee | |
PHILLIPPY, ADAM - National Institutes Of Health (NIH) | |
Van Tassell, Curtis - Curt |
Submitted to: Plant and Animal Genome Conference Proceedings
Publication Type: Abstract Only Publication Acceptance Date: 1/8/2016 Publication Date: 1/8/2016 Publication URL: http://handle.nal.usda.gov/10113/62392 Citation: Rosen, B.D., Bickhart, D.M., Koren, S., Hastie, A.R., Schroeder, S.G., Smith, T.P., Sullivan, S.T., Liachko, I., Burton, J.N., Dunham, M.J., Sendure, J., Sayre, B.L., Huson, H.J., Liu, G., Connor, E.E., Sonstegard, T.S., Phillippy, A., Van Tassell, C.P. 2016. Progress toward a low budget reference grade genome assembly. Plant and Animal Genome Conference Proceedings. San Diego, CA, Jan. 9–13, P0612. Interpretive Summary: Technical Abstract: Reference quality de novo genome assemblies were once solely the domain of large, well-funded genome projects. While next-generation short read technology removed some of the cost barriers, accurate chromosome-scale assembly remains a real challenge. Here we present efforts to de novo assemble the goat (Capra hircus) genome. Through the combination of single-molecule technologies from Pacific Biosciences (sequencing) and BioNano Genomics (optical mapping) coupled with high-throughput chromosome conformation capture sequencing (Hi-C), an inbred San Clemente goat genome has been sequenced and assembled to a high degree of completeness at a relatively modest cost. Starting with 38 million PacBio reads, we integrated the MinHash Alignment Process (MHAP) with the Celera Assembler (CA) to produce an assembly comprised of 3110 contigs with a contig N50 size of 4.7 Mb. This assembly was scaffolded with BioNano genome maps derived from a single IrysChip into 333 scaffolds with an N50 of 23.1 Mb including the complete scaffolding of chromosome 20. Next, cis-chromosome associations were determined by Hi-C, yielding complete reconstruction of all autosomes into single scaffolds with a final N50 of 91.7 Mb. Finally, we hope to demonstrate that our methods are not only cost effective, but improve our ability to annotate challenging genomic regions such as highly repetitive immune gene clusters. |