Genomics and Gene Discovery Site Logo
ARS Home About Us Helptop nav spacerContact Us En Espanoltop nav spacer
Printable VersionPrintable Version     E-mail this pageE-mail this page
Agricultural Research Service United States Department of Agriculture
Search
  Advanced Search
 
Programs and Projects
Subjects of Investigation
 

Research Project: AN INTEGRATED DATABASE AND BIOINFORMATICS RESOURCE FOR SMALL GRAINS

Location: Genomics and Gene Discovery

Title: Rapid genome mapping in nano channel array for highly complete and accurate de novo sequence assembly of the complex Aegilops tauschii genome

Authors
item Hastie, Alex -
item Dong, Lingli -
item Luo, Mingcheng -
item Huo, Naxin -
item Gu, Yong
item Xiao, Ming -

Submitted to: PLoS One
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: January 3, 2013
Publication Date: February 6, 2013
Citation: Hastie, A., Dong, L., Luo, M., Huo, N., Gu, Y.Q., Xiao, M. 2013. Rapid genome mapping in nano channel array for highly complete and accurate de novo sequence assembly of the complex Aegilops tauschii genome. PLoS One. 8:e55864.

Interpretive Summary: Because of the extremely large size and polyploid nature of the wheat genome, sequencing and accurate assembly to generate a gold standard reference sequence for the wheat genome still represents a great challenge. This has hindered the rapid development of genomics research in wheat for crop improvement. In this work, we developed a novel approach to sequence the wheat genomic regions by using a combination of high-throughput Roche 454 sequencing and nanomapping of large insert bacterial artificial chromosome (BAC) clones. The Roche 454 provides sufficient coverage for sequence assembly, while the nanomapping information guides the assembly by ordering sequence contigs to form a linear sequence. We demonstrated that the accuracy of assembled sequences increased dramatically using this novel approach. The technology will be very useful for the international wheat genome sequencing community with the aim to generate a completely assembled wheat genome.

Technical Abstract: Next-generation sequencing (NGS) technologies have enabled high-throughput and low-cost generation of sequence data; however, de novo genome assembly remains a great challenge, particularly for large genomes. NGS short reads are often insufficient to create large contigs that span repeat sequences and to facilitate unambiguous assembly. Plant genomes are notorious for containing high levels of repetitive elements, which combined with huge genome sizes, makes accurate assembly of these large and complex genomes intractable thus far. Using two-color genome mapping of tiling bacterial artificial chromosomes (BAC) clones on nanochannel arrays, we completed high-confidence assembly of a 2.1-Mb, highly repetitive region in the large and complex genome of Aegilops tauschii, the D-genome donor of hexaploid wheat (Triticum aestivum). Genome mapping is based on direct visualization of sequence motifs on single DNA molecules hundreds of kilobases in size, and thus, it avoids most of the pitfalls of sequence-based assembly. With the genome map as a scaffold, we anchored unplaced sequence contigs, validated the initial draft assembly, and resolved instances of misassembly, some involving contigs <2 kb long, to dramatically improved the assembly from 72% to 98% complete.

   

 
Project Team
Anderson, Olin
Gu, Yong
Lazo, Gerard
Matthews, David
 
Publications
   Publications
 
Related National Programs
  Plant Genetic Resources, Genomics and Genetic Improvement (301)
 
Related Projects
   The North American Collaborative Oat Research Enterprise (CORE)
   Improving Barley and Wheat Germplasm for Changing Environments
 
 
Last Modified: 05/23/2013
ARS Home | USDA.gov | Site Map | Policies and Links 
FOIA | Accessibility Statement | Privacy Policy | Nondiscrimination Statement | Information Quality | USA.gov | White House