
United States Department of Agriculture

Agricultural Research Service

Research Project: COTTON GENOMICS AND GENETIC ANALYSIS

Location: Crop Germplasm Research

Title: BAC-pool 454-sequencing: A rapid and efficient approach to sequence complex tetraploid cotton genomes

Authors
Buyyarapu, R.
Kantety, R.
Xu, Z.
Yu, John
Kohel, Russell
Percy, Joel
Macmil, S.
Wiley, G.
Roe, B.
Sharma, G.

Submitted to: Plant and Animal Genome Conference Proceedings
Publication Type: Abstract Only
Publication Acceptance Date: December 1, 2008
Publication Date: January 10, 2009
Citation: Buyyarapu, R., Kantety, R., Xu, Z., Yu, J., Kohel, R.J., Percy, J.T., Macmil, S., Wiley, G., Roe, B., Sharma, G. 2009. BAC-pool 454-sequencing: A rapid and efficient approach to sequence complex tetraploid cotton genomes [abstract]. In: Proceedings of the Plant and Animal Genome Conference, January 10-14, 2009, San Diego, California. 2009 CDROM.

Technical Abstract: New and emerging next-generation sequencing technologies have shown promise in reducing sequencing costs, but not significantly for complex polyploid plant genomes such as cotton. The large and highly repetitive genome of G. hirsutum (~2.5 Gb) is poorly suited to the traditional BAC-by-BAC sequencing approach, which is cost-intensive. Therefore, our objective was to sequence large genomic segments of the cotton genome with a novel BAC-pool approach based on 454 pyrosequencing technology and to test our ability to i) assemble the sequences, ii) determine the approximate number of genome equivalents required to thoroughly cover the genome, iii) identify any coding regions in the assembled contigs, and iv) characterize the repetitive regions. Two BAC contigs from homeologous cotton chromosomes, #465 (chromosome 26) and #3301 (chromosome 12), were selected for sequencing. Three to four BAC clones were pooled and sequenced using the GS20 FLX instrument. Contig 465 was sequenced up to 20.45X coverage from 12 BAC clones, with 4 clones per pool. Similarly, the ~767 kb contig 3301 was sequenced up to 23.11X coverage from 7 BAC clones, in two pools of 4 and 3 clones. Approximately 50 Mb of raw sequence data was assembled into 1.01 Mb of sequence, comprising several contigs across the five pools, using the Newbler and Phrap assembly programs. The sequence data were further analyzed to verify the presence of BAC-end regions, molecular markers, ESTs, transposons, retrotransposons, coding regions, and matches with other related genomes. Pooled BAC sequencing using 454 technology enables rapid and economical de novo sequencing and assembly of large genomic segments.
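
The fold-coverage figures quoted in the abstract relate read volume to target length in the usual way: coverage is approximately the total number of sequenced bases divided by the length of the target. The sketch below is illustrative only and is not the authors' analysis. The function names are hypothetical, and the only inputs taken from the abstract are the ~767 kilobase length of contig 3301 and its reported 23.11X coverage. The per-contig read totals are not given in the abstract, so the read volume computed here is an estimate under that assumption.

```python
def fold_coverage(total_read_bases: float, target_length_bp: float) -> float:
    """Average fold coverage: sequenced bases divided by target length."""
    if target_length_bp <= 0:
        raise ValueError("target length must be positive")
    return total_read_bases / target_length_bp


def bases_required(target_length_bp: float, coverage: float) -> float:
    """Read bases needed to reach a given average fold coverage."""
    if coverage < 0:
        raise ValueError("coverage must be non-negative")
    return target_length_bp * coverage


# Values from the abstract: contig 3301 is ~767 kb and was sequenced
# up to 23.11X. The implied read volume is an estimate, not a reported figure.
CONTIG_3301_BP = 767_000
implied_bases = bases_required(CONTIG_3301_BP, 23.11)

print(f"Implied read volume: {implied_bases / 1e6:.1f} Mb")
print(f"Round trip coverage: {fold_coverage(implied_bases, CONTIG_3301_BP):.2f}X")
```

Under these assumptions, roughly 17.7 Mb of reads would correspond to the reported coverage of contig 3301. The same two functions can be applied to contig 465 once its length is known, which the abstract does not state.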

Last Modified: 9/1/2014