Location: Crop Germplasm ResearchTitle: BAC-pool 454-sequencing: A rapid and efficient approach to sequence complex tetraploid cotton genomes Author
Submitted to: Plant and Animal Genome Conference Proceedings
Publication Type: Abstract Only
Publication Acceptance Date: 12/1/2008
Publication Date: 1/10/2009
Citation: Buyyarapu, R., Kantety, R., Xu, Z., Yu, J., Kohel, R.J., Percy, J.T., Macmil, S., Wiley, G., Roe, B., Sharma, G. 2009. BAC-pool 454-sequencing: A rapid and efficient approach to sequence complex tetraploid cotton genomes [abstract]. In: Proceedings of the Plant and Animal Genome Conference, January 10-14, 2009, San Diego, California. 2009 CDROM. Interpretive Summary:
Technical Abstract: New and emerging next generation sequencing technologies have been promising in reducing sequencing costs, but not significantly for complex polyploid plant genomes such as cotton. Large and highly repetitive genome of G. hirsutum (~2.5GB) is less amenable and cost-intensive with traditional BAC-by-BAC sequencing approach. Therefore, our objective was to sequence large genomic segments from cotton genome using a novel BAC-pool sequencing approach using 454-pyrosequencing technology and to test our ability to i) assemble the sequences, ii) determine the approximate number of genome equivalents required to thoroughly cover the genome, iii) identify any coding regions in the assembled contigs, and iv) characterize the repetitive regions. Two BAC contigs, #465 (chromosome 26) and #3301 (chromosome 12) from homeologous cotton chromosomes were selected for sequencing. Three to four BAC clones were pooled and sequenced using the GS20 FLX instrument. Contig-465 was sequenced up to '20.45X' coverage from 12 BAC clones with 4 clones per each pool. Similarly, ~767 KB contig 3301 was sequenced up to '23.11X' coverage from 7 BAC clones with 4 and 3 clones under each pool. Approximately 50MB of raw sequence data was assembled into 1.01 MB with several contigs among the five pools using Newbler and Phrap assembly programs. Sequencing data was further analyzed to verify the presence of BAC-end regions, molecular markers, ESTs, transposons, retrotransposons, coding regions, and matches with other related genomes. Pooled BAC sequencing using 454 technology helps in producing the de novo sequencing and assembling of large genomic segments rapidly and economically.