Submitted to: Genbank
Publication Type: Germplasm release
Publication Acceptance Date: 6/22/2007
Publication Date: 6/22/2007
Citation: Guo, B., Dang, P.M., Chen, X., Scully, B.T., Liang, X., Yu, J., Holbrook, Jr., C.C. 2007. Batch submission and release of 14,328 EST (expressed sequence tag) nucleotide sequences of developing peanut seeds of R5, R6, and R7 reproduction stages. GenBank, National Center for Biotechnology Information Accession No. ES702769-ES717096. Interpretive Summary:
Technical Abstract: Although peanut is an economically and nutritionally important crop in the U.S., it is virtually unexplored at the genomic level. Peanut has a relatively large genome size (2800 Mb/1C). A database of ESTs, the expressed portion of the peanut genome, will facilitate gene discovery, marker development, and gene functional characterization. We constructed six different cDNA libraries derived from developing seeds at three reproduction stages (R5, R6 and R7), challenged by Aspergillus parasiticus and drought stress, of two cultivated peanut genotypes, Tiftrunner (susceptible to Aspergillus infection with higher aflatoxin contamination and resistant to TSWV) and GT-C20 (susceptible to TSWV and reduced aflatoxin contamination). A total of 24,192 randomly selected cDNA clones were sequenced. After removing vector sequences and trimming, 21,098 high-quality EST sequences were generated. Sequence clustering and assembling resulted in 8,689 unique EST sequences with 1,741 contigs and 6,948 singletons. Functional classification was performed on the basis of MIPS functional catalogue criteria. The unique EST sequences were divided into twenty-two categories. Similarity search against the non-redundant protein database available from NCBI indicated that 84.78% of total ESTs showed significant similarity to known proteins, of which 165 genes were previously identified in peanut crop. This batch submission and release of 14,328 EST sequences to GenBank for public use could contribute considerable information to peanut functional genomics in novel gene discovery and marker development.