AN INTEGRATED DATABASE AND BIOINFORMATICS RESOURCE FOR SMALL GRAINS
Location: Genomics and Gene Discovery
Title: RJPrimers: unique transposable element insertion junction discovery and PCR primer design for marker development
Submitted to: Nucleic Acids Research
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: May 6, 2010
Publication Date: May 30, 2010
Citation: You, F., Wanjugi, H., Huo, N., Lazo, G.R., Luo, M., Anderson, O.D., Dvorak, J., Gu, Y.Q. 2010. RJPrimers: unique transposable element insertion junction discovery and PCR primer design for marker development. Nucleic Acids Research. 38:Suppl:W310-320.
Interpretive Summary: Developing a large number of molecular markers for mapping important traits still represents a great challenge in wheat due to its large and complex genome. In this work, a software tool, RJPrimers, was developed to identify unique repeat junction markers from random genomic sequences that can be easily generated by high-throughput next generation sequencing technologies such as Roche 454. The success rate for marker development with this software tool is over 90% based on the estimation of web lab experiments and in silico test using the sequenced rice genome. The repeat junction markers generated by RJPrimers could have wide applications including mapping, marker-assisted breeding, and genotyping. The repeat junction marker system can be easily transferred to other important crops with large and complex genomes. A web-based server and a command line based pipeline for RJPrimers have been implemented to meet different requirements and are available for public access at http://wheat.pw.usda.gov/demos/RJPrimers/.
Transposable elements (TE) exist in the genomes of nearly all eukaryotes. TE mobilization through “cut-and-paste” or “copy-and-paste” mechanisms causes their insertions into other repetitive sequences, gene loci, and other DNA. An insertion of a TE produces a junction consisting of the TE-end sequence on one side and the sequence of the insertion site on the other side. Most of these junctions are unique in a genome. Because of rapid amplification of TEs and their fast turnover, the junctions are also randomly distributed along chromosomes. TE-based junctions are therefore useful for genome-wide marker development in large and complex genomes such as wheat. Several TE-based marker systems have been developed and applied to genetic diversity assays, and to genetic and physical mapping. No web-based software tool is currently available for repeat junction discovery and fully automated high-throughput repeat junction-based primer design. A software tool “RJPrimers” is reported !
here. This tool first identifies potentially unique repeat junctions using BLASTN against fully annotated repeat databases and a repeat junction finding algorithm, and then designs repeat junction primers using Primer3 and BatchPrimer3. Using in silico and wet lab PCR experiments, the software was tested based on the rice genome, as well as Sanger shotgun sequences and Roche 454 reads of the diploid wheat Aegilops tauschii. Over 90% of repeat junction primers designed by RJPrimers were unique. At least one RJM marker per 10 Kb sequence of Ae. tauschii was expected with an estimate of 0.45 million such markers in a genome of 4.5Gb, providing an almost unlimited source of molecular markers for mapping large and complex genomes. A web-based server and a command line based pipeline for RJPrimers have been implemented to meet different requirements and are available at http://wheat.pw.usda.gov/demos/RJPrimers/.