Location: Corn Insects and Crop Genetics Research2012 Annual Report
1a. Objectives (from AD-416):
Objective 1: Implement web-accessible computational and visualization tools, including semantic web technologies, to enable comparison and transfer of agronomically important genetic information among soybean and other legume and related dicot species. Objective 2: Continue to curate and enhance SoyBase and the Soybean Breeder’s Toolbox (SBT), more fully integrating the genetic, phenotypic, physical map, and whole-genome sequence data from soybean and other legumes. Objective 3: Coordinate the quality assembly and annotation of the soybean whole-genome sequence.
1b. Approach (from AD-416):
Soybean ontologies will be prepared to describe selected data types from the Soybean Breeders Toolbox (SBT). Data exchange descriptions (“RDF graphs”) will be developed to allow integration of the data into the Virtual Plant Information Network (VPIN). To let researchers transparently find, retrieve, or apply analytical methods to data contained in the SBT, web services will be developed to make these services accessible through a single portal. Soybase and the SBT will be maintained and updated with new data classes as needed. The Williams 82 physical map and the soybean whole genome sequence, new sequence-based data types in SoyBase, and comparative data from other legume will be integrated and displayed. The project works closely with DOE-JGI to enhance the quality of the soybean whole-genome sequence assembly. This will include analysis of sequence-based genetic markers, comparative analyses with other genomes, and various informatic analyses.
3. Progress Report:
Ontologies have been converted into Open Biological Ontologies (OBO) format and are accessible over the web using the Genetic Model Organism Database (GMOD) AmiGO viewer. This program displays and allows free text searches of the SoyBase ontologies. It is accessible at the SoyBase web site at http://soybase.org/ontology.php. In addition, the SoyBase ontologies were also made viewable and searchable at the OBO Foundry website, BioPortal (http://bioportal.bioontology.org/ontologies/47150?p=terms) and at the Generation Challenge Program Crop Ontology website (http://www.cropontology.org/ontology/SOY/SoyBase Soybean Trait Ontology). The existing SoyBase Simple Semantic Web Architechture and Protocol (SSWAP) services have been upgraded and were incorporated into the iPlant Collaborative’s Semantic Discovery Environment. SoyBase staff curated 44 Quantitative Trait Locus (QTL) references from the literature, representing 844 new QTL records added to the Soybean Breeders Toolbox and to the Soybean Composite Genetic map. This increases the resolution of QTL regions facilitating the discovery of genes responsible for agronomically important traits. Added Williams82 minimum tiling path Bacterial Artificial Chromosome (BAC) clones to the SoyBase genome sequence browser. This allows users to relate a phenotypic character to a BAC clone for further sequence analysis. The SoyBase genome viewer was updated with newly annotated soybean genes. SoyBase staff also added 50,574 National Center for Biotechnology Information (NCBI) RefSeq gene models to the SoyBase genome viewer. These gene models were predicted using a different algorithm and provide additional support to some Joint Genome Institute (JGI) gene models as well as indicate regions of the genomic sequence that need focused attention. Compared the soybean genome against the genome sequence of common bean, for improvement of both genomes. Locations for improvement of soybean for the next round of genome assembly were identified. SoyBase and Legume Information System (LIS) personnel worked with Department of Energy-Joint Genome Institute (DOE-JGI) to revise the soybean gene annotations. In the allied LIS, added features for whole-genome searches and comparisons among each of the sequenced legume genomes: soybean, pigeonpea, Medicago truncatula, and Lotus japonicus. Sequence-search matches to soybean lead to the SoyBase GBrowse viewer for close-up exploration. For improved performance and features, the genome browser software was updated from GBrowse 1 to GBrowse 2. In the genome browser, added these data sets: whole-genome comparisons with pigeonpea; gene sequences and genetic markers from pigeonpea; RNA-Seq gene expression data from a soybean/pathogen experiment; a new set of gene predictions from NCBI. For improved performance and features, updated the genome browser software from GBrowse 1 to GBrowse 2. Worked with DOE-JGI and other soybean researchers to evaluate a revised set of soybean gene predictions. Promoted approximately 1,300 gene models into the revised set, which would not have been included without our review.
1. Gene sequencing and genetic markers for chickpea and pigeonpea, toward improved drought tolerance. Global food security requires crops that are nutritious, versatile, and capable of growing in a wide range of challenging growing environments. ARS researchers in Ames, IA working with the SoyBase and Legume Information System have partnered with scientists at the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) in India to determine essentially all of the gene sequences for chickpea and pigeonpea, and to design genetic markers for plant breeding work. This work has been used to assist breeders to develop more drought-tolerant crop varieties. Chickpea is grown worldwide, from India to the northern U.S. and Canada, and has a large domestic and international market. Pigeonpea, a protein-rich and highly drought-tolerant relative of soybean, is widely used in India and the Caribbean, and may be suitable as a crop in the southern U.S.
2. Genome sequencing of Medicago truncatula (Barrel medic). Medicago truncatula, a close relative of alfalfa and a more distant relative of soybean and other beans, is used as a biological model because of characteristics such as its small size, short generation time, and relatively simple genetics. ARS researchers in Ames, IA with the SoyBase and Legume Information System have worked with an international team to report the essentially complete genome sequence (the set of all DNA letters in the chromosomes) of M. truncatula. Analysis of the genome sequence and the contained genes shows that although the M. truncatula genome is less than half the size of the soybean genome, it nevertheless has nearly as many genes as soybean, and has accumulated many changes (large and small) since the two species diverged from their common ancestor. Analysis of genes involved in nitrogen fixation shows that this capacity probably evolved from older genes that were involved in plant-fungal interactions. The genome sequence and other findings are expected to have large impacts on the efforts of plant breeders and biotechnologists to develop crop varieties that are able to more efficiently "fix" atmospheric nitrogen to make their own fertilizer, rather than relying on supplemental nitrogen fertilizer which is expensive and can run-off into the environment.
Cannon, S.B., Crow, J.A., Grant, D.M. 2012. SoyBase and the legume information system: accessing information about the soybean and other legume genomes. In: Wilson, R. editor. Designing Soybeans for 21st Century Markets. Urbana, IL: AOCS Press. p. 53-66.