Plant Gene Expression Center  Albany_CA Site Logo
ARS Home About Us Helptop nav spacerContact Us En Espanoltop nav spacer
Printable VersionPrintable Version     E-mail this pageE-mail this page
Agricultural Research Service United States Department of Agriculture
Search
  Advanced Search
 
Programs and Projects
Subjects of Investigation
 

Title: ANALYSIS OF THE GENOME SEQUENCE OF THE FLOWERING PLANT ARABIDOPSIS THALIANA

Authors
item The Arabidopsis Geno, Genome Init - INST GENOMIC RES ROCKVILL
item Kazusa DNA Res, Institute - CHIBA JAPAN
item Ecker, Joseph - UNIV PA PHILA PA
item Theologis, Athanasios
item Davis, Ronald - STANFORD GEN TECH CTR

Submitted to: Nature
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: November 1, 2000
Publication Date: December 14, 2000
Citation: The Arabidopsis Geno, G., Kazusa Dna Res, I., Ecker, J., Theologis, A., Davis, R.W. 2000. Analysis of the genome sequence of the flowering plant arabidopsis thaliana. Nature, 408 796-815.

Interpretive Summary: This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

Technical Abstract: The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans¿ the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

   
 
 
Last Modified: 05/20/2013
ARS Home | USDA.gov | Site Map | Policies and Links 
FOIA | Accessibility Statement | Privacy Policy | Nondiscrimination Statement | Information Quality | USA.gov | White House