12 research outputs found
The RAST Server: Rapid Annotations using Subsystems Technology
<p>Abstract</p> <p>Background</p> <p>The number of prokaryotic genome sequences becoming available is growing steadily and is growing faster than our ability to accurately annotate them.</p> <p>Description</p> <p>We describe a fully automated service for annotating bacterial and archaeal genomes. The service identifies protein-encoding, rRNA and tRNA genes, assigns functions to the genes, predicts which subsystems are represented in the genome, uses this information to reconstruct the metabolic network and makes the output easily downloadable for the user. In addition, the annotated genome can be browsed in an environment that supports comparative analysis with the annotated genomes maintained in the SEED environment.</p> <p>The service normally makes the annotated genome available within 12–24 hours of submission, but ultimately the quality of such a service will be judged in terms of accuracy, consistency, and completeness of the produced annotations. We summarize our attempts to address these issues and discuss plans for incrementally enhancing the service.</p> <p>Conclusion</p> <p>By providing accurate, rapid annotation freely to the community we have created an important community resource. The service has now been utilized by over 120 external users annotating over 350 distinct genomes.</p
The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes
The release of the 1000(th) complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. This data comes from 173 subsystems and 383 different organisms
Recommended from our members
An integrative approach to energy, carbon, and redox metabolism in the cyanobacterium Synechocystis sp. PCC 6803
The team of the Fellowship for Interpretation of Genomes (FIG) under the leadership of Ross Overbeek, began working on this Project in November 2003. During the previous year, the Project was performed at Integrated Genomics Inc. A transition from the industrial environment to the public domain prompted us to adjust some aspects of the Project. Notwithstanding the challenges, we believe that these adjustments had a strong positive impact on our deliverables. Most importantly, the work of the research team led by R. Overbeek resulted in the deployment of a new open source genomic platform, the SEED (Specific Aim 1). This platform provided a foundation for the development of CyanoSEED a specialized portal to comparative analysis and metabolic reconstruction of all available cyanobacterial genomes (Specific Aim 3). The SEED represents a new generation of software for genome analysis. Briefly, it is a portable and extendable system, containing one of the largest and permanently growing collections of complete and partial genomes. The complete system with annotations and tools is freely available via browsing or via installation on a user's Mac or Linux computer. One of the important unique features of the SEED is the support of metabolic reconstruction and comparative genome analysis via encoding and projection of functional subsystems. During the project period, the FIG research team has validated the new software by developing a significant number of core subsystems, covering many aspects of central metabolism (Specific Aim 2), as well as metabolic areas specific for cyanobacteria and other photoautotrophic organisms (Specific Aim 3). In addition to providing a proof of technology and a starting point for further community-based efforts, these subsystems represent a valuable asset. An extensive coverage of central metabolism provides the bulk of information required for metabolic modeling in Synechocystis sp.PCC 6803. Detailed analysis of several subsystems covering energy, carbon, and redox metabolism in the Synechocystis sp. PCC 6803 and other cyanobacteria has been performed (Specific Aim 4). The main objectives for this year (adjusted to reflect a new, public domain, setting of the Project research team) were: Aim 1. To develop, test, and deploy a new open source system, the SEED, for integrating community-based annotation, and comparative analysis of all publicly available microbial genomes. Develop a comprehensive genomic database by integrating within SEED all publicly available complete and nearly complete genome sequences with special emphasis on genomes of cyanobacteria, phototrophic eukaryotes, and anoxygenic phototrophic bacteria--invaluable for comparative genomic studies of energy and carbon metabolism in Synechocystis sp. PCC 6803. Aim 2. To develop the SEED's biological content in the form of a collection of encoded Subsystems largely covering the conserved cellular machinery in prokaryotes (and central metabolic machinery in eukaryotes). Aim 3. To develop, utilizing core SEED technology, the CyanoSEED--a specialized WEB portal for community-based annotation, and comparative analysis of all publicly available cyanobacterial genomes. Encode the set of additional subsystems representing key metabolic transformations in cyanobacteria and other photoautotrophs. We envisioned this resource as complementary to other public access databases for comparative genomic analysis currently available to the cyanobacterial research community. Aim 4. Perform in-depth analysis of several subsystems covering energy, carbon, and redox metabolism in the Synechocystis sp. PCC 6803 and all other cyanobacteria with available genome sequences. Reveal inconsistencies and gaps in the current knowledge of these subsystems. Use functional and genome context analysis tools in CyanoSEED to predict, whenever possible, candidate genes for inferred functional roles. To disseminate freely these conjectures and predictions by publishing them on CyanoSEED (http://cyanoseed.thefig.info/) and the Subsystems Forum (http://brucella.uchicago.edu/SubsystemForum/) in order to facilitate experimental analysis by our collaborator on this Project and by other experimentalists working in various field of cyanobacterial physiology and biotechnology
Relationship between hepatic phenotype and changes in gene expression in cytochrome P450 reductase (POR) null mice
Cytochrome P450 reductase is the unique electron donor for microsomal cytochrome P450s; these enzymes play a major role in the metabolism of endogenous and xenobiotic compounds. In mice with a liver-specific deletion of cytochrome P450 reductase, hepatic cytochrome P450 activity is ablated, with consequent changes in bile acid and lipid homoeostasis. In order to gain insights into the metabolic changes resulting from this phenotype, we have analysed changes in hepatic mRNA expression using microarray analysis and real-time PCR. In parallel with the perturbations in bile acid levels, changes in the expression of key enzymes involved in cholesterol and lipid homoeostasis were observed in hepatic cytochrome P450 reductase null mice. This was characterized by a reduced expression of Cyp7b1, and elevation of Cyp7a1 and Cyp8b1 expression. The levels of mRNAs for other cytochrome P450 genes, including Cyp2b10, Cyp2c29, Cyp3a11 and Cyp3a16, were increased, demonstrating that endogenous factors play a role in regulating the expression of these proteins and that the increases are due, at least in part, to altered levels of transcripts. In addition, levels of mRNAs encoding genes involved in glycolysis and lipid transport were also increased; the latter may provide an explanation for the increased hepatic lipid content observed in the hepatic null mice. Serum testosterone and oestradiol levels were lowered, accompanied by significantly decreased expression of Hsd3b2 (3β-hydroxy-Δ(5)-steroid dehydrogenase-2), Hsd3b5 (3β-hydroxy-Δ(5)-steroid dehydrogenase-5) and Hsd11b1 (11β-hydroxysteroid dehydrogenase type 1), key enzymes in steroid hormone metabolism. These microarray data provide important insights into the control of metabolic pathways by the cytochrome system