
RegPrecise web services interface: programmatic access to the transcriptional regulatory interactions in bacteria reconstructed by comparative genomics.

Author: Arkin Adam P
Brettin Thomas S
Dehal Paramvir S
Dubchak Inna
Novichkov Pavel S
Novichkova Elena S
Rodionov Dmitry A
Publication venue: eScholarship, University of California
Publication date: 01/01/2012

A web services application programming interface (API) was developed to provide programmatic access to the regulatory interactions accumulated in the RegPrecise database (http://regprecise.lbl.gov), a core resource on transcriptional regulation for the microbial domain of the Department of Energy (DOE) Systems Biology Knowledgebase. RegPrecise captures and visualizes regulogs, sets of genes controlled by orthologous regulators in several closely related bacterial genomes, that were reconstructed by comparative genomics. The current release, RegPrecise 2.0, includes >1400 regulogs controlled either by protein transcription factors or by conserved ribonucleic acid regulatory motifs in >250 genomes from 24 taxonomic groups of bacteria. The reference regulons accumulated in RegPrecise can serve as a basis for automatic annotation of regulatory interactions in newly sequenced genomes. The developed API provides efficient access to the RegPrecise data through a comprehensive set of 14 web service resources. The RegPrecise web services API is freely accessible at http://regprecise.lbl.gov/RegPrecise/services.jsp with no login requirements.
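As a rough illustration of what programmatic access to such a resource might look like, the Python sketch below builds a request URL and parses a JSON reply. Everything specific in it is an assumption made for illustration: the base URL is a placeholder, and the resource name `regulogs`, the query parameters, and the JSON field names `regulog`, `regulogId` and `regulatorName` are hypothetical. The actual resources and response formats are described on the services page cited in the abstract.

```python
import json
from urllib.parse import urlencode

# Placeholder base URL (assumption); the real service root is documented
# on the RegPrecise services page, not here.
BASE_URL = "https://example.org/regprecise/rest"


def build_url(resource, **params):
    """Return the request URL for a resource, with query parameters sorted by name."""
    query = urlencode(sorted(params.items()))
    return f"{BASE_URL}/{resource}" + (f"?{query}" if query else "")


def parse_regulogs(payload):
    """Extract (id, regulator name) pairs from a JSON reply.

    The field names used here are hypothetical.
    """
    data = json.loads(payload)
    records = data.get("regulog", [])
    # Some JSON serializers emit a lone object instead of a one-element list.
    if isinstance(records, dict):
        records = [records]
    return [(r["regulogId"], r["regulatorName"]) for r in records]


if __name__ == "__main__":
    print(build_url("regulogs", collectionType="taxGroup", collectionId=1))
    sample = '{"regulog": [{"regulogId": 1, "regulatorName": "LexA"}]}'
    print(parse_regulogs(sample))
```

Keeping URL construction and response parsing in separate functions lets the parsing logic be checked against a saved sample reply without any network access.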

CiteSeerX

Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea

Author: Koonin Eugene V
Makarova Kira S
Novichkov Pavel S
Sorokin Alexander V
Wolf Yuri I
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt at such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. Results: New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover ~88% of the genes in a genome compared to a ~76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; ~40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genomes) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems. Conclusion: The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/. Reviewers: This article was reviewed by Peer Bork, Patrick Forterre, and Purificacion Lopez-Garcia.

Springer - Publisher Connector

Directory of Open Access Journals

Distinct Patterns of Expression and Evolution of Intronless and Intron-Containing Mammalian Genes

Author: Koonin Eugene V.
Novichkov Pavel S.
Ogurtsov Aleksey Y.
Shabalina Svetlana A.
Spiridonov Alexey N.
Spiridonov Nikolay A.
Publication venue: Oxford University Press
Publication date: 01/04/2010

Comparison of expression levels and breadth and evolutionary rates of intronless and intron-containing mammalian genes shows that intronless genes are expressed at lower levels, tend to be tissue specific, and evolve significantly faster than spliced genes. By contrast, monomorphic spliced genes that are not subject to detectable alternative splicing and polymorphic alternatively spliced genes show similar, statistically indistinguishable patterns of expression and evolution. Alternative splicing is most common in ancient genes, whereas intronless genes appear to have relatively recent origins. These results imply tight coupling between different stages of gene expression, in particular, transcription, splicing, and nucleocytosolic transport of transcripts, and suggest that formation of intronless genes is an important route of evolution of novel tissue-specific functions in animals.

DSpace@MIT

RegTransBase—a database of regulatory sequences and interactions in a wide range of prokaryotic genomes

Author: Arkin Adam
Cipriano Michael J.
Dubchak Inna
Gelfand Mikhail S.
Kazakov Alexei E.
Minovitsky Simon
Mironov Andrey A.
Novichkov Pavel S.
Vinogradov Dmitry V.
Publication venue: Oxford University Press
Publication date: 01/07/2006

RegTransBase is a manually curated database of regulatory interactions in prokaryotes that captures the knowledge in public scientific literature using a controlled vocabulary. Although several databases describing interactions between regulatory proteins and their binding sites are already being maintained, they either focus mostly on the model organisms Escherichia coli and Bacillus subtilis or are entirely computationally derived. RegTransBase describes a large number of regulatory interactions reported in many organisms and contains the following types of experimental data: the activation or repression of transcription by an identified direct regulator, determining the transcriptional regulatory function of a protein (or RNA) directly binding to DNA (RNA), mapping or prediction of a binding site for a regulatory protein and characterization of regulatory mutations. Currently, RegTransBase content is derived from about 3000 relevant articles describing over 7000 experiments in relation to 128 microbes. It contains data on the regulation of about 7500 genes and evidence for 6500 interactions with 650 regulators. RegTransBase also contains manually created position weight matrices (PWM) that can be used to identify candidate regulatory sites in over 60 species. RegTransBase is available at
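The use of position weight matrices to find candidate regulatory sites, mentioned in the abstract above, can be sketched in Python as follows. This is a generic log-odds PWM scan and not the RegTransBase implementation: the uniform background frequency, the pseudocount value, the example sites, and all function names are assumptions chosen for illustration.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=0.5, background=0.25):
    """Build a log-odds PWM from equal-length aligned binding sites.

    A uniform background and a fixed pseudocount are assumed here.
    """
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + pseudocount * len(BASES)
    pwm = []
    for pos in range(length):
        column = [s[pos].upper() for s in sites]
        pwm.append({
            b: math.log2(((column.count(b) + pseudocount) / total) / background)
            for b in BASES
        })
    return pwm


def scan(sequence, pwm):
    """Score every window of the sequence; return (start, window, score) tuples."""
    sequence = sequence.upper()
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        if any(ch not in BASES for ch in window):
            continue  # skip windows containing ambiguous characters such as N
        score = sum(pwm[i][ch] for i, ch in enumerate(window))
        hits.append((start, window, score))
    return hits


def best_hit(sequence, pwm):
    """Return the highest-scoring window, or None if nothing could be scored."""
    hits = scan(sequence, pwm)
    return max(hits, key=lambda h: h[2]) if hits else None


if __name__ == "__main__":
    # Made-up example sites, not taken from any database.
    pwm = build_pwm(["TTGACA", "TTGACA", "TTGACT", "TTTACA"])
    print(best_hit("GGGTTGACAGGG", pwm))
```

In practice a score threshold would be applied to the output of `scan`, and both DNA strands would be searched; both are omitted here to keep the sketch short.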

UNT Digital Library

RegPrecise: a database of curated genomic inferences of transcriptional regulatory interactions in prokaryotes

Author: Adam P. Arkin
Dmitry A. Rodionov
Elena S. Novichkova
Inna Dubchak
Mikhail S. Gelfand
Olga N. Laikova
Pavel S. Novichkov
Publication venue: Oxford University Press
Publication date: 01/01/2010

The RegPrecise database (http://regprecise.lbl.gov) was developed for capturing, visualization and analysis of predicted transcription factor regulons in prokaryotes that were reconstructed and manually curated by utilizing the comparative genomic approach. A significant number of high-quality inferences of transcriptional regulatory interactions have already been accumulated for diverse taxonomic groups of bacteria. The reconstructed regulons include transcription factors, their cognate DNA motifs and regulated genes/operons linked to the candidate transcription factor binding sites. RegPrecise allows for browsing the regulon collections for: (i) conservation of DNA binding sites and regulated genes for a particular regulon across diverse taxonomic lineages; (ii) sets of regulons for a family of transcription factors; (iii) repertoire of regulons in a particular taxonomic group of species; (iv) regulons associated with a metabolic pathway or a biological process in various genomes. The initial release of the database includes ∼11 500 candidate binding sites for ∼400 orthologous groups of transcription factors from over 350 prokaryotic genomes. The majority of these data are represented by genome-wide regulon reconstructions in the Shewanella and Streptococcus genera and a large-scale prediction of regulons for the LacI family of transcription factors. Another section in the database represents the results of accurate regulon propagation to closely related genomes.

Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria

Author: Dmitry A Rodionov
Eric I Sun
Marat D Kazanov
Milton H Saier
Pavel S Novichkov
Semen A Leyn
Publication venue: Springer Science and Business Media LLC
Publication date: 02/09/2013

BACKGROUND: In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels. An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. RESULTS: A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. CONCLUSIONS: The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome-wide collection of reference RNA motif regulons is available in the RegPrecise database (http://regprecise.lbl.gov/).

Springer - Publisher Connector