Directory of Open Access Journals

The Francis Crick Institute

STRING and STITCH: known and predicted interactions between proteins and chemicals

Author: Christian von Mering
Lars J. Jensen
Manuel Stark
Michael Kuhn
Peer Bork
Samuel Chaffron
Publication date: 06/09/2008

Information on protein-protein and protein-chemical interactions is essential for understanding cellular functions. The STRING and STITCH web resources integrate interaction evidence derived from pathways, automatic literature mining, primary experimental data, and genomic context. The resulting interaction networks cover 1.5 million proteins from 373 organisms and 68,000 chemicals.

Nature Precedings

Just-in-time assembly of cell-cycle protein complexes

Author: Lars J. Jensen
Peer Bork
Sø
Thomas S. Jensen
Ulrik de Lichtenberg
Publication date: 09/09/2008

Our comparative analysis of eukaryotic cell-cycle complexes reveals that the identity of the periodically expressed subunits differs significantly between organisms and is often mirrored by changes in cell-cycle-dependent phosphorylation of the protein products. This indicates that many different solutions have evolved for just-in-time assembly of the same molecular machines.

Nature Precedings

eggNOG: automated construction and annotation of orthologous groups of genes

Author: Bork P.
Doerks T.
Jensen L J.
Julien P.
Kuhn M.
Muller J.
von Mering C.
Publication date: 02/08/2017

The identification of orthologous genes forms the basis for most comparative genomics studies. Existing approaches either lack functional annotation of the identified orthologous groups, hampering the interpretation of subsequent results, or are manually annotated and thus lag behind the rapid sequencing of new genomes. Here we present the eggNOG database ('evolutionary genealogy of genes: Non-supervised Orthologous Groups'), which contains orthologous groups constructed from Smith-Waterman alignments through identification of reciprocal best matches and triangular linkage clustering. Applying this procedure to 312 bacterial, 26 archaeal and 35 eukaryotic genomes yielded 43 582 coarse-grained orthologous groups, of which 9724 are extended versions of those from the original COG/KOG database. We also constructed more fine-grained groups for selected subsets of organisms, such as the 19 914 mammalian orthologous groups. We automatically annotated our non-supervised orthologous groups with functional descriptions, which were derived by identifying common denominators for the genes based on their individual textual descriptions, annotated functional categories, and predicted protein domains. The orthologous groups in eggNOG contain 1 241 751 genes and provide at least a broad functional description for 77% of them. Users can query the resource for individual genes via a web interface or download the complete set of orthologous groups at http://eggnog.embl.d
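
The two steps named in this abstract, reciprocal best matches and triangular linkage clustering, can be illustrated with a minimal Python sketch. This is not the eggNOG pipeline: the input format (a dictionary of pairwise alignment scores keyed by `species|gene` names), the function names, and the rule for merging triangles (a triangle joins a cluster when it shares at least two genes with it) are all assumptions made for illustration.

```python
from itertools import combinations


def species(gene):
    """Genes are named 'species|gene' in this sketch (an assumed convention)."""
    return gene.split("|")[0]


def best_hits(scores):
    """Map (query gene, target species) to the (best target gene, score)."""
    best = {}
    for (a, b), s in scores.items():
        for query, target in ((a, b), (b, a)):
            if species(query) == species(target):
                continue  # only cross-species matches are considered here
            key = (query, species(target))
            if key not in best or s > best[key][1]:
                best[key] = (target, s)
    return best


def reciprocal_best_matches(scores):
    """Return gene pairs that are each other's best hit across two species."""
    best = best_hits(scores)
    pairs = set()
    for (query, _), (target, _) in best.items():
        back = best.get((target, species(query)))
        if back is not None and back[0] == query:
            pairs.add(frozenset((query, target)))
    return pairs


def triangle_clusters(rbm):
    """Seed clusters from triangles of reciprocal best matches and merge
    any triangle into clusters with which it shares at least two genes."""
    genes = sorted({g for pair in rbm for g in pair})
    triangles = [
        frozenset(t)
        for t in combinations(genes, 3)
        if all(frozenset(p) in rbm for p in combinations(t, 2))
    ]
    clusters = []
    for tri in triangles:
        merged, rest = set(tri), []
        for cluster in clusters:
            if len(cluster & tri) >= 2:
                merged |= cluster
            else:
                rest.append(cluster)
        clusters = rest + [merged]
    return clusters


# Toy scores (invented values): A|1, B|1 and C|1 form a triangle,
# while A|2 and B|2 are reciprocal best matches without a third partner.
scores = {
    ("A|1", "B|1"): 100, ("A|1", "C|1"): 90, ("B|1", "C|1"): 95,
    ("A|2", "B|2"): 70, ("A|2", "B|1"): 20, ("A|1", "B|2"): 15,
}
rbm = reciprocal_best_matches(scores)
print(triangle_clusters(rbm))
```

In this toy input the pair A|2/B|2 is a reciprocal best match but never enters a cluster, because no third species closes the triangle. Enumerating all gene triples is only workable for small examples; a real implementation would walk the match graph instead.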

RERO DOC Digital Library

STITCH 3: zooming in on protein–chemical interactions

Author: A. Franceschini
Berman
C. von Mering
Chen
D. Szklarczyk
Jensen
Kalinina
Kapitzky
Kuhn
L. J. Jensen
M. Kuhn
Okuno
P. Bork
Rognan
Roth
Publication venue: Oxford University Press
Publication date: 01/01/2011

To facilitate the study of interactions between proteins and chemicals, we have created STITCH, an aggregated database of interactions connecting over 300 000 chemicals and 2.6 million proteins from 1133 organisms. Compared to the previous version, the number of chemicals with interactions and the number of high-confidence interactions both increase 4-fold. The database can be accessed interactively through a web interface, displaying interactions in an integrated network view. It is also available for computational studies through downloadable files and an API. As an extension in the current version, we offer the option to switch between two levels of detail, namely whether stereoisomers of a given compound are shown as a merged entity or as separate entities. Separate display of stereoisomers is necessary, for example, for carbohydrates and chiral drugs. Combining the isomers increases the coverage, as interaction databases and publications found through text mining will often refer to compounds without specifying the stereoisomer. The database is accessible at http://stitch.embl.de/

CiteSeerX

Copenhagen University Research Information System

ZORA

STITCH 4: integration of protein-chemical interactions with user data

Author: Blicher Thomas H.
Bork Peer
Jensen Lars J.
Kuhn Michael
Pletscher-Frankild Sune
Szklarczyk Damian
von Mering Christian
Publication date: 02/08/2017

STITCH is a database of protein-chemical interactions that integrates many sources of experimental and manually curated evidence with text-mining information and interaction predictions. Available at http://stitch.embl.de, the resulting interaction network includes 390 000 chemicals and 3.6 million proteins from 1133 organisms. Compared with the previous version, the number of high-confidence protein-chemical interactions in human has increased by 45%, to 367 000. In this version, we added features for users to upload their own data to STITCH in the form of internal identifiers, chemical structures or quantitative data. For example, a user can now upload a spreadsheet with screening hits to easily check which interactions are already known. To increase the coverage of STITCH, we expanded the text mining to include full-text articles and added a prediction method based on chemical structures. We further changed our scheme for transferring interactions between species to rely on orthology rather than protein similarity. This improves the performance within protein families, where scores are now transferred only to orthologous proteins, but not to paralogous proteins. STITCH can be accessed with a web interface, an API and downloadable files.
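
The orthology-based transfer described in this abstract can be pictured with a small Python sketch. It is a hypothetical illustration, not the STITCH implementation: the data layout, the function name `transfer_by_orthology`, and the choice to copy scores unchanged (keeping the maximum when several sources agree) are assumptions, and the abstract does not say how transferred scores are weighted.

```python
def transfer_by_orthology(interactions, group_of, species_of, target_species):
    """Copy chemical-protein scores to target-species proteins that share
    an orthologous group with the source protein.

    interactions: iterable of (chemical, protein, score) from other species
    group_of:     protein -> orthologous group id (paralogs get different ids)
    species_of:   protein -> species name
    """
    # Index the target species' proteins by orthologous group.
    members = {}
    for protein, group in group_of.items():
        if species_of[protein] == target_species:
            members.setdefault(group, []).append(protein)

    transferred = {}
    for chemical, protein, score in interactions:
        if species_of[protein] == target_species:
            continue  # already a direct observation, nothing to transfer
        for target in members.get(group_of.get(protein), []):
            key = (chemical, target)
            # Keep the strongest evidence if several sources map here.
            transferred[key] = max(score, transferred.get(key, 0.0))
    return transferred


# Toy data (invented): P1 is the human ortholog of mouse M1, whereas P2 is
# a paralog in the same family but a different orthologous group.
species_of = {"M1": "mouse", "M2": "mouse", "P1": "human", "P2": "human"}
group_of = {"M1": "OG1", "P1": "OG1", "M2": "OG2", "P2": "OG2"}
interactions = [("chemX", "M1", 0.9)]

print(transfer_by_orthology(interactions, group_of, species_of, "human"))
```

A similarity-based scheme would also have passed the score to P2 if its sequence resembled M1 closely enough; keying on the orthologous group excludes it.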

RERO DOC Digital Library

The room acoustic rendering equation

Author: Bork I.
Cox T. J.
Dutré P.
Immel D. S.
Jensen H. W.
Kuttruff H.
Lauri Savioja
Sami Kiminki
Samuel Siltanen
Svensson U.
Tapio Lokki
Publication venue: Acoustical Society of America (ASA)

High-resolution transcription atlas of the mitotic cell cycle in budding yeast.

Author: Bork Peer
Granovskaia Marina V
Huber Wolfgang
Jensen Lars J
Ning Ye
Ritchie Matthew E
Steinmetz Lars M
Toedling Joern
Publication venue: Genome Biol
Publication date: 01/01/2010

RIGHTS: This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license, which is similar to the 'Creative Commons Attribution Licence'. In brief, you may copy, distribute, and display the work; make derivative works; or make commercial use of the work, under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.

BACKGROUND: Extensive transcription of non-coding RNAs has been detected in eukaryotic genomes and is thought to constitute an additional layer in the regulation of gene expression. Despite this role, their transcription through the cell cycle has not been studied; genome-wide approaches have only focused on protein-coding genes. To explore the complex transcriptome architecture underlying the budding yeast cell cycle, we used 8 bp tiling arrays to generate a 5-minute-resolution, strand-specific expression atlas of the whole genome.

RESULTS: We discovered 523 antisense transcripts, of which 80 cycle or are located opposite periodically expressed mRNAs, 135 unannotated intergenic non-coding RNAs, of which 11 cycle, and 109 cell-cycle-regulated protein-coding genes that had not previously been shown to cycle. We detected periodic expression coupling of sense and antisense transcript pairs, including antisense transcripts opposite key cell-cycle regulators, like FAR1 and TAF2.

CONCLUSIONS: Our dataset presents the most comprehensive resource to date on gene expression during the budding yeast cell cycle. It reveals periodic expression of both protein-coding and non-coding RNA and profiles the expression of non-annotated RNAs throughout the cell cycle for the first time. This data enables hypothesis-driven mechanistic studies concerning the functions of non-coding RNAs.

Copenhagen University Research Information System

Apollo (Cambridge)

University of Melbourne Institutional Repository

Systematic Association of Genes to Phenotypes by Genome and Literature Mining

Author: Andrade Miguel A
Bork Peer
Doerks Tobias
Hooper Sean D
Jensen Lars J
Kaczanowski Szymon
Korbel Jan O
Perez-Iratxeta Carolina
Publication venue: Public Library of Science
Publication date: 05/04/2005

One of the major challenges of functional genomics is to unravel the connection between genotype and phenotype. So far no global analysis has attempted to explore those connections in the light of the large phenotypic variability seen in nature. Here, we use an unsupervised, systematic approach for associating genes and phenotypic characteristics that combines literature mining with comparative genome analysis. We first mine the MEDLINE literature database for terms that reflect phenotypic similarities of species. Subsequently we predict the likely genomic determinants: genes specifically present in the respective genomes. In a global analysis involving 92 prokaryotic genomes we retrieve 323 clusters containing a total of 2,700 significant gene–phenotype associations. Some clusters contain mostly known relationships, such as genes involved in motility or plant degradation, often with additional hypothetical proteins associated with those phenotypes. Other clusters comprise unexpected associations; for example, a group of terms related to food and spoilage is linked to genes predicted to be involved in bacterial food poisoning. Among the clusters, we observe an enrichment of pathogenicity-related associations, suggesting that the approach reveals many novel genes likely to play a role in infectious diseases.

Public Library of Science (PLOS)

Directory of Open Access Journals